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Award for Distinguished Service to Leon Henkin 


HENRY L. ALDER University of California, Davis, CA 95616 


IVAN NIVEN University of Oregon, Eugene, OR 97403 


The 1990 Yueh-Gin Gung and Dr. Charles Y. Hu Award for Distinguished 
Service to Mathematics is presented to Leon Henkin for his leadership in so many 
different areas of mathematics education and his initiatives to increase opportunities 
for minorities and women to succeed in mathematics. 

As a professor of mathematics at the University of California, Berkeley, Leon 
Henkin has an outstanding record of scholarly work in his field, logic and the 
foundations of mathematics. As a result, he was elected President of the Association 
for Symbolic Logic for a 3-year term. He served two terms as chair of his 
department, one of the largest in the country. Yet he has found the time and energy 
to make a great variety of contributions to the improvement of mathematical 
instruction at the precollege and college levels. He regards these activities as an 
integral part of his work as a university professor. He was one of the pioneers to 
recognize the importance of mathematicians involving themselves actively in the 
improvement of mathematics education at all levels. The following is only a sample 
of his many contributions. 

In 1964, Leon Henkin and Jerzy Neyman started a program at Berkeley to 
increase the number of minority students entering college from Bay Area high 
schools. In 1974, this was formally established as the Professional Development 
Program (PDP), designed to increase the number of high-achieving minority under- 
graduates going on to graduate or professional study leading to mathematics related 
careers; it is operated by a committee of the Academic Senate. 

In 1969, Leon Henkin headed a statewide University of California program, still 
going, which sends graduate students in mathematics to teach advanced mathemati- 
cal topics in elementary school classrooms on a daily basis. In 1974, Robert Davis 
and Leon Henkin led an NSF program to identify and develop mathematical 
leaders among teachers in each elementary and middle school in Berkeley. 

In addition to his students in the graduate mathematics program at the Univer- 
sity of California, Leon Henkin has had five Ph.D. students over the past twenty 
years in an interdisciplinary program in Mathematics and Science Education. One 
of these students, Uri Treisman, won the $50,000 Dana Award in Higher Education 
in 1987 for his successful design of PDP Mathematical Workshops, which dramati- 
cally increased the number of high-achieving calculus students from underrepre- 
sented minority groups. 

Leon Henkin played a central role in the development of the Bay Area Mathe- 
matics Project (BAMP), one of the early collaborative projects linking students, 
teachers, parents, administrators, and community leaders to improve jointly the 
teaching of mathematics in the schools. Its remarkable effectiveness has made it a 
model for many such collaborative projects throughout the nation. 

Fortunately, when BAMP started in the summer of 1983 with the first group of 
50 teachers, he served on the U.S. Commission on Mathematical Instruction 
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(USCMI). At that time, USCMI’s almost exclusive focus was on matters pertaining 
to the meetings of the International Congress on Mathematical Education (ICME), 
held quadrennially. He urged the commission to broaden its scope by supporting a 
national program to create projects like BAMP in many regions. The USCMI 
supported his idea. 

This is what led to the now-thriving American Mathematics Project (AMP), an 
organization designed to establish collaborative mathematics projects of the 
BAMP-type throughout the country. In its short existence, it has already given 
birth to at least 20 new projects, established a project office at the University of 
California, Berkeley, to assist the activities of these projects, and publishes a 
quarterly (AMP-LINE). 

As a result of his leadership in establishing AMP, Leon Henkin has been called 
“The Father of the American Mathematics Project.” 

In spite of these many time-consuming activities, Leon Henkin has found time to 
expand his earlier initiatives to increase opportunities for minority and women 
students. In 1989, together with Uri Treisman, he organized a six-week summer 
program at the University of California, Berkeley, for such students who have 
completed two years of undergraduate mathematics courses beginning with calculus. 
This nationwide program, designed to increase the number of minority and women 
students seeking careers requiring a Ph.D. in mathematics, is supported by a grant 
from the Alfred P. Sloan Foundation. In his own department he continues to serve 
on the Mathematics Opportunity Committee, which has contributed significantly to 
the number of Ph.D. degrees granted to minority and women mathematicians since 
he helped to found the committee 25 years ago. 

In addition, Leon Henkin has made significant contributions as a member of 
many key committees of the MAA and other organizations. He served with 
distinction on the Committee on the Undergraduate Program in Mathematics 
(CUPM) from 1962 to 1967, its Panel on Pre-Graduate Training from 1962 to 1965, 
the Committee on Educational Media from 1965 to 1967, among many others, 
including committees of AMS, CBMS, and NCTM. 

Leon Henkin has, however, not confined his interest to projects at the national 
level. Recognizing the important benefits of close collaboration on all these activi- 
ties also at the international level, he, together with John Kelley, took the initiative 
to bring the Fourth International Congress on Mathematical Education to the 
University of California at Berkeley in 1980. Participants from 92 countries at- 
tended the congress, the first ICME held outside Europe. 

Leon Henkin is well known also for being an outstanding expositor of mathemat- 
ics, both as a lecturer (in person and on film) and as a writer. He was honored with 
the 1964 Chauvenet Prize for outstanding expository writing for his paper “Are 
Logic and Mathematics Identical?,” published in Science, 138(1962), 788-794. 

Few individuals of our era have had a greater impact on the health of American 
mathematics than has Leon Henkin. For this, all of us owe him our deepest 
gratitude. Those of us fortunate enough to have been associated with him know that 
his service to mathematics will continue with undiminished vigor. 

It is, indeed, fortunate that the first Award for Distinguished Service to Mathe- 
matics with its new title and its substantially increased financial award is given to an 
individual with such an exceptionally rare combination of talents that have bene- 
fited mathematics so enormously. 


The Bitangent Sphere Problem 


PETER J. GIBLIN, The University of Liverpool, Liverpool L69 3BX, England 
DONAL B. O’SHEA, Mount Holyoke College, South Hadley, MA 01075 


PETER J. GIBLIN: I did my graduate work in algebraic geometry and topology 
at King’s College, London, and moved to the University of Liverpool in 
1967. I have been there ever since, apart from years as visiting professor at 
Chapel Hill, N.C., and Amherst, Mass. It was in Amherst that I met Don 
O’Shea and somehow we couldn’t let the bitangent sphere problem alone. My 
research interests now are in differential geometry, singularity theory, and 
computer vision: I am interested particularly in applying modern geometry 
to the problems of understanding how we perceive shape and symmetry in 
the world. I am also chairman of Mathematical Education on Merseyside, 
which, with industrial and business sponsorship, promotes competitions in 
mathematics for school children in the Liverpool area. 


DONAL B. O’SHEA: I did my undergraduate work at Harvard and my 
graduate work at Queen’s University at Kingston, Canada, under the super- 
vision of A. J. Coleman. Except for a year each at I.LH.ES. and the 
University of Massachusetts, I have been at Mount Holyoke College since 
receiving my Ph.D. in 1981. My research interests are in the algebraic and 
differential geometry of singular complex analytic varieties, but, like Peter, I 
am interested in geometry, broadly defined, and its interplay with other areas 
of mathematics and science. I have served as chairman of the Valley 
Geometry Seminar and the Five College Applied Mathematics Group. 


In this article, we shall introduce and solve a problem which had its origins in 
shape recognition. Our chief motivation for so doing is that the problem is an 
attractive and somewhat unusual one, and the solutions which we have found 
provide pleasant and instructive illustrations of techniques from differential topol- 
ogy. Thus, construction of differentiable manifolds (actually surfaces), vector fields, 
indices, “blowing up,” and Morse functions will all make an appearance, some in 
more important roles than others. 

Here is the problem: choose a point p, on a (smooth, closed, embedded) surface 
M in R?. Does there always exist a sphere which is tangent to M at p, and at some 
other point p © M? 

Here is the answer: no. But the answer is very nearly yes, for there is always a 
sphere or plane having the required property; see §2.4 below for examples in which 
no sphere exists, but a plane does. 

Here is a line of thought: start with a tiny sphere inside M, tangent to M at po, 
and pump it up until it touches M somewhere else! 

Many promising arguments based on this have come to grief. Almost irresistible 
is the temptation to rush on and triumplantly let p © M be the first point at which 
the pumped up sphere again touches M, thereby exhibiting a sphere and a point 
showing that the answer to the problem is yes. Other traps await those wary enough 
to reject the temptation, but unwilling to totally abandon the proffered line of 
thought. In fact, the first author made a habit for several years of asking nearly 
every friendly mathematician he met for a solution of the problem. Nearly all of 
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Fic. 1. 


them came back rapidly with a solution, too, and the dossier of false arguments 
bears witness to the problem’s subtlety. There is an obscure and almost certainly 
false argument in [3, p. 181]. We do not yet know how to extend the result to a 
hypersurface M” in R"*' for n > 3. Our attempts to extend existing arguments 
have run into grave technical problems (but see the endnote to §6). 

The case n = 1 (a smooth simple closed curve in R”) is, however, well under- 
stood and we set the scene by discussing this case in §1, where we also say a little 
about the origins of the problem in shape recognition. 

We thank Bill Bruce, Terry Gaffney and Leo Jonker for helpful conversations. 
Although it would be invidious to thank here those who have contributed to the 
dossier of false proofs, we are nonetheless grateful to all those with whom we have 
discussed this problem. 


1. Motivation and the curve case n = 1. A number of ways have been proposed 
for measuring or comparing the shapes of regions in the plane or in space. One 
method, which picks out local reflectional symmetries of the curve bounding a 
region in the plane (or of the surface bounding a region in space) is as follows. 

For a smooth, simple closed curve M in R?’, consider circles which are bitangent 
to M, that is, tangent to M at more than one point. Such a circle picks out a “local” 
(first order) reflectional symmetry of the curve: the tangent lines and points of 
contact py), p, are symmetrical about a line (Fic. 2, left). The centers of all such 
bitangent circles trace out a locus of points called the symmetry set of M (closely 
related to the symmetric axis transform, medial axis, central set,...). This set 
figures quite prominently in the literature of shape recognition. See [1,2] and, for 
more mathematical details, [3, 4]. 

Question: can M be recovered from the bitangent circles? We need, clearly, not 
just the locus of their centers, but also their radii. It is certainly very plausible (Fic. 
2, right) that M is just the envelope of the bitangent circles. But note that, if M is to 
be recoverable, then every point p, of M—or nearly every point—must contribute 
to the symmetry set. If a whole open arc U of M has the property that, for p, € U, 
there is no bitangent circle with p, as a point of tangency, then there is no way of 
recovering U from information about bitangent circles. 


1990] THE BITANGENT SPHERE PROBLEM 7 


tangent lines 


FIG. 2. 


In fact, it is proved in [3, Proposition 3.5, p. 181] that, for a generic curve M (i.e., 
for “almost all” M), the whole of M is the closure of the subset M, reconstructed 
as the envelope of circles centered on the smooth part of the symmetry set: it is just 
necessary to fill in some limit points of M, to have the whole of M. This fact relies 
on the case n = 1 of our problem, namely 


(1.1) Proposition. Let M be a closed curve smoothly embedded in R?, and let 
Po = M. Then there is a circle or line tangent to M at py) and at some point p © M, 
Pp # Po: 


Proof. Let us choose coordinates with p, = (0,0) and the tangent to M along the 
x-axis (FIG. 3). Let a: S!— R* be a smooth parametrization of M, a(t) = 
(X(t), Y(t)), and let a(t.) = py. Note that X(t.) = Y(to) = Y(t.) = 0. For t # 
ty, t © S', we write down the reciprocal of the radius r(t) of the unique circle C(t) 
through a(t), tangent to the x-axis at (0,0). A short calculation shows that 1/r(t) is 
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f(t), where 
2Y(t) 
(X(2))° + (YX) 


For a point such as p, = a(t,) in Fic. 3, Y(t,) = 0, meaning that the “circle” is a 
straight line. In any case, let us consider f as a function on M — { po}. 

The condition for f to have a stationary point (f’(t) = 0, ¢ # ¢)) is, intuitively, 
that the radius r increase up to ¢ and then decrease (or vice versa), which means 
that the circle is tangent to M at ¢. Indeed, it is easy to check that the normal to the 
above circle (or line) has direction (2X Y, Y? — X7), and this is perpendicular to the 
tangent to M precisely when f’ = 0. Hence (1.1) is equivalent to: there exists t # to 
for which f’(t) = 0. 

Now f can actually be extended to a smooth function on all of S'. For two 
applications of |’H6pital’s rule to the quotient giving f reveal that, as t > to, f(t) 
> Y"(ty)/[ X"(to)]”, which is, up to sign, the curvature x of M at pp. (Indeed, the 
circle C(t) tends to the circle of curvature of M at to, as ¢ — ty.) Thus, we define 
f (ty) = k(t). By writing M locally as a graph y = h(x) = x*h,(x) say, for smooth 
h and h,, it is then easy to check that f is a smooth function at ¢) as well as 
everywhere else on S'. 

Since S' is compact, f must achieve its maximum and minimum values, and the 
values of ¢ at which this happens cannot all be at tf). Consequently, f(t) = 0 
somewhere else, proving the result. 


f(t) = 


(1.2) REMARKS. (1) Note that the above proof works for any smooth a which is 
an immersion (i.e. X’(t) = Y(t) =O never happens) so long as a(t) = py only 
occurs for ¢ = fy. In other words, the condition that a(S') be a simple curve is not 
used: it could have crossings except at Do. 

(2) It is of course possible for f’(t)) to be zero. The condition for this to happen 
is that p, be a vertex of, M, that is, the curvature of M be stationary at py. Note 
that if p, is not a vertex of M, then there must be at least two values of t (different 
from ¢,) where f’(¢) = 0; that is, there must be at least two bitangent circles or lines 
which are tangent to M at pp. 

(3) Suppose p, is a vertex of M, and suppose /’ vanishes at precisely one other 
point t, # ty. If we also have f(t,) = 0, then there is no circle, only a line, tangent 
to M at p, and elsewhere. Here is an explicit construction for a curve M where this 
happens (Fic. 4). Let M* be an ellipse, and let C* be the circle tangent to M™* at 
the ends pé* and p}# of the minor axis. It is easy to check that no other circle (or 
line) is tangent to M* at p;* and elsewhere. Now invert the figure with center g on 
C*, That is, consider the plane to be the complex plane with origin at g and 
consider the image of the circle and the ellipse under the map z — 1/2. Inversion 
carries circles passing through g into lines and all other circles into circles (see, for 
example, §36 of [6]). Hence, /* becomes a non-convex curve M with bitangent line 
C, tangent at p, and p,. Moreover, if a circle were tangent to M at p, and some 
other point, then inverting back would give a corresponding circle or line for M*, an 
impossibility. Note that p(* is a vertex of M*; hence, as inversion preserves contact 
with the circle of curvature, p, (also p,) is a vertex of M. 

Question. Can this procedure be modified so as to produce a whole arc of points 
on M for each of which no bitangent circle (only a line) exists? (Presumably such an 
M would contain two straight line segments, a highly nongeneric situation!) 
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M 
pt 
Py 
c* M* ; 
Invert 
q 
* 
Po Po 
C 
Fic. 4. 


In the special case when g lies on the major axis of M*, the curve M will have a 
line of symmetry. In fact, it is then a limacon. Up to similarity, there is just one 
limacgon whose bitangent line is tangent at two vertices. In terms of its usual polar 
form r = a+ bsint (which can be derived from inversion of an ellipse), this is the 
limacgon for which a = b,2. 

(4) The existence of vertices is precisely what goes wrong with the line of thought 
we mentioned at the outset. Given a curve M and a point pp, any sufficiently tiny 
circle tangent to M at p, will meet M only at pj. If the circle lies inside the region 
bounded by M it is also true that there must be circles tangent to M at py, obtained 
by pumping up the tiny circle, which meet M at points other than py. Thus, it is 
true that there must be a smallest circle, call it C, with the property that any smaller 
circle meets M only at p,. Moreover, C will be tangent to M at any point p at 
which C meets M. However, it may happen that the only point at which C meets M 
iS Py (in which case p, must be a vertex). An example of this is M an ellipse and py 
one end of the major axis. 

(5) We can pursue the inversion idea further to start with any M (Fic. 3) and 
invert with center p,. The circles tangent to M at p, become horizontal straight 
lines, and we are looking for one of these which is tangent to M at a finite point ( py 
has disappeared to infinity). Once we know the inverted M, say M*, has the same 
horizontal asymptote in both directions (a fact equivalent to the extendability of f 
in the proof of (1.1)) the result is obvious (FIG. 5). 


a a i eed 
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(6) The same argument used to prove (1.1) establishes the existence of a bitangent 
(m — 1)-sphere or hyperplane for each point p, of a smooth closed curve embedded 
in R™ (or immersed, with no crossing at p,). 

(7) Before leaving the case n = 1, we mention one interesting refinement of (1.1). 
Some bitangent circles are undoubtedly more significant than others for the pur- 
poses of shape description. Let us orient the curve M; that is, give M a direction. In 
Figure 6, left, the upper circle is “coherent” in that the orientations of M at the two 
points of contact both point in the same way round the circle (counterclockwise), 
whereas the lower circle is incoherent—and probably rather insignificant, too, from 
the viewpoint of shape. Then (1.1) can be sharpened as follows. 


(1.3) PROPOSITION. With the hypotheses of (1.1), there is a coherent circle, tangent 
to M at p, and at some other point p # Py. Indeed, if po is not a vertex of M then there 
are two such coherent circles. 


REMARK. This result is false if we assume only an immersed curve M (compare 
Remark (1) of (1.2)). For a suitably shaped figure-eight, with p, at the bottom 
point, has only one bitangent circle and that one is incoherent (Fic. 6, middle). 


Po 


Fic. 6. 


Proof of (1.3). The absolute maximum or absolute minimum of f on M, as in the 
proof of (1.1), will occur away from py, and both will do so when p, is not a vertex 
of M. Consider the absolute maximum of f/f, illustrated in Fic. 6, right, where ng is 
the inward-pointing normal to M at p,. The circle C corresponding to this 
maximum will be entirely inside M. (No point on M can be inside C, for that would 
give a smaller circle tangent to M at p, and, hence, a Jarger value for f. But ng is 
the inward normal for M, so the whole interior of C is inside M.) 

Let M be oriented as shown at py. Then we claim that the orientation of M at p 
is coherent (D to E in the figure). For otherwise, the inward pointing normal at p 
will point out of C, so that there will be points close to p which are inside C but 
outside M. 

(Alternatively, it is clear that there is no way of joining B to E and A to D by 
arcs that do not go across the circle C.) 

This proves the result for the absolute maximum, and a similar argument covers 
the absolute minimum case. Note that negative values of f just refer to circles which 
are “under” the tangent at po. 
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2. The surface case n = 2. The result here is: 


(2.1) PRoposITION. Let M be a compact surface without boundary, smoothly 
embedded in R*, and let py © M. Then there is a sphere or plane tangent to M at py 
and at some point p © M, p # po. 

As in the curve case, once this is known, we can deduce that a generic surface M 
is the closure of the subset M, obtained as the envelope of spheres centered on the 
smooth part of the symmetry set. 

There is no problem setting up a function f: M — { po} > R as before. In fact, 
let us take p, = (0,0,0) and the tangent plane to M at p, the xy-plane. Then we 
define F: R*® — {(0,0,0)} > R by 


22 


F(x, y,z) = x? + y*+ 2? 


(1) 
and consider F restricted to M — { py}, which we write as /. 

Let us note that the level sets of F are precisely the spheres centered on the z-axis 
and tangent to the xy-plane at (0,0,0). In fact F(x, y, z) =k is the sphere with 
center (0,0,1/k) for k # 0 and the xy-plane for k = 0. Now f has a critical point at 
P precisely when the level set of F is tangent to M at p. (This is the principle behind 
Lagrange multipliers. If M is (locally) given by the equation m(x, y, z) = 0, then 
we are considering F' subject to the “constraint” m = 0. The condition for a critical 
point at p is that grad F = A grad m at p for some Lagrange multiplier A, and this 
says that the normals to the level sets { F = const} and {m = const} coincide at p. 
This is equivalent to the level sets of F and M being tangent at p.) Hence (2.1) is 
equivalent to: 


(2.2) LEMMA. f = F|M + { pg} has a critical point at some point p © M, p # po. 

It would be nice if we could, as in the curve case, extend f to a function on M. 
Then we could argue that there must be a maximum and a minimum on M. But 
problems do arise when we try to extend over M. To see this, write M locally as 
z = h(x, y) near py. Then f has no linear terms (the tangent plane at p, is z = 0) 
and we can further choose the x and y axes so that 


1 1 
h(x, y) = zx + 5 kay" + h,(x,y) (2) 


where h, has order at least 3 in x and y. (The x- and y-axes are in principal 
directions at py; x, and «, are the principal curvatures there. This local form for M 
is called Monge normal form.) 

Writing f as a function of x and y and replacing x and y by polar coordinates 
r,60(x =rcos0, y=rsin8@, r # 0) gives, after cancelling r7, 


K, cos” 6 + x, sin’ 6 + rg,(r, 6) (3) 
1+ r’g,(r, 9) 


for some smooth functions g, and g, (of period 27 in @, of course). Now fixing @ 
and letting r — 0, the limiting value of f(r, @) is 


f(r, 6) = 


K, cos? 6 + Kk, sin’ 6. (4) 


If x, # «, then this clearly depends on @, that is on the direction in which the origin 
is approached. In fact this limiting value is precisely the sectional curvature of M at 
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Po in the direction @. So we cannot extend f over M unless it so happens that 
K, = Kp. 

A point of M where xk, = x,—and hence where all sectional curvatures coincide 
with «x,—is called an umbilic of M. If p, is an umbilic then, extending f by 
defining f( py) = ,, we obtain in fact a smooth function on all of M, and as in §1 
(since M is compact) this must have a critical point away from p,. Hence: 


(2.3) LEMMA. (2.1) holds if py is an umbilic point of M. 
Question. What is the condition for f to have a critical point at p, in this case? 


REMARK. Before proceeding, we remark that it is possible to have a plane, but no 
sphere, tangent to M at p, and elsewhere. In fact an example can be produced by 
exactly the same trick as in the curve case. Start with an ellipsoid and a sphere 
tangent to it at the ends of its shortest axis, then invert with center on that sphere. 
We hope that the reader will enjoy checking the details! 

If p) is not an umbilic, we have seen that we cannot extend the function f on 
M — { p,} to a smooth (or even continuous) function on M. Thus, it might seem 
that we have reached an impasse, which is especially serious in view of the fact that 
umbilic points are highly nongeneric (in fact, the only closed surface on which all 
points are umbilic is a round sphere—see, for example, [7, Part two, V.1.2.]. 

However, if we think of M as an extension of the open surface M — { py}, we 
might ask if there is some other way of extending M — { p,} to a closed surface 
which will allow us to extend f as well. Pursuing this thought will allow us to 
establish lemma (2.2) in the case k, # k. 


3. The surface case: gradient vector fields. We assume from now on that k, # Kk), 
so that (2.3) does not apply. 

The idea here is to find a compact surface M without boundary, containing 
M — { po} in the sense that there is a smooth injective map i: M — { p)»} > M. We 
seek also a smooth map f: M — R extending f in the sense that f(p) = f(i(p)) 
for p & M — { py}. 

In order to prove (2.1) we shall then assume that f has no critical points, so that 
any critical points f has must be on the “extra” part of M. By examining f it will 
turn out that it has an impossible collection of critical points. So let us first see what 
restrictions there are on the critical points of a smooth function g on a compact 
surface N. 

The usual way of studying the relationship between a surface (more generally, a 
manifold) and the critical points of a smooth function on it is through Morse 
theory. Morse theory is powerful and delicate, but, at least in its usual formulation, 
relies on the “nondegeneracy” of the critical points. (If u and v are local coordi- 
nates on WN then the partial derivatives g, and g, are zero at a critical point of g, 
but g,,,2.» — £2, is nonzero at a nondegenerate critical point. See [9].) There is no 
guarantee that our function f has only nondegenerate critical points, so we shall 
work directly with the gradient vector field defined by the function. That is, at each 
point p of N we take the tangent vector in the direction grad g( p) (= (g,, g,) in 
local coordinates on AN). This is perpendicular to the level curve in N given by 
{ g = constant} through p in the direction of increasing g. For a beautiful discus- 
sion of vector fields, and of the Poincaré-Hopf theorem which we shall use, see §6 of 
[8]. See also [7, §3.5] and [5, §15]. 
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grad g( p) grad g(P) P grad g( p) 


P 


index 1 (minimum) index —1 (saddle) index 2 (dipole) 


The vector field at p points along the curve through p. The arrows point uphill. 


FIG. 7. 


Of course singular points of grad g give zeroes of the vector field. Each isolated 
zero p, has associated with it an index. To find the index, simply choose local 
coordinates with p, = (0,0) and center a small circle u* + v? = € at p,. Then see 
how many times the vector grad g( p) turns anticlockwise as p traverses the circle 
once anticlockwise. This number is the index (Fic. 7). (It is also the degree of the 
map p — grad g( p)/||grad g( p)|| from the circle to the unit circle.) 

Suppose that N is a smooth compact surface. If N is broken up into polygonal 
faces meeting only along edges, then the Euler characteristic x(N) of N is defined 
to be the number V — E + F, where V, E, and F are the number of vertices, edges, 
and faces, respectively. The Euler characteristic does not depend on how we break 
N up. Let g: N > R be a smooth function on M which has all its critical points 
isolated. If N has a boundary, then the vector field is required to point outwards 
along the whole boundary or else inwards along the whole boundary. The crucial 
theorem is: 


(3.1) POINCARE-HOPF THEOREM. The sum of the indices of the vector field is the 
Euler characteristic x(N) of N. 

This theorem applies to any smooth vector field with isolated zeroes, not just to a 
gradient vector field. 

In order to apply the theorem we shall need to know the index of grad g ata 
maximum or minimum of g. Strangely, this result (except at a nondegenerate 
maximum or minimum) does not seem to have a convenient reference, so we shall 
prove it. 


(3.2) LEMMA. The index of grad g at an isolated maximum or minimum of g is 1. 


Proof. Using local coordinates (u, v) we may assume that the isolated maximum 
or minimum is at (0,0) and that g(0,0) = 0. It follows directly from the definition 
that multiplying by a nonzero constant does not change the index. Thus, upon 
multiplying by — 1 if necessary, we may assume that (0, 0) is a minimum of g. Let D 
be a small disk centered at (0,0) and containing no critical point of g besides (0, 0). 
Let 6, be the minimum value of g on the boundary circle S of D (note that S is 
compact, so this minimum is attained). We first show that if 0 < 6 < 6), then 
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C = g '(8) N D is one smooth closed curve surrounding (0,0). Of course, C lies in 
the interior D of D. 

Now 6 is certainly a regular value of g on D (the only critical point in D has 
value 0) so g~‘(6) N D, which is inside D, is a 1-manifold; that is, a union of 
smooth closed curves. Consider one of these, C, say. If (0, 0) is not in the interior of 
C,, then C, will bound a topological disk D, inside D. The function g is constant 
(= 6) on C,, so g is either constant on D, or it achieves its absolute maximum or 
minimum on D, at a point of D,. In either case g has a critical point inside D,, 
which is not true. Similarly, if 2-8) 1 D has two components C, and C,, which 
we now know both contain (0,0), then the region A between C, and C, is compact 
and g = 6 on the boundary C, U C, of A. Again g has a critical point in A, a 
contradiction. 

Since C is a topological circle, it bounds a topological disk and a topological disk 
has Euler characteristic 1. Moreover, grad g points outwards all along C (indeed 
along the normals), so that the sole zero of grad g has index 1 by the Poincaré-Hopf 
theorem. 

We could have avoided appealing to the Poincaré-Hopf theorem to finish the 
proof of the lemma above. One uses the fact that grad g is always normal to C (Fic. 
8) to show that the map C — S' (S' = unit circle), p > grad g( p)/|lgrad g( p)|l, 
has degree 1. The degree is a homotopy invariant (this is a standard fact: see, for 
example, [8] §5) and it is easy to produce a homotopy of C onto a small circle inside 
C (along which grad g does not vanish). Thus, the index of the same map on a small 
circle inside C (and, so, the index of the zero of grad g) is 1. 


Fic. 8. Grad g points outwards from C for a minimum and inwards for a maximum. 


(3.3) LEMMA. Let g: N — R be a smooth function on a compact surface N without 
boundary. Then: (1) g cannot have exactly two maxima and two minima (and no other 
critical points); (ii) if g has exactly one maximum and one minimum (and no other 
critical points), then N is a topological sphere. 


Proof. (3.1) and (3.2) give x(N) = 4 in case (1) and x(NV) = 2 in case (11). But 
x(N) < 2 for any compact N without boundary, with equality if and only if N is a 
topological sphere (see [10], §39). 
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4. Extending M and f: doubling M. We shall give two ways of extending 
f:M-— {po} -R to f:M— R. Consider formula (3) in §2 above, which “ex- 
tends f over p,” at the price of having a different value for each direction of 
approach to py. What could be more natural than to unplug py and plug in a circle 
S to accommodate all those different values? In fact, for diametrically opposite 
directions (9 and 6 + 7) we do get the same extended value x, cos? # + x, sin’ 6. 
How about sewing together diametrically opposite points of S, to close up the 
surface? That is what method 2 below does (FIG. 9, center); see §5. Alternatively, we 
could stick together two copies of the surface with boundary S to make a-surface 
without boundary, by a “doubling” procedure. That is what method 1, which we 
shall consider in this section does (FIG. 9, right). Method 2 (“blowing up”’) is in a 
sense the least one has to do to extend M and f. Doubling is more extravagant. 


Po S 


FIG. 9. 


REMARK. It is tempting to work directly with the surface whose boundary is S. It 
turns out that f extends and that it is possible to push through a proof of lemma 
(2.2); in fact, this was the way that the authors first proved (2.2). However, the 
technical details are much more complicated. We cannot use the Poincaré-Hopf 
theorem because the vector field does not point all inwards (or all outwards) along 
S. Thus, it is necessary to appeal to Morse theory for manifolds with boundary. 
Even this is not straightforward: the vector field grad f has zeroes on the boundary 
S which is forbidden in Morse’s original presentation [9] (and the subsequent 
formulations with which we are familiar). The arguments needed to circumvent this 
difficulty are delicate and neither author quite believed them until the alternate 
arguments detailed above were found. 

Let us, as in §2, take M tangent to the xy-plane at p, = (0,0,0), with M given 
near py, by z = h(x, y) as in (2) of §2. In fact, let us suppose M is this graph for 
0 < \x* + y* <.e,. Let M, be the open cylinder (—.«,, e,) X S’ consisting of points 
(r,9) for O< 0< 2a and —e,<r<e,. Let M, and M, be two copies of 
M — {0}. We shall make a smooth surface M out of M,, M,, and M, (Fic. 10). 
Now a smooth surface M is made up of coordinate patches, that is bijective maps o, 
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Fic. 10. 


from open sets U, in the plane to open sets o,(U,) in M. The crucial condition is that 
when o,(U,) and o,(U;) overlap, the “transition function” o;* oo, must be smooth 
(it is amap U — R? where U is open in R”). When, as here, we have three known 
surfaces to stick together, we need 

(i) to say which points (of M, and M,, M, and M,) are identified and 

(ii) to write down transition functions, using coordinates on M,, M,, and M,. 

We handle these as follows: 

(i) (x, y, h(x, y)) © M, is identified with (r,@) © M, where x = rcos6, y = 
rsin@ (0 < r < «,). Also (x, y, h(x, y)) © M,; is identified with (—r, 8 + 7) € M,. 

(ii) We can use local coordinates (x, y) (0 < x* + y* < e?) on M, and M,. If 
S' c R* is {(X, Y): X? + Y* = 1} (think of X = cos 6, Y = sin), then M, has 
four convenient coordinate patches: 


(X,r) > (r, X,v1- xX?) | [x <1 
(X,r) > (7, X,- VI - x7) ] I< 4% 


(Y,r) > (r, V1 — ¥2,Y) Hee 
(Y,r) > (v,- V1 - Y?,Y) 


lr| < e 


The transition function between M, and M, using the first coordinate patch is then 
the smooth function 


(9) 0.7) = (1G), re 


x* + y? 


where y > 0 since r = (x? + y* > 0 and Y= yl — X* = sin@ > 0 here. Similar 
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r=0 ly 
l 


Fic. 11. Identifications to make M. 


ee 
~ ~ 


Po 


formulas apply to the other three patches. For M, and M, (first patch) we have 
(9) = = [=I (x? + y*) Be ersaal 
y*) 
where y < Osincer = — jx* + y*<OandY=yl — X’? = sind > 0. 


In this way, we construct a smooth compact surface M without boundary. Now 
we need to define f: M —> R extending f. That is, we need to define f on M,, M, 
and M, and check the definitions agree on overlaps. The various f,;: M,; — R must 
also be smooth, of course. 

On M, and M, we use formula (1) of §2. On M, we define f,: M, > R by 
formula (3) of §2. Thus using local coordinates (x, y) on M, and M, near pop, 


2h(x, y) 
x+y? + (h(x, y)) 
The important property of formula (3), defining f,, is that f,(7, 9) = f,(—r, 8+ 7). 
This is because rcos@ = (—r)cos(@ + 7), rsin@é = (—r)sin(@ +7) and f, is 
obtained from the same formula as f, or f, with x replaced by rcos 6, y by r sin@ 
and r* then cancelled top and bottom. In view of this, f,; and f, (resp., f; and f;) 


do agree on the overlap of their domains, and we get a smooth function f: M > R. 
It is now easy to complete the argument for (2.1). 


f(x,y) =f(x y) = 


(4.1) LEMMA. Assume that f: M — { py} > R has no critical points. Then f has 
two (absolute) maxima and two (absolute) minima, all on the r = 0 circle of M,, and 
no other critical points. 


Proof. Certainly f has no critical points on M except on the r = 0 circle of M,. 
If f has a critical point on this circle, then, restricting f to the circle, the resulting 
function of one variable @ will have a critical point (zero derivative). From (3) in §2, 
this function is 


6 > x, cos? 6 + k, sin’ 6, 


which (recall that x, # x) gives critical points precisely at @ = 0, 7/2, 7,37/2 on 
the circle. Also the corresponding values of f are k,, Ky, Ky, K5 respectively. Now f 
must attain its absolute maximum and minimum values somewhere on M, and the 
only choice is at these four points. This proves the lemma. 
Now (3.3) gives a contradiction, so (2.2), and hence (2.1), follows immediately. 
Question. The idea of coherent bitangent circles (1) extends to coherent bitangent 
spheres in a natural way. Is there always such a sphere for an embedded M? 


18 PETER J. GIBLIN AND DONAL B. O’SHEA [January 


Note that the argument of (1.3) won’t quite work, for possibly f could have both 
its absolute maximum and minimum on the r = 0 circle: the other critical points 
might have critical values between these two. Note that then M is entirely contained 
between its two spheres of curvature at pp. 


5. Extending M and f: blowing up. We shall go on to give a second derivation of 
(2.1), because the second construction of a closed surface provides a good illustra- 
tion of the important idea of “blowing up” a point. This is exactly what is needed to 
separate out all the different limiting values of f as r > 0 (see (3), (4) in §2). Rather 
than separate out all directions (all 6) through (0,0, 0) in the plane z = 0, blowing 
up separates all lines (@ and 6 + a give the same line). This is fine for us, since the 
limiting values of f for @ and @+ @ ((4) in §2) coincide. The effect is (roughly 
speaking) to remove a neighbourhood of p, in M and identify the boundary of a 
Mobius band with the newly created boundary of M. As a result, the new surface, 
M say, has no boundary. 


—o/ a 
aa Identification 
f to produce 


identify boundaries 


FIG. 12. 


Fortunately this is very easy to do from our construction in §4 of the “double” 
M. We simply put together M, and M, as in §4 and then identify each pair of 
points (7, 8) and (—r, 8 + 7) of M,. This has no effect on M,, which only meets 
the r > 0 part of M,. What we have to check is that, after these identifications, M, 
becomes a smooth manifold, M, say, without boundary (but “open at the 7 = e, 
end”), and that f, extends to M,. Note that (0,0) and (0, + 7) are identified 
together for each @. = 

So let us concentrate on M,(—.«, €,) X S* and on M,, obtained by identifying 
(r,9) and (—r, — @) for each (7, 8). (Thus M, has the “quotient topology”. In 
particular each point has a neighbourhood homeomorphic to a 2-dimensional disk 
—for r = 0 we can picture this as two separate pieces sewn together as in Fic. 13. 


Thus M, is a 2-manifold. Indeed M, is topologically a Mébius band (Fic. 13, 
right).) 

Looking back at the four coordinate patches for M, in §4, it is easy to see that 
the first pair have the same image in M,, and so do the second pair. We are left 


1990] 


Fic. 13. 


with two coordinate patches for M,, 
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a 


— Mdédbius 
band 


|X| < L|r| < 4, 


(1) 


\Y| < 1,|r| < e, 


where a represents the natural projection map M, > M, taking (r,cos 6, sin) = 
(r, X,Y) to {(r, X, Y), (—r,— X,— Y)}. These two maps are injective (because the 
square roots are positive). Furthermore given a(r, X,Y) in the overlap of the 
coordinate patches we can assume that X > 0 and Y > 0 so that the unique points 
going to a(r, X,Y) under the maps (1) are (X,r) and (Y,r). The transition 
function is Y = (1 — X”), which is smooth on |X| < 1. 

Thus M, is indeed a smooth surface without boundary. We can now check that f 
extends to f,: M, > R by using the same formula ((3) in §2) as we used in §4. 


Thus, we get f: M — R which is smooth. 
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The argument is concluded in a similar way to (4.1), namely: 


(5.1) LEMMA. Assume that f: M — { po} > R has no critical points. Then f has 
one absolute maximum and one absolute minimum, both on the r = 0 circle of M,, and 
no other critical points. 


To deduce (2.1) we need an extra step now. For by (3.3), M is a topological 
sphere. But M,, and hence M, contains a Mobius band (Fic. 13), which no sphere 
can do. This contradiction proves (2.2) and hence (2.1). = 

There is a more conventional way of describing the blown up part of M, namely, 
the part coming from M,, as follows. 


Let 
U, = { (21, Wye R’: x?(1 + y?) < e? } 
and 
U, = { (x2, y.) © R*:(1 + x3) y2 < ef }. 
See Fic. 15. 


& 


\ARAAAS 


anaes DALAT 
AS] LN (peTes 
SAAN YS 


X4 


U, 


FIG. 15. 


Let 7,: U, — M, be given by 


71(x,, 1) = (xy + y?) re + y?) , y/ 0 +y?) J. 


Thus 7,(x,, y,) = (7, cos 6, sin @) where rcos @ = x,, rsin@ = x,y,, and if x, = 0, 

the image is that part of the r = 0 circle for which cos @ > 0. After passing to M, 

this is all of the r = 0 circle except for one point corresponding to 6 = +7/2. 
Similarly let 7,: U, — M, be given by 


™(X5, ).) = (»./(0 + x3), x,/ {0 + x3) 1/0 + x3) ) 


This is (r, cos 6, sin 8) where rcos 6 = x,y,, rsin@ = y,, and y, covers, in M,, all 
of the r = 0 circle except for one point corresponding to @ = 0, =. 
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- Image 1, 


Image 7, 


Fic. 16. 


The images of 7, and 7, in M,, and by implication in M,, are sketched in 
FIG. 16, which shows the cylinder M, opened out along the 6 = +a generator. 
Thus 7,(U,) and 7,(U,) do not cover M,, but they do cover all of the r = 0 circle 
which is where the potential difficulties lie. There is no difficulty covering the rest of 
M,, for example, by covering the ordinary cylinder {(r,0):0<r<e} CM, by 
two coordinate patches, giving 0 < 6 < 7 and —7/2 <@< 7/2. 

The transition function between U, and U, is 


(x4, yy) = (1/y, x,y) 


(x, # 0, y, # 0 on the overlap in M,). The formula for f, extending f, in terms of 
coordinates from U,, is as follows. In the notation of (1) and (2) in §2, write 


hy (x1, X11) = xph2(x,, y,) for some hy, 


(since h, has order at least 3 in its coordinates). Also 4x,x7 + 4«,xjy? + 
(A\(%, x41)? = xth,(x,, y,), for some h;. Then 


kK, + Ki + x,h 9( x), yy) 


f(x.) = 
vet l+y; + xth 3(X1, Ys) 


Similarly for U,, and one can check that the formulas agree on the overlap in M,. In 
this way, we obtain an alternative construction of M and f. 

Note that the images of two rectangles in Fic. 15 together make up a Mobius 
band in M, and hence in M. 


6. What about n > 3?. We conclude with some remarks about higher dimen- 
sional cases. In the first place, there is an “easy case” analogous to (2.3). Suppose 
there are local coordinates (x,,..., x,, 2) © R"*' centered at p, such that locally 
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M (a smooth, compact n-manifold without boundary) has the form 


1 1 
z= zm + vee + Fh yXn + A(X, +15 Xn) (1) 
where h has order at least 3 in the coordinates and x, = --- =«,. Then the usual 
function 
F( ’ 2Z 
x, ° ? xX, Z = ? 
xpt--+ +x24 2? 


restricted to M — { p,} can be extended over all of M. It then follows that f= F|y 
has a critical point away from p,, and we deduce the existence of an n-sphere or 
n-plane tangent to M at p, and elsewhere. 

This case is the “equal eigenvalues” case, referring to the quadratic form in (1). 
Now assume not all x, are equal. = 

We can construct smooth manifolds M and M in a way analogous to that used 
in §§4 and 5, and extend f to these. In fact f: M — R looks the more useful one: 
here, py, is replaced by an (n — 1)-dimensional real projective space P which 
separates out all lines through p, = 0 in the n-plane z = 0. 

We assume again that f has no critical points on M — { py}; such a critical 
point would immediately give a bitangent n-sphere or n-plane, as in (2.2). Then f 
has critical points only on the “rv = 0” locus P, and we can check that these can 
only occur at points of P corresponding to directions along one of the coordinate 
axes in the plane z = 0. The problem is that these critical points may be maxima, 
minima or saddles of various indices, and we run into difficulties, and many 
separate cases, when we try to apply the Poincaré-Hopf theorem to the vector field 
grad f. In the case n = 2, we could get away without assumptions about nondegen- 
eracy (because of (3.2) and (5.1)), but for n > 3 we need to be more careful. _ 

As an example, let n = 3, and suppose that all the critical points of f are 
nondegenerate. The indices of grad f are all +1, and those at the maximum and 
minimum have opposite sign (since dim M = 3 is odd). Thus the sum of the three 
indices is +1, which should equal the Euler characteristic x(M). However, x(M) 
= 0 because the Euler characteristic of any 3-manifold is zero. (See, for example, 
Theorem III, 59 of [10]). In our case, this also follows directly from Poincaré duality 
which is applicable because M is orientable whenever M is orientable (this contrasts 
with the case n = 2, where M contained a Mobius band), and M is orientable 
because it is a hypersurface. 

If in fact h(x,,..., x,) = 0 and if all the x, are distinct, then we can show that 
all critical points of f are nondegenerate, but that is rather a special case! 


Endnote. At a late stage in the preparation of this article, the first author 
discussed the problem with Elmer Rees of the University of Edinburgh. Prof. Rees 
and a visitor from China, Duan Hai Bao, have subsequently announced a proof in 
the case n > 3 without nondegeneracy assumptions. The argument they give (in 
“The Existence of Bitangent Spheres”, Proc. Royal Soc. Edinburgh 111A (1989), 
85~—87) is based on much more technical considerations, related to the Lusternik- 
Schnirelmann category of a manifold, than those discussed in this paper. We still do 
not know of an “elementary” argument which works in the general case. 


1990] THE BITANGENT SPHERE PROBLEM 23 


REFERENCES 


1. H. Blum, Biological shape and visual science I. J. Theoret. Biol. 38 (1973) 205-287. 
2. M. Brady, Criteria for representations of shape, in ‘Human and Macine Vision’, ed. by J. Beck et 
al., Academic Press, New York, 1983. 
3. J. W. Bruce, P. J. Giblin, and C. G. Gibson, Symmetry sets, Proc. Royal Soc. Edinburgh, 101A 
(1985) 163-186. 
4. P. J. Giblin and S. A. Brassett, Local symmetry of plane curves, Amer. Math. Monthly, 92 (1985) 
689-707. 
5. V. Guillemin and A. Pollack, Differential Topology, Prentice-Hall, Englewood Cliffs, NJ, 1974. 
6. D. Hilbert and S. Cohn-Vossen, Geometry and the Imagination, translated by P. Nemenyi, Chelsea, 
New York, 1983. 
7. H. Hopf, Differential Geometry in the Large, Lecture Notes in Mathematics, 1000, Springer-Verlag, 
New York, 1983. 
8. J. W. Milnor, Topology from the Differentiable Viewpoint, Univ. of Virginia Press, 1965. 
9. M. Morse and G. B. van Schaack, The critical theory under general boundary conditions, Ann. 
Math., 35, no. 3 (1934) 545-571. 
10. H. Seifert and W. Threlfall, A Textbook of Topology, translated by M. Goldman, Academic Press, 
New York, 1980. 


The o-Game and Cellular Automata 


KLAUS SUTNER, Stevens Institute of Technology, Hoboken, NJ 07030 


KLAUS SUTNER received his doctorate in Mathematics from the Ludwig 
Maximilians Universitat in Munich in 1984. He is currently teaching Com- 
puter Science at Stevens Institute of Technology in Hoboken, New Jersey. 
His research interests lie in the area of the theory of computation. 


1. Summary. In an article in this journal Don Pelletier discussed the mathemat- 
ics involved in a little battery operated toy called MERLIN (see [3], and also the 
“Addenda” in this MONTHLY, Dec. 1987, page 994). Several years ago Stephen 
Wolfram, in another article that appeared in the MONTHLY, analyzed a number of 
simple cellular automata and the fractal patterns generated by some of these 
automata (see [6]). In this article we point out the close connection between 
MERLIN-type games and a class of cellular automata related to the ones described 
by Wolfram. We introduce a game played on directed graphs and give a detailed 
analysis of the special case where the graph is a rectangular grid. Our analysis uses 
linear algebra as well as ideas from the theory of cellular automata. 


2. Merlin’s Magic Square and o-automata. We begin with a description of a 
simple combinatorial game called the o-game. The o-game is played on a directed 
graph G. We suppose that the vertices of G can have one of two different states, 
which we designate by 0 and 1. A configuration is an assignment of states to all the 
vertices. A move in the game consists of the player picking a vertex. One may think 
of a vertex v in G as a button that the player can press at his discretion. If vertex v 
is chosen the states of all the vertices u such that (v, u) is an edge in G will change. 
Now suppose the opponent picks two configurations on G, say X, (the source 
configuration and X, (the target configuration). To win the player has to find a 
sequence of moves that transforms configuration X, into configuration X,. Any 
such sequence will be called a solution for (X,, X,). Call (X,, X,) a winning pair iff 
there is a solution for (X,, X,). Similarly one defines the o*-game: the move v 
changes the state of vertex v itself in addition to changing the states of all adjacent 
cells. 

A battery operated toy called MERLIN that emulates the o*-game on the nine 
point directed graph G,, shown below is described in [3]. X, is fixed and X, is 
randomly generated by the device. 
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In another version of the game the target configuration X, is fixed to be the 
“All-Ones” configuration 1 and the source configuration X, is always 0. However, 
the underlying graph varies, say, over all n Xn grids. In order to extend the 
definition of the o-game to undirected graphs such as grids, let us adopt the 
convention that every undirected edge {u,v} is interpreted as the pair of directed 
edges (u, v) and (v, u). 

The following questions come to mind: 

— Given the underlying graph G, what are the winning pairs? 

— Given a winning pair (X,, X,), how many solutions are there; in particular, when 
is the solution unique? 

— Given a winning pair (X,, X,), what is a minimal solution, i.e., a shortest. 
sequence of moves that will transform X, into X,? 

The reader may have noticed that the o-game can be described using linear 
algebra on F,, the field with two elements. Let G = (V, E) be a finite directed 
graph on n points. One may think of the adjacency matrix A of G as an n by n 
matrix over F,: the matrix A has a 1 in the u-th row and vth column iff there is a 
directed edge from vertex u to vertex v in G. A configuration X is a map from V to 
F,. The collection of all configurations of G is the configuration space of G, in 
symbols C,. Configurations are conveniently identified with subsets of V, namely X 
is identified with {v © V|X(v) = 1}. Cg may be construed as a vector space over 
F,; addition here amounts to taking symmetric differences. We will write 0 for the 
empty set and 1 for the set V as members of Co, so that O0(v) = 0 and 1(v) = 1 for 
all v in V. 

Now define a map p: V X C, > Ce by 


(v, X)(u) = 1+ X(u)_ if (uv, u) is an edge in G, 
-_ a X(u) otherwise. 


Thus p(v, X) is the configuration that results from configuration X after move 
uv. w is readily extended to sequences of moves. It is straightforward to see that 


u(vv, X) =X 
and 
u(uv, X) = p(vu, X). 


Therefore, a solution to the game is a subset of V rather than a sequence in V. In 
other words, » may be construed as a map p: Co X Co > Cg. Let us introduce a 
map 
a: O, 7G 
defined by 
o(Z) = p(Z,0). 

Note that u(v, X) = X + o(v). Hence p(Z, X) = X + o(Z). A solution to the 
o-game is thus given by a configuration Z such that p(Z, X,) = X,, or, equivalently, 


o(Z)=X,+X,. (2.1) 


Observe that by (2.1) (X,, X,) is a winning pair iff (X,, X,) is a winning pair, a fact 
that is somewhat less than obvious from the original definition of the o-game. One 
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can define »* and o* similarly. If the underlying graph is not clear from context 
we will occasionally write o, and so forth. 

The key observation that makes it possible to analyze our game in great detail is 
that both o and o” are linear. To see this think of the adjacency matrix A of G asa 
n by n matrix over F, as described above. Now construe a configuration X as a 
column vector over F,. Then clearly 


o(X)=A™-X and ot(X)=(A'4+/)-X, (2.2) 


where A’ is the transpose of the adjacency matrix of G and J is the identity matrix. 
Hence o and o” are linear maps from the configuration space to itself. Thus we 
have the following preliminary answer to the first two questions above: (X,, X,) is a 
winning pair iff X, lies in the affine subspace X, + rg(o) where rg(o) denotes the 
range of o. Furthermore, the collection of all solutions is an affine subspace of C, 
and has the form Z, + ker(o) where Z, is any particular solution for (X,, X,) and 
ker(o) is the kernel of o. Let us define d(G) = dim(ker(o)) = log,(|ker(c)|). 
Similarly let d*(G) := dim(ker(o*)). Then any winning pair of the o-game on G 
has exactly 2“) solutions and there are 24°“ solutions for the o*-game. 

To find a solution of the o-game therefore amounts to solving a system of linear 
equations over F,, and in order to determine the number of solutions one has to 
compute the corank of the adjacency matrix of G. We will give a simple example of 
this procedure. Let us agree on some notation for graphs: P,, will denote the 
undirected path graph on m points. The vertices of P,, are assumed to be 
{1,2,...,m}. For later use define P,,,, to be the rectangular grid on vertices 
{1,...,m} X {1,...,}. Note that the adjacency matrix A of P,, has 1’s in the sub- 
and the super-diagonal, and 0’s everywhere else. A little computation shows that: 


A has rank m iff m is even 
and 


A has rank m — 1 iff m is odd. 


Thus for even m the range of o is Cg by (2.2). It follows from (2.1) that every pair 
(X,, X,) is winning on P,, for even m. For odd m one can see that the range of o 
consists of all configurations Y such that |Y M {1,3,..., m}| is even. Hence, for m 
odd, (X,, X,) is winning iff |(X,+ X,) 9 {1,3,...,m}| is even. Note that 
{1,3,..., m)} is the only nontrivial element in the kernel of o in this case. Therefore, 
we have shown 


_ {0 if mis even, 
d(P,,) = . if m is odd. (2.3) 


Similarly one can show that 


n _ {0 if m #2 (mod 3), 
AP) ‘ if m = 2 (mod 3). (2-4) 
Unfortunately this method breaks down when applied to general grids: the 
adjacency matrix of P,, ,, becomes unwieldy and there seems to be no elegant way 
of solving the equations of the form A - X = Y. Of course, one may use brute force 
and a computer. 
However, there is a different approach to the o-game in terms of cellular 
automata. A cellular automaton is a discrete dynamical system. Its components, 
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called cells, are interconnected in a fixed way and can assume finitely many possible 
states. Here we will be interested only in automata with possible states 0 and 1. A 
configuration of the automaton is a map from its cells to {0,1}. Every configuration 
determines a next configuration via a local transition rule: the state of a cell at time 
t+ 1 depends only on the states of its neighbors at time t. Now suppose the 
underlying mesh of cells of an automaton is given by a directed graph G = (V, E). 
Define the neighborhood of v in G to be 


C(v) = {ue V\|(u,v) € E}. 


Note that this is the open neighborhood of », 1.e., v is not included in I'(v). We 
write [*(v) for the closed neighborhood I'(v) U {v}. Let X be a configuration on 
G, X: V — F,. Then we may express the map o by: 


o(X)(v)= QL X(u). 
uEGT(v) 
Hence o can be construed as a cellular automaton rule. The dynamical system 
consisting of the graph G together with the rule o will be called the o-automaton on 
G. Similarly one obtains rule o* with ['*(v) taking the place of I'(v). Observe that 
the rules o and o” are well defined even for infinite graphs as long as the graphs 
are locally finite (1.e., ['(v) is finite for all v). 

A well-known example of a o-automaton is the traditional 1-dimensional cellular 
automaton presented in Wolfram’s article. The underlying (undirected) graph is the 
infinite path P... Thus there are infinitely many cells v,, i © Z, and the neighbor- 
hood of cell v; consists of v;_, and v,,,. Rules o and o* on P,, are usually called 
rule 90 and rule 150 respectively. The evolution of the seed configuration X = {vu } 
under rule o and its relation of binomial coefficients are discussed by Steven 
Wolfram in [6], see also [7] and [8]. The resulting fractal pattern for rule o has 
dimension log,3 and for rule o* log,(1 + ,5). 

The existence of predecessor configurations on finite o-automata is connected 
with the reversibility of the automaton. The rule o is said to be reversible on G if 
every configuration X in C, can be reconstructed from o( X), in other words, if o,: 
C, — Cg 1s injective. Because o, is a linear map this is equivalent with o, having 
trivial kernel. But C, is a finite space, thus rule o is reversible on G iff o, is 
surjective. Hence every pair of configurations is a winning pair on G iff every 
winning pair on G has exactly one solution. For the Merlin graph G,, from above it 
is shown in [3] that rule o* is reversible. Thus every configuration on G,, has a 
predecessor and the game MERLIN always has a uniquely determined solution 
regardless of which starting configuration the device generates. 

A configuration on a cellular automaton that fails to have a predecessor is often 
called a Garden-of-Eden configuration: once lost it cannot be regained. See [1] and 
[2] for a discussion of Garden-of-Eden configurations in classical cellular automata. 
If the underlying graph G is undirected, one can show that a configuration X has a 
predecessor under rule o iff 


for every configuration Y in the kernel of o, the cardinality of X MN Y iseven. (2.5) 


Incidentally, the same result holds for rule o*. As a consequence, o*-automata on 
undirected graphs have the peculiar property that the All-Ones configuration 1 is 
never a Garden-of-Eden configuration. Hence (0, 1) always is a winning pair for the 
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o*-game on undirected graphs. Using standard techniques from linear algebra one 
can solve the system A - X = 1 and compute a basis for the affine subspace of all 
solutions in polynomial time (i.e., in a number of steps polynomial in the number of 
vertices of G). However, it is NP-hard to find a minimal solution. Hence most likely 
the only way to construct a minimal solution is to perform an exhaustive search over 
all 2¢°(® candidates. For a proof see [4] and [5]. To determine the number of 
solutions is similarly quite difficult. Even for a simple class of graphs such as 
rectangular grids we do not know how many solutions there are in general. Figure 1 
gives solutions for the 48 x 48 and the 80 X< 80 square. The solutions are unique in 
each case. 
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Fic. 1. Solutions to the all-ones problem for 48 X 48 and 80 X 80 squares (using rule o*). 


It is easy to see that in general (0, 1) fails to be a winning pair for rule o. For 
example on the 3 X 3 square P; , configuration 1 cannot be reached from 0 by any 
sequence of moves. Indeed, Y = {(1, 1), (2, 2), (3, 3)} lies in the kernel of o on P; 3. 
But | YA 1| is odd, and hence by (2.5), 1 fails to have a predecessor. 

For rule o, however, it is relatively easy to determine the number of solutions on 
a m by n grid. The remainder of this note is devoted to a proof of the following 
theorem. 


THEOREM. For all m,n > 1 the kernel of rule o on the m Xn grid P,, , has 
dimension 


gcd(m+1,n+1)—-1. 


Hence the m X n grid is reversible under rule o iff m +1 and n+ 1 are relatively 
prime. 


In fact we will explicitly construct a basis for the kernel of o on P,, ,,. The results 
in [4] show that this description of the kernel of o on P,, ,,, in connection with (2.5), 
implies that (0, 1) is a winning pair on the n X n square iff n is even. The number of 
solutions in this case is 2” by the theorem. In general let p = gcd(m + 1,n + 1) - 
1. Then the All-ones game has a solution on P,, ,, with rule o iff 


(1) p is even or 


(2) (m+1)(n4+1)/(p + 1)’ is even. 
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3. The number of solutions on rectangular grids. To prove the theorem we will 
exploit the fact that it is frequently possible to simulate one cellular automaton on 
another. The evolution of a configuration on the first automaton can thus be studied 
on the second automaton. To make this precise, let us say that the o-automaton on 
H simulates the o-automaton on G iff there is an injective linear map f: C, — Cy 
such that for all X in Cc: 


f(og(X)) = o7(f(X)). 

The map f will be called a simulation of G on H. Note that since f is required to be 
injective we have d(G) < d(H) whenever H simulates G. As an example consider 
the path on m points P,,. The o-automaton on P,,,,, simulates the automaton on 
P,, via the map f(x) = {x,2m + 2 — x}. Note that for any configuration X on P,, 
and all ¢ > 0 o ( f( X))\(m + 1) = 0, by the symmetry of rule o. More generally, we 
have the following proposition. Define two involutions (reflections on the first and 
second coordinates) of P,,,, by F(x, y)*=(m—x+1,y) and F(x, y) = 
(x, — y + 1). For any configuration X on P,, ,, let X; denote the j-th “column” 
of X X= XN {1,...,.m} xX {i}, l<j<cn. 


PROPOSITION 1. The o-automaton on P,, ,, can be simulated by P,, 5,41. Hence 
A( Pin) <4 Pan2n+1) and the o-automaton on P,, 5,4, is reversible only if the 
o-automaton on P,, ,, is reversible. If in addition P,, is reversible then P,,, 5,41 is 


min m, 


reversible iff P,,, ,, is reversible. The same holds for rule o*. 
Proof. The map f: Cp —> Cp , defined by 


F(X) (x, y) = ay ify=n+] 


is a simulation. For example, o( f(X))(x, n + 1) = f(X)(x -— 1, 4+ 1) + 
f( Xx + 1,1 + 1) 4+ fCX)\(x, n) + fCX)(x%, n + 2) = X(x, n) + X(x, n) = 0. 
Hence the kernel of o on P,, ,,,,, contains the image of the kernel of o on P,, , 
under f. Clearly f( X) # 0 whenever X # 0 and the first claim follows. 

Now suppose P,_, is reversible under rule o. We will show the image of the kernel 
of op under f consists precisely of those configurations in the kernel of 


/ 


ify’=yory’=2n+2-y,l<y' <n. 


OP nant 
that are invariant under F,. Pick such a configuration Y in the kernel of 
OP nt 


that is invariant under F,. Then for all x, l1<x<m, Y(x,n) = Y(x,n + 2). 
Hence op (Y,,,) = 0 and Y,,, = 0 by our assumption that o is reversible on P,,. 
Set 


X= Y,U---UY,CP,,. 


Then X is in the kernel of o on P,, ,, and clearly f(X) = Y. It remains to show that 
whenever the kernel of o on P,,5,,, is nontrivial then there exists a non-trivial 
configuration in the kernel that is invariant under F,. Let Y # 0 be in the kernel of 
o. If Y is invariant under F, we are done, otherwise set Y’ = Y + F,(Y). Then Y’ is 
nontrivial, lies in the kernel of o and is a fixed point under F,. The argument for 
rule o* is entirely similar and will be omitted. a 
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Fic. 2. Extending initial configuration X, = {0} on P,, with rule o”. 


It follows from proposition 1 by induction on k that every (2* — 1) x (2% — 1) 
square grid is reversible under rule o*: the path on 2“ — 1 points is reversible (since 
2* — 1 #2 (mod 3)). In fact any r-dimensional hypercube of the form 

(24 —1) x (24-1) x--- x (2%—-1) 


is reversible under rule o*. The o*-game on any such hypercube therefore has only 
winning-pairs and the solution is always unique. 

We will now give a useful description of the kernel of o on the grids P,, ,. First 
note that a configuration X in the kernel of o is completely determined by its first 
column X,. To lighten notation, we write just + for the rule o on the path graph P,,. 
Then: 


X= id( X,) — 7°(X), 

X, = (X,) = 7 (X), 

X3 = T(X)) + x) = 77(X,) + 7°(X,), 
X, = 1(X;) + X, = 7°(X) 


and so forth. More precisely the sequence (X;: j > 1) of configurations is defined 
inductively by 


X,=1(X_,) + X,_. for j > 3. (3.1) 


For example letting m = 5 and X, = {2,3,5} we get successively: X, = {1,2, 3}, 
X, = {1,2,4,5}, X, = (3,4,5}, X; = {1,3,4} and X,=0. X, can clearly be 
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written as a polynomial in 7 applied to X;: 


X, — » Ch 7° T/(X). 
O<j<n 
The two-dimensional pattern of the coefficients c,, , displays self-similarity proper- 
ties. In fact, it has fractal dimension log ,3 and resembles the pattern obtained from 
a one-point seed on P., and rule o after a rotation. A picture of the latter can be 
found in Wolfram’s article. 
Now suppose we start with an arbitrary partial configuration X, in the first 
column. Define X,, X, and so forth as above and let 


X= X,X {1} UX, xX {2} U--- UX, xX {n} cP 


It is obvious from the definition of X; that for alll <i<m,1<j<n: o(X)GJ) = 
0. Thus X is in the kernel of o iff r(X,,) + X,_, = 0. Hence we have the following 
proposition. 


PROPOSITION 2. A configuration X' in Cp can be extended to a configuration X in 
Cp, such that X, = X’ and o(X)=0 in at most one way. The only possible 
extension is described by (3.1) and is successful iff X,,, = 9. 


It is easy to see that X,,, is a linear function of ¥- X’, hence d(P,, ,,) 1s the 
corank of the map X, > X,,,. Therefore d(m, n) < n and by symmetry 


d(P,,,) < min(m, n). (3.2) 


Let us define the following “checkerboard” configurations Z,,, on P,, ,, 
Ze mk +i-j,itj+1)=1 for i=0,...,m—k, j=0,...,k —1, 
and Z, ,,(x, y) ‘= 0 everywhere else. The following picture shows Z; 49. 


We claim that { Z, ,,|1 < k < m} isa basis of the kernel of o on P,,, ,,,. It is easy 
to see that o( Z, ,,) = 0. The configurations Z, ,, are linearly independent in Cg. 
But by (3.2) d(P,, ,,) < m and we are done. 

Using a simulation, we can now establish a lower bound on d(P,, ,,). To this end 
let p + 1 be a divisor of m+ 1 and n+ 1. We will simulate the p X p square on 
P,, »- Note that there is a tiling of P,, , by p X p squares that leaves gaps of width 1 
between the squares as shown below. The simulation f: P, , > Pin, Will place one 
copy of configuration X into each of the squares in the tiling. To insure that f is a 
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simulation some of the copies will be horizontal or vertical reflections of X. More 
precisely, the (i, 7)th square contains F} +! (™°¢ 2) Ft! (med2)( ¥)), Formally define f 
as follows: 


f(a, b) = ((a,+i-(p+1),b)+7-(p+1))|0 <i <img, 0 <j < no} (3.3) 


where m, := (m+ 1)/(p + 1), no = (m + 1)/(p + 1) and 


a= 


I 


a 1 even, 
p-—at1_ iodd; 


and similarly for b. The placement for the various copies of X is indicated in the 
next figure. 


m,n 


Now let (x, y) be a cell in the gap between the copies of X in P,, ,, say, 
x =pt+1. Then f(X)(x, y +1) =0 as (x, y + 1) also lies in the gap. By the 
definition of f we also have f(X)x +1, y) = f(X)(x — 1, y). Therefore, 
o( f(X))\(x, y) = 0. Similarly 


o( f(X))(a,+i-(pt1),a;,+j-(p+1)) = 0(X)(a, b) 


for all O<i<my, 0O<j<ny. Hence o( f(X)) = f(o(X)), showing that f is a 
simulation as claimed. Thus we have established the following lemma. 


LEMMA 3. For all m,n >1 the kernel of rule o on the m Xn grid P,, , has 
dimension at least gced(m + 1,n + 1) — 1. In particular the m X n grid is irreversible 
under rule o whenever m + 1 and n + 1 are not relatively prime. 


It remains to show that the lower bound of Lemma 3 is tight. To do this we must 
rule out the existence of configurations in the kernel of o other than the ones 
obtained from the simulation just described. In the special case n + 1 = k(m + 1) 
for some k >1 this follows immediately from (3.2): d( Py pomsiy-1) > 
gcd(m + 1, k(m + 1)) — 1 = m by lemma 3 and d(P,, p¢m4iy-1) < m = min(m, n) 
by (3.2). The general situation can be handled as follows. 

We proceed by induction on min(m,n). For m = 1 or n= 1 we are done by 
(2.3) as P, , and P, , are isomorphic to P,. So assume that min(m, n) > 1. As P,, , 
is isomorphic to P, ,, we may assume that n > m. In the case n = m we are done 


n,m 


by the preceding remark, so assume m < n. Let k = [(n + 1)/(m + 1)] and pick a 
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Fic. 3. The first half of configuration Z, 39 . 


configuration X in the kernel of o on P,, ,. Extend Y, := X, to a configuration in 
the kernel of o on P,, x¢m+1y-1- The simulation f defined in (3.3) shows that 


= 
foe) 
ewe” 
no) 
S 
so. 
N § 
+ ON 
W 
5 
A wv 
~ § 
ee 
2 & 


mod (2m +2) 
Yom —j 


[¥ 
Y 


Here we assume for convenience that Yp 


Y, : 


0. But by proposition 2 X, can be 


extended in at most one way to a configuration in the kernel of o on P,, ,, for any 


Y, for all j <n. Furthermore, (3.4) implies that X’ := xX, U X, 


U X,, lies in the kernel of o on P,, ,, where n’ 
n>2m-+ 2 and n’ := 2m — n otherwise. But n’ < m, so by our induction hypoth- 


esis d(P,, ,) 


n’. Hence X; 


LJ oe. 


n mod (2m + 2) for 


gced(m+1,n’+1)—1. Now gced(m+1,n’ + 1) = gced(m + 1, 


n + 1) and the theorem is proved. 


Since d(P,, ,) = gcd(m + 1,n + 1) — 1, it follows that every configuration in 


the kernel of op is of the form f(X), where f: P, , > P,,,, 18 the simulation in 


(3.3), p= ged(m +.1,n+1)-—1 and X is in the kernel of op 


Ps P 


We conclude by pointing out the major difficulty with giving a similar analysis of 
the rule o*. The evolution of a one-point seed configuration on P., according to 
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at Prn.m) 


Irreversible o*-automata on square grids P,,_,,, m < 100. 


(3.1) for rule o leads to a simple infinite checkerboard pattern. This is witnessed by 
the diamond shaped checkerboard configurations Z,, ,,. The same evolution for rule 
o*, however, generates the pattern shown in Figure 2. The pattern is substantially 
more complicated, in fact, it has fractal dimension log,(1 + /17) = 1.81. If one 
defines configurations Z,' ,, analogously to the Z, ,, configurations for rule o the 
result looks typically like Z;'3, as shown in our last figure. One can still prove that 
for every m there exists some n > m such that d*(P,, ,,) = m, but the least such n 
appears to depend on number-theoretic properties of m in a rather complicated 
fashion. In particular we are not aware of any simple description of d*(P,, ,,). The 
following table contains the values of d*(P,, ,,) for m < 100. The missing entries 
are 0. 


Acknowledgements. I would like to thank the referee for suggesting a number of improvements and 
pointing out several inaccuracies. 
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In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief descripition of known partial 
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University of Calgary, Alberta, Canada T2N 1N4. 


More on Mortality 


MELVEN KROM 
Mathematics Department, University of California, Davis, CA 95616 


MYREN KROM 
Mathematics Department, California State University, Sacramento, CA 95819 


A set of matrices is called mortal if the zero matrix can be expressed as a product 
of members of the set. For example, 


(ae) 
(ih Me 9) 


BS) “ENG ste 9 
0 O/\1 1/\1 1/\0 0 0 OO; 

Question 1. Does there exist an algorithm which decides for any finite set of 
2 X 2 matrices with integer entries whether the set is mortal? 

It is easy to find examples of mortal sets and examples of nonmortal sets of 
matrices. A natural question to ask is whether there is an algorithm to determine 
mortality. You might be able to answer the question in terms of basic properties of 
matrices and of integers by discovering, from examples, a systematic way of 
checking for mortality. Indeed, if we limit the question to lower triangular matrices, 
there is a simple algorithm to determine mortality. A finite set of lower triangular 
2 X 2 matrices is mortal if and only if, for each diagonal entry position there is a 
matrix in the set with a zero in that position. 

It is important to realize that there may not exist an algorithm as indicated in 
question 1. That is, there may not be a recursive procedure [1, p. 534], [2] for 
deciding mortality and we would say that the mortality problem is unsolvable [4, p. 
2], [2]. It is possible to use elementary means and a known result that some other 
problem is recursively unsolvable to verify that a mortality problem is unsolvable. 
This was done in [5] to show that the mortality problem for 3 x 3 matrices is 
unsolvable. Thus, if we change question 1 to refer to 3 X 3 matrices instead of 2 x 2 


is not mortal but 
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matrices the answer to the question is no. The proof in [5] of the unsolvability of the 
3 X 3 case consists of showing how to form, for each instance of a known 
unsolvable decision problem, a finite set of 3 X 3 matrices that is mortal if and only 
if the answer to the instance of the unsolvable decision problem is yes. For examples 
of known unsolvable problems that might be used in this way for the 2 X 2 case, see 
the many references given in [2]. 

Question 1 was first mentioned as an open problem in [6] where it is discussed in 
terms of transformations of the complex plane. In [3] question 1 is shown to be 
equivalent to question 2 below. Question 2 is a reformulation of the mortality 
problem as an “equality of entry” problem. We let c;, denote the entry in the ith 
row and jth column of the matrix C. 


Question 2. Does there exist an algorithm which decides for any finite set P of 
nonsingular 2 X 2 matrices with integer entries whether there is a product that is 
formed with members of P that is equal to a matrix C such that c,, = c,,? 

If we approach the mortality problem, question 1, by restricting it to lower 
triangular matrices then the problem becomes trivially solvable. But if we approach 
the equivalent equality of entry problem, question 2, by restricting it to lower 
triangular matrices then it becomes a nontrivial open problem, question 3. 


Question 3. Does there exist an algorithm which decides for any finite set P of 
nonsingular lower triangular 2 X 2 matrices with integer entries whether there is a 
product that is formed with members of P that is equal to a matrix C such that 
Co, = C9? 
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NOTES 


EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND RODICA SIMION 


Elementary Evaluations of {,°e ~ *" dx, (cos x? dx, and 
Je sin x* dx 


ROBERT WEINSTOCK 
Department of Physics, Oberlin College, Oberlin, OH 44074 


Less than a decade ago [1] there appeared a new method for evaluating 
J= [oe dx, 1 
i (1) 


long known to have the value yz /2. The cleverness of the method resides in the 
introduction of 
co € 
e(x) = f 


-». 1+? 
whose differentiation leads readily to 4/7 = 7. Since e(x) is defined by an im- 
proper integral, however, one cannot use the elementary Leibniz rule to compute its 
derivative: One must either apply a deeper theorem—e.g., [2]—or perform an 
ad hoc computation to form e’(x), as is done in [1]. 

The present note offers a modification of the method in [1] that borrows the main 
item from [1], but which avoids the immanent complication of having to differenti- 
ate an improper integral. 

One finds in [1] the evaluation of the Fresnel integrals by essentially the same 
method, mutatis mutandis, that is used there to evaluate (1); again an awkwardness 
intrudes through the need to differentiate improper integrals. The present note 
modifies the method so as to evaluate the Fresnel integrals while avoiding such 
complications. 

First, the evaluation of (1): In place of (2) we define 


—x(1+17) 


dt, (2) 


1 ex +t") 


f(x) = i ——— dt (3) 


1 +t? 


and note—along with the fact that f(x) is continuous and differentiable for all real 
x—that 


f(0) = arctanl = (7/4), f(co) = 0; (4) 


the latter because, when x > 0, 


— xr? 


1 @ 
0<f(x)= ex Tay dt < e~*(7/4). 


Application of the Leibniz rule to (3) gives, for x > 0, 


1 _ 2 _ | er) e* yx _.2 € 
, — _ x(1+1?*) — _ x xt —_— _ u —_ ___ 
f'(x) fre dt e fe dt Te J, e“ du Te 2( 
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where 


g(z) = few du. (6) 


Integrating (5) from 0 to oo, then making the substitution z = yx, we use (6) to 
obtain 


f (00) — f(0) = - [° ee) dx = -2[°e-“g(z) dz 


-2f"s'(z)a(z) z= [sO [s(m)). (7) 


Because of (1), (4), and (6), we read (7) as —(7/4) = —J’—i.e., J = ya /2. 
For evaluation of the Fresnel integrals 


oO oO 
F= / cos y* dy, G= | sin y* dy, (8) 
0 0 
we introduce, in analogy to (3), 


1 COS xt* 1 sin xt 


pier Ble 9) 


a(x) = 


We note— along with the fact that a(x) and B(x) are continuous and differentiable 
for all real x—that 


T 
a(0)=7, BO) = 0. (10) 
Furthermore, we show below that 
a(oo)=0, Boo) =0. (11) 


Applying the Leibniz rule to both integrals in (9), we use algebraic rearrangement 
to obtain a pair of linear differential equations 


a(x) — B(x) = —q(x), B(x) + a(x) = p(x), (12) 


where, for x > 0, 


: 2 1 pe >, 2 

p(x) = J cos xt* dt = =f cos y* dy = Te ulvx) a3) 
1 l py 1 ° 

q(x) = i sin xt* dt = ze ft sin y*dy = Tvs) 


and, 
u(z) = [cos ydy,  ov(z)= [sin y dy. (14) 
0 0 


We treat (12) as follows: (i) multiply the first by —cos x, the second by sin x, 
then add the products; (ii) multiply the first by sin x, the second by cos x, then add. 
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The two results can be written, respectively, 


d 
(i) Gy (B(x) sin x — a(x) cos x} = p(x) sin x + q(x) cos x 
(15) 


d 
(ii) 7a (B(x) cos x + a(x) sin x} = p(x) cos x — q(x) sin x 


Integrating (15, i) and (15, 11) from 0 to 00, we obtain, on application of (10) and 
(11), 


[[e(s) sin x + q(x) cos x| dx = 7 [° (p(x) 008 x — q(x) sin x] dx = 0. 
(16) 


Now, using (13), the substitution z = ¥x, and (14), we rewrite (16) as 
7 = 2 f° [u(z) sin 2? + v(z) cos z?| dz = 2f°[u(z)o"(2) + v(z)u'(z)] dz 
= 2 f° lu(2)o(2) dz = 2FG, (17) 
0 = 2 [u(2) cos 2” ~ y(z) sin z?] dz = 2 f° [u(2)u'(2) — y(z)v'(z)] & 
= [E(w - [o(z))P}a=P-G (18) 


—with further help, in both cases, from (8) and (14). 
To show that G is positive (and with it F, by (17)), one substitutes y = ¥x in (8) 
and writes 


re 1) Sin x 


oo SIN X 1 2 
G=-] —=d=- dx 
5, vx 7» 1k vx 


—obviously an alternating series of diminishing terms whose k“ term approaches 0 
as k — oo and whose k = 0 term is positive. It follows that G > 0. From (17) and 
(18), therefore, F = G = (1/2)yz /2. 

To prove the second in (11), we substitute y = xt? in (9) (x > 7) 


sin y 


B(x) = =f bi+o/” (19) 


We partition [0, x] into the subintervals [0, 7], [7,27], ...,[n, x] for the appropri- 
ate positive integer n, and write (19) as a sum of integrals over the subintervals; this 
obviously results in a sum of diminishing alternating-sign terms, of which the first is 
positive. Thus, for x > 7, 


1 spa sin y rsiny 
<0) <a Bi onT® aes 


whence 8(co) = 0. 
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It is almost as obvious that a like partition of the interval [0, x] for evaluation of 
1px cos y 

a(x) = —= | ——— 

aed vy [1 + (y/x)] 


leads to the following bracketing of a(x) when x > a: 


0 loon cos y 1 
<alx) < 35 J Wy + Gx) © 
i n/2 cos y i 1/2COS y 
<ord, Bit or’ < aed vy” 


—whence, also, a(oo) = 0. This completes the proof of (11). 


A remark on personal motivation. The use of 1 as upper integration limit in (3) 
was not my original choice: Given (2) and wishing to avoid differentiating an 
infinite integral in a classroom presentation of the method of [1], I tried using 
arbitrary r > O as upper limit in (3), with the intention of r — oo eventually. The 
device worked quite simply; but I later noticed that setting r= 1 makes the 
calculation simpler vet. 

Thanks go to the (second) referee for suggesting that I try to evaluate the Fresnel 
integrals (8) by means of a finite-integral modification of the device used in [1], and 
for calling my attention to yet another recent evaluation of them [3]. (Only J was 
evaluated in the note I first submitted.) 
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The L* Norm of a Polynomial with Coefficients +1 


DONALD J. NEWMAN AND J. S. BYRNES, 
Prometheus, Inc., 103 Mansfield Street, Sharon, MA 02067 


A classic unresolved question regarding polynomials 


n—-1 
P(z)= )/ e,(n)z* 

k=0 
with coefficients ¢,.,, = €, = +1 is whether the maximum modulus of such a 
polynomial on the unit circle can be n'/* + o(n'/*). As shown by Kahane [3], if 
complex coefficients of modulus 1 are allowed, then not only is it possible for this 
property to be satisfied, but the minimum modulus can be n!/”? + 0(n!'/”) as well. 
Specifically, Kahane proved that for any n there is a polynomial of degree n with 
coefficients of modulus one whose modulus everywhere on the unit circle is 
n'/? + 0(n?/" log n). 
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Erdés [2] had conjectured the existence of a c > 0 such that, for any polynomial 
P of the types described, ||P\|,, > (1 + c)n'/?. Clearly the Kahane result disproved 
this conjecture for the modulus 1 case, but the situation for coefficients +1 remains 
open. Employing an elegant construction Shapiro [6, 5] demonstrated the achievabil- 
ity of the order of magnitude n'/*, but the maximum modulus of the Shapiro 
polynomials is about (2n)!””. 

Since the LZ? norm of any such polynomial is n'/*, by the Parseval Theorem, and 
the L’ norm for any p < oo is a lower bound for the maximum modulus, it is 
natural to look at the ZL? norm for some p > 2. Motivated by these considerations, 
as well as the inherent tractability of the L* norm, we examine the L* norm of such 
polynomials. As one might expect, this leads to several interesting combinatorial 
questions. We provide answers to some of these, and conclude with a refined version 
of the Erdés conjecture. 

Throughout the note n will be a positive integer, P(z) will denote the previously 
indicated polynomial of degree n — 1 with coefficients +1, and z will lie on the 
unit circle, z = e779 0 <6 <1. All integrals will be over 6 € [0,1]. We begin with 
a lemma. 


LEMMA. 
4 
WPilts= = 8 8 Ey. 
Jt+k=l+m 
O<j,k,l,m<n 
Proof. 


|Pllts = [\P(er*) [a0 


n-1 n-l n—-1 n—-1 

=| y €,2/ | yy e,2* |] ¥ ez "|| db e,27” 
j=0 k=0 1=0 

= | constant terms. 


Since a constant term occurs in this product if and only if 7 + k = 1+ m, the result 
follows immediately. 

Of interest is the expected value E(||P\\7«), if the coefficients ¢ ; are chosen at 
random. 


THEOREM 1. E(||P{|44) = 2n? — n. 


Proof. Clearly if exactly 3 of the indices j, k, /, m are identical, or if at least 3 of 
them are different, then E(e,- ¢,- &-€,) = 0. Therefore, it follows from the 
lemma that 


E(\|Pilt+) = FEE Em: (1) 


j=landk=m 
or j=m and k=/ 


For each of the n(n — 1)/2 pairs of integers p, gq, 0 < p < q <n, there are 4 


terms appearing in (1), €,€,£,€, being typical. The only other terms in (1) are e*, 
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0 <p <n. Since all of these terms equal 1, 
E(||Pll¢+) =n + 4(n)(n — 1)/2 = 2n? — n, 


completing the proof of Theorem 1. 

We now observe the improvement that is achieved when this random choice is 
replaced by the Shapiro coefficients. Shapiro’s polynomials are defined, together 
with his auxiliary polynomials Q, by the recurrence formulas 


P,(z) = Qo(z) =1, 
Pr+ilZ) = Py(z) + 27°O,,(z), 
Onsilz) = P,(z)-270,,(z), m>0. (2) 
For the interested reader we point out articles by Brillhart et al. [1] and Mendés- 
France and Tenenbaum [4]. The former shows that the Shapiro coefficients can be 


defined directly from the binary representation of the order of the polynomial, while 
the latter relates them to paper-folding sequences. 


THEOREM 2. If n = 2* and P(z) is the Shapiro polynomial of degree n — 1, then 
Pllis = (4n? — (~1)*n) 73. 
Proof. It follows directly from (2) that 
Pn+i(Z) | +lQmsi(Z) = 2(|Pn(z) 1° +1Qn(Z) |) 
so that, as observed by Shapiro, 


[P(z) |) +|Qn(z)/ = 2", (3) 
Now (2) and (3) yield 


[Pnii(z)| = 2"*! + 2Re(2”0,,(2)P,(z)). (4) 
Next we observe that z7Q,,P,, is composed solely of frequencies which are 


positive powers of z, so that it can be thought of as OP» where P,, is the 
“reversed” polynomial of P. Thus 


[lPnail’ = 2424 4 [(ReQ,,P,,) 


Since f = QP is analytic and 0 at the origin, 


0= fs? = Ref [?] = f(Res)?- farms)’ 


so that 


[Ref = 4 flF =F fll Pal? = 4 PrlIQnl?- 


Altogether then, we have 
[lPnvilt = 20? + 2 f[P,12(2"*1 — [Ppl”) 


_— J2mt+3 _ 2 [IP al*. (5) 
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The remainder of the proof is now simply induction on k. The result is obviously 
true for k = 0, since P)(z) = 1. Furthermore, from (5) and the inductive hypothesis 


q2k+2 _ (- 2) k 
4 FN 
fi P,,| 3 ’ 
it follows that 
q2kt+2 _ (—2)* QUk+1)+2 _ (—2)**? 
rr rr 
as required. This completes the proof of Theorem 2. 


[lPevsl* = 2°k+3 — 2 


Note that Theorem 2 implies that the L* norm of the (n — 1)st-degree Shapiro 
polynomial is asymptotic to yn times the fourth root of 4/3 ~ 1.07457/n. Based 
upon extensive numerical evidence employing the Bose-Einstein statistics methodol- 
ogy of statistical mechanics, we conjecture that the Shapiro polynomials do not give 
the minimum L* norm among all polynomials of the same degree with coefficients 
+1, but that this minimum L‘ norm is asymptotically yn times the fourth root of 
6/5 = 1.04664y/n . Observe that the truth of this conjecture would imply that of the 
Erdés conjecture mentioned earlier, with c = (6/5)'/* — 1 = .04664. 


Research sponsored by the Air Force Office of Scientific Research (AFSC), under Contract 
F49620-—86—C—0088. 
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Isoptic Characterization of a Circle 
(Proof of a Conjecture of M. S. Klamkin) 


JOHANNES C. C. NITSCHE 
Department of Mathematics, University of Minnesota, Minneapolis, MN 55455 


Suppose that a smooth convex curve C has the property that as we walk around 
a certain circle K that contains C in its interior, the curve C always subtends the 
same angle. Can we conclude that C is a circle? We cannot [1], but Murray Klamkin 
[2] has conjectured that if there are two concentric circles K, and K, with respect 
to which C has this “isoptic” property, then C must indeed be a circle. In this note 
we prove his conjecture. 
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The locus of intersection of tangents to a smooth convex curve C meeting at a 
constant angle, say 7 — f, is an isoptic of C; if B = 7/2, the isoptic is called an 
orthoptic. 

Isoptics of a circle are circles. As J. W. Green [1] has shown, the converse of this 
statement is only true if 8 is an irrational multiple of 7 or if 8B = (m/n)1, where m 
and n are relatively prime positive integers and m is even. For example, the circle 
x? + y* = a* + BD’ is an isoptic for the ellipse (x/a)* + (y/b)* = 1 corresponding 
to the angle B = 7/2: 

Let C have two concentric isoptic circles of radii R, and R, > R,, with center at 
the origin, and corresponding angles 8, = (m,/n,)7, B, = (m,/n,)7, where 0 < 
B, < B, < a. For j = 1,2, the integers m, and n, are relatively prime, and m, is 
odd. It is easily verified that the origin must be an interior point of C. Denote by 
h(@) the Minkowski supporting function of C, chosen with respect to the origin. 
This function measures the distance between the origin and the unique oriented 
tangent to C which forms the angle 6 + 7/2 with the positive x-axis. Its equation is 
x cos @ + ysin@ = h(@). The curve C appears as the envelope of its tangents and 
can be represented in the form x = h(@)cos@ — h’(@)sin 8, y = h(@)sin@ + 
h'(8)cos 6. The curvature of C turns out to be [h(@) + h’(8)]7?. 

We have 0 < h(@) < R,, and the following relations are read off from Figure 1: 


Arc cos|h(0)/R,| + Arccos| h(6 + B,)/R,| = B,, j =1,2. (1) 


hi ~ By —-—»\. 


Fic. 1. 


Here Arccos stands for the principal branch of the inverse function cos~!. Advanc- 
ing the argument in (1) by f, and subtracting the resulting relation leads to 
h(@ + 2B) = h(@). Thus h(@) has the periods 28, and 2£,, in addition to the 


Thus, a denizen of Flatland in Edwin Abbott’s charming story, walking by chance around this circle, 
would be unable to distinguish the shape of the ellipse from that of the circle x? + y? = (a* + b”)/2, 
because both would appear to him consistently under the visual angle 7/2. If this denizen were 
enlightened, i.e., cognizant of our theorem, he could settle the question by repeating his angular 
measurements from the points of a second circle. 
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period 27. There are integers a,, b; for which a,;m, + bn, = 1, so that 
a ,2B, + ba = a(m,/n,)27 + ba = 2a(a,m, + bn,)/n, =20/n, 


are periods of h(@). As the m, are odd, then h(@+ B;)=h(@+7/n,), and 
relation (1) becomes 


Arc cos| h(8)/R,| + Arccos| h(6 + a/n,)/R,| = B, j=1,2 (2) 
or, equivalently, 
h?(0) + h*(6+ n/n;) — 2cosBh(O)h(O+a/n,)=Ri sin’ B, jf = 1,2. 
(3) 


Let d= (n,,n,) be the g.c.d. of the denominators and set n, = »,d. There are 
integers a and b for which by, + av, = 1, so that (2a7/n,)a + (Qa/n,)b = 22(ar, 
+ bv,)/dv,v, = 22/N, with N = v,v,d = nv, = n,v,. Therefore the number p = 
2a/N is a period of the supporting function h(@). 

Observe that 7/n, = pv,/2 and 7/n, = pr,/2. It follows that h(@ + a/n,) = 
h(@) if v, is even and h(@ + 7/n,) = h(@) if v, is even. In either case, (2) implies 
that h(@) = const., i.e., that C must be a circle. If »; = 2k, + 1 and vp, = 2k, +1, 
then 7/n, = k,p + p/2,a/n, = k,p + p/2, and subtraction of the two relations 
(3) gives 


2[cos B, — cos B,|h(0)h(6 + 7/N) = R3 sin’ B, — Ri sin’ B, 


or, since 0 < B, < B, < 7, h(@)h(6 + w/N) = const. In conjunction with (3), this 
fact implies h(@) = const., so that, again, C must be a circle. 


Remark. It might be of interest to consider similar questions in higher dimen- 
sions. 
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Two Theorems on Singular Values and Eigenvalues 


Roy MATHIAS 
Department of Mathematical Sciences, Johns Hopkins University, Baltimore, MD 21218 


We present elementary proofs, based on the eigenvalue interlacing property of 
Hermitian matrices, of two theorems relating the singular values and eigenvalues of 
a square complex matrix. 

Let M,, , denote the set of n-by-m complex matrices and set M, = M,, ,. For a 
given A =[a,,] © M,, denote its eigenvalues by {A,(A)}, always ordered in de- 
creasing modulus 


|A,(A)| > |A2(A)] > +> > 1A, CADL- 
The function p(A) = |A,(A)| is the spectral radius of A. 
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We define the singular values of A&M by 0,(A) = [A,(A*A)]}7, i = 


m,n 
1,...,min{m,n}. Note that A*A is positive semi-definite and because of our 
ordering convention for eigenvalues, 

0,(A) > 0(A) > +** > Oningm,njy(A) > 0. 


Since the eigenvalues of a matrix are a similarity invariant, it follows that the 
singular values of a matrix are invariant under unitary equivalence, i.e., for any 
unitary U and V, 


@,(UAV) = [d,((UAV)*(UAV))]'” = [A(VtAtAV) |? 
= [A,(4*4)]'? = (A), 


Notice that if A € M, is singular, then so is A*A, and hence 0 = [A,,(A*A)]!/? = 
6,(A). The function |||A|||, = 0,(A) is called the spectral norm of A; it is the 
operator norm of A induced by the usual Euclidean norm ||x||, = (x*x)!/? on C”, 
i.e., |l|Alllp = max{||Axl|2: x ©”, |x|], = 1). 

There are two well-known relations between the spectral norm and the spectral 
radius: 


WlAlll, > (A) (1) 
‘lim AI" = (A), (2) 


Both are easily proved, and both are true more generally for any matrix norm, not 
just the spectral norm (see Chapter 5 of [3)). 

We are interested in two generalizations of these relations that involve eigenval- 
ues and singular values other than the largest. 


THEOREM 1 (Wey] [5]). Let A € M,. Then 
0,(A)o,(A) --- o,(A) > [A(A)A2(A) -+- ACA) fork =1,...,2. (3) 


THEOREM 2 (Yamamoto [6]). Let A € M,. Then 
lim [o,(A”)]'”" = |A,(A)] fork =1,2,...,n. (4) 


Notice that the case k = 1 in these theorems is just (1) and (2), and that the 
inequality in Theorem 1 is actually an equality for k = n since both sides are equal 
to |det A| in this case. Consider the case k = n in (4): If A is singular, then so is 
A™ and o,(A”) =A,(A”) = 0 for every m=1,2,.... If A is nonsingular, the 
assertion follows from applying (2) to 4~', since o,(A~') = 1/0,(A) and A,(A7') 
= 1/\,(A). 

These two theorems can be deduced easily from (1) and (2), respectively, by using 
the notion of a compound matrix, and this is how they are proved in [5] and [6]. 
Other proofs of Theorem 1 are known that do not involve the compound, e.g., [2] 
and Lemma 3.3 in Chapter 2 of [1], but there seems to be no other proof of Theorem 
2 available. We shall show that both theorems are easily deduced from Lemma 3, 
which follows directly from the interlacing property for eigenvalues of a Hermitian 
matrix. 


LEMMA 3 (Interlacing property of singular values). Let A € M,, be given, and let 
A,EM (respectively, A, © M ) denote a submatrix of A obtained by 


n,n—p n—p,n 
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deleting any p columns (respectively, any p rows) from A. Then 
o,(A) >0,(A,) >6,,,(A) fori=1,2,...,n— p. (5) 


Although deleting rows or columns of a matrix may decrease its singular values, 
the next lemma shows that this does not happen when zero rows or columns are 
deleted. 


LEMMA 4. Let A&M, be given, and consider B =[0|A] € M, 
= 4 eM 


0 n+p,n 


A. Then 


and C 


obtained by adjoining p columns (respectively, p rows) of zeros to 


»n+p 


0,(A) =6,(B) =a,(C) i=1,2,...,n. 


For k © {2,..., — 1} it is convenient to denote by A,,, the k-by-k submatrix 
of A obtained by deleting its Jast n — k rows and columns; A/,) © M,,_;,41 denotes 
the submatrix obtained by deleting the first k — 1 rows and columns. Notice that 
the entry a,, appears in both A,,, and A,,). 

We can now prove the two theorems. Since each is known already for k = 1 and 
k =n, the proofs will be given for an arbitrary k € {2,...,n — 1}. 


Proof of Theorem I. The Schur triangularization theorem permits us to write 
A = UTU*, where U & M, is unitary and T = [7;,] © M,, is upper triangular with 
t,;, = A,(A) =A,(T). Moreover, o,(T) = o,(A) by unitary invariance and Lemma 3 
guarantees that o,(7T) > o,(7|,,). Thus, 


o,( A) mv o,(.A) = o(T) mot o,(T) 2 5, (Tyx)) ms 6,.(Tx)) = |det Tea | 
— Aa (Try) a (Ty) — |A,(T) i A(T )| 
= |A,\(A) +--+ A, (A). 


Proof of Theorem 2. Write A = UTU* as before, so that A” = UT”U*. The 
upper triangular structure of T ensures that (T),,)" = (T"");,, and (Ty,))”" = (T)%»: 
Thus, the same reasoning used in the proof of Theorem 1 gives the lower bound 


0,(A") = 0,(T") > o,((T”)ta) = o,((Ttx)”). 


To obtain an upper bound, we use the second inequality in (5) (with i = 1 and 
p = k — 1) and Lemma 4 to write 


6,(A”") = 0,(T”) = 014«-y(T”) < 0,((T”) xy) — o((Tyx))”). 
Thus, we have the two-sided bounds 


lo. ((Tuy)”)) < [o,(A”)] 7” < |o,((Tu)”)| 


By the two cases of Theorem 2 we have already established (limits involving the 
smallest and largest singular values of powers of a square matrix), we know that 


lim lo.((Tre)”)] 7” = Ac (Tra) 


mo 


1/m 
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and 
1/m 


lim |o,((Z4)”)| 


m— 00 7 Mi ( Tees I 
But since A,(7{,)) = A,(T) = A, (A) and Ay(T,,)) = A,(T) = A;,(A), we are done. 


For more information on these and similar inequalities see Chapter 2 of [1], 
Chapter 3 of [4] and Sections (7.3—4) of [3]. 


Acknowledgement. I am grateful to Prof. R. Horn for his encouragement and assistance in the 
preparation of this paper. 
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THE TEACHING OF MATHEMATICS 


EDITED BY MELVIN HENRIKSEN AND STAN WAGON 
A Decomposition Theorem for Finite Automata 


TERESA L. SANTA COLOMA AND RALPH P. TUCCI 
Department of Mathematical Sciences, Loyola University, New Orleans, LA 70118 


Automata theory is an important branch of theoretical computer science, and the 
Krohn-Rhodes theorem [1] is one of the best known theorems in this area. It states 
that the semigroup of a finite automaton can be covered by a wreath product of 
semigroups which are either finite simple groups or semigroups of the form U’, 
where U is a right zero semigroup containing two elements and U’ is the semigroup 
consisting of U with identity adjoined. This type of decomposition is useful, for 
example, when examining decompositions associated with VLSI (see [4]). Unfortu- 
nately, undergraduate students and even beginning graduate students usually do not 
have either the mathematical background or the mathematical maturity to under- 
stand a proof of this theorem. In this paper we present a decomposition theorem 
that is easier to understand than the Krohn-Rhodes theorem and could serve to give 
a student the flavor of the Krohn-Rhodes theorem without requiring an extensive 
technical background. For proofs of the Krohn-Rhodes theorem, the interested 
reader is referred to [1] and [3]. We assume the reader is familiar with the basic 
theory of semigroups as presented in [2]. 


DEFINITION. A finite state automaton (or machine) M is an ordered triple 
M = (Q, 3, F’) where Q is a finite set called the set of states, > 1s a finite set called 
the input alphabet, and F is a partial function F: Q X X > Q. 


We think of the machine as reading a string of symbols, one at a time, from the 
input alphabet =; when the machine is in state q reading symbol o, then the 
machine goes to state F(g, a). Each element o € & induces a partial function F:: 
Q — Q defined by F,(q) = F(q, 0). Sometimes we will abbreviate F(q, 0) by qo. 

Automata are often represented by directed graphs with labels on the edges. For 
example, suppose that the flowchart in FIGURE 1(a) represents a program that 
branches to statement A or to statement B depending on whether its input is a or 
b. Then the automaton in FIGURE 1(b) models this behavior. 


READ SYMBOL 


SYMBOL = a 


(a) (b) 


Fic. 1. 
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As another example, suppose the flowchart in FIGURE 2(a) represents a program 
which keeps reading b’s until it reads an a, at which point it executes statement A. 
The automaton in FIGURE 2(b) models this behavior. 


READ SYMBOL 


(a) (b) 


FIG. 2. 


Let =* denote the set of all finite strings over >. Then we can extend the 
definition of F from elements of > to strings over > in the obvious manner; i.e., if 
w © X*, and w = ow’ for some o € 2 and w’ € 3*, then F(qg, w) = F(F(q, ©), w’) 
for all g © Q. As before, each string s over %* induces a partial function F.: 
QO — Q defined by F.(q) = F(q, S). 


DEFINITION. Let M = (Q, 2, F') be an automaton, and take =* to be a semi- 
group under concatenation. Define the congruence = on >* by saying that if s, 
s’ © &*, then s =s’ if and only if gs = qs’ for all g € Q. The semigroup of the 
machine M, denoted S(M) or simply S, is the semigroup =*/= . 


Note that S is a semigroup with identity, where the identity of S is induced by 
the empty string. Also note that we could define the congruence = by saying that 
s = s’ if and only if s and s’ induce the same partial function on Q; Le., s = s’ if 
and only if F. = F... Further, if s, s’ © %* and if the partial function F,o F., 
induced by the string ss’ is undefined, then we adjoin a zero element 0 to S and 
define ss’ = 0. (We can think of 0 as inducing a partial function on Q with domain 
@.) 

In this paper we decompose the semigroup of an automaton into simpler 
semigroups according to the next two definitions. 


DEFINITION. Let M, = (Q, >), F,) and M, = (Q, 25, F,) be two automata with 
the same state set. Then M, covers M,, denoted M, < M,, if there is a 1-1 
function g: =, — , such that, for every o, € 2, and g € Q, either F,(q, o,) is 
undefined or F,(q, 0,) = F;(q, 0,) where o, = g(o,). The function g is said to be a 
covering of M, by M,. (The reader should note that this definition of covering is not 
standard; see [3, p. 43].) 


This situation is best described by the commutative diagram in FIGURE 3, where 
g(o,) = 6, and where J denotes the identity function on Q. The expression (J, g) 
denotes the function from Q X22, to QxX 2, defined by (J, g)\(q, 0,) = 


(I(q), g(6,)) = (4, 62) for all (q, 0,) © Q X 2). 
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CI, g) 
(4,6) —e ee SC; ) 


F, 


qo = qo 


FIG. 3. 


Intuitively, we can say that M, covers M, if M, can do everything that M, does. 
For example, in FIGURE 4 M, covers M,; the covering g is the function from the 
alphabet of M, to the alphabet of M, given by g(a) = c and g(b) = d. Note that 
F,(0, b) is undefined in M,, while F,(0, g(b)) = F,(0, d) = Lis defined in M,. 


a a Cc Cc 
d 
0 , ; +—_ 
M, M, 
FIG. 4. 


DEFINITION. Let M, = (Q, >,, F,) and M, = (Q, 25, F,) be two automata with 
the same state set and disjoint alphabets. The alphabet direct sum of M, and M,, 
denoted M, @ M,, is the machine M = (Q, 2, U 2, F) with F(q,0) = F(q,¢) 
for all g © Q, where j = 1 or j = 2 depending on whether o € 2, oro € 3). 


We are now ready to state the decomposition theorem. This theorem shows how 
we can cover the semigroup of a machine by a group with zero and a semigroup 
with no units except for the identity. 


THEOREM. Let S be the semigroup of an automaton M. Then there are automata M, 
and M, such that M < M, ® M, and (1) S(M,) is a group with zero which is a 
homomorphic image of S(M); (2) S(M,) is made up of a subsemigroup of S(M) 
which contains no units, together with an identity element. 


Proof. Let S = S(M), and let J = {s © S|F, is not an onto function}. Then J 
is a two-sided ideal of S (which therefore contains no units) and S/J is a group 
with zero. Take M, = (Q,S//, F,); for each gE Q and JE S/J, s €J, let 
F,(q, sJ) = F(q, s), and let F(q, J) be undefined. Take M, = (Q, J, F,); for each 
g€Qandse€J let F,(q, s) = F(q, s). Define the function g: S > (S/J) UJ by 
g(s)=sJ ES/J if s€éJ, and g(s)=s EJ if s © J. It is now easy to see that 
M, ® M, covers M. Let s © S; if se J then F(q, s) = F,(q, s) and if s € J then 
F(q, s) = F,(q, sJ). The other statements in the theorem follow immediately. 


Example. Let M be the machine in FIGURE 5. The semigroup of M is S = 
{1, a, b, ab, ba, aba,0} where a” = 1, b* = b, and bab = 0. The ideal J as defined 
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in the theorem is J = (b) = {b, ab, ba, aba,0}. The machine M, in FIGuRE 6 has 
semigroup S(M,) = {1, a,0} = Z, U {0} which is a group with zero, the machine 
M, has semigroup S(M,) = J‘ (the semigroup J with identity adjoined) where J 
contains no units, and M, ® M, covers M. Note that the semigroups of M, and M, 
are simpler than the semigroup of M, even though the diagrams are a bit more 
complicated. 


b 
<1 


a 


FIG. 5. 
lJ lJ 
b aba 
aJ ba 
0 ——*, ——_—_—_—__> 
4——____—. 0 <—-] 

aJ ab 

FIG. 6. 
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A Counterexample in Galois Theory 


THOMAS W. HUNGERFORD 
Department of Mathematics, Cleveland State University, Cleveland, OH 44115 


A typical first-year modern algebra course culminates in the proof of Galois’ 
solvability criterion: 


THEOREM. Let F be a field of characteristic 0 and f(x) © F[x]. If the equation 
f(x) = 0 is solvable by radicals, then the Galois group of f(x) is a solvable group.” 


*The converse is also true, but its proof is often omitted because this is all that’s needed to 
demonstrate the insolvability of the quintic. 
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The proof involves a subtle point that is often missed by students and, alas, by 
the authors of several current modern algebra texts. In this note we present a simple 
example that shows why a certain technical lemma is necessary for a valid proof. 

To understand the difficulty, we first review the usual proof. By the definition of 
“solvable by radicals,” the splitting field K of f(x) is contained in a radical 
extension L of F, that is, there is a tower of fields 


FC F(u,) C F(uy, uy) © +++ C F(u, u,...,u,) =L, (*) 
such that K C L, 
ut? © Fand u” © F(u,,...,u;_,) foreachi > 1. 


The next step is to adjoin all the n,th roots of unity. They can be adjoined one at a 
time, or in a single step by letting n be the least common multiple of n,,..., , and 
adjoining a primitive nth root of unity 8B. The powers of £ include all the n,th roots 
of unity. Since B” = 1 © F, the tower 


FC F(B) C F(B,u,) C F(B, u, uy) © +++ CF(B,u, uy,...,u,)=E (**) 


is also a radical extension of F. Furthermore, 

1. F(f) a normal extension of F (it is the splitting field of x” — 1). Its Galois 
group is abelian.** 

2. Each F(,u,,...,u;) is a normal extension of F(B, u,,..., u;_,) (it is the 
splitting field of x”: — u?'). Its Galois group is abelian.** 

Under the Galois correspondence, the sequence of fields (**) corresponds to a 
sequence of groups 


G2G,2G6,2G,2 ::- 2G,= (1), 


in which G is the Galois group of E over F and each G; is normal in the preceding 
group. The Fundamental Theorem shows that the quotient G/G, is isomorphic to 
the Galois group of F(£) over F (and hence is abelian by 1) and each quotient 
G,/G,_, 1s isomorphic to the Galois group of F(B, u,,..., u;) over F(B, u,,..., U;—1) 
(and hence is abelian by 2). Therefore G is solvable. The proof is completed by 
showing that the Galois group of K over F (the Galois group of f(x)) is a 
homomorphic image of G and hence solvable.*** 

And now the sticky point: for this proof to be valid, the field FE in (**) must be a 
normal extension of F (otherwise the Fundamental Theorem and the Galois 
correspondence can’t be used). Most students tacitly assume that the construction of 
E given above guarantees its normality over F. Unfortunately, this may not be the 
case, despite some implicit and explicit statements to the contrary in several texts. 

To see why £ need not be normal over F, consider the tower 


Q < Q(/2i) < Q(v2i, V2(1 - a)) = Z. 


L is a radical extension of the field Q of rational numbers because 
(V2i) = -2€Q and 
2 
(v2 (a — i) | = —2/2i € Q(y2i). 


**See [4] or [5] for a proof of 1 and [3], [4], or [5] for a proof of 2. 
***See [3] for details. 
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This is just the tower (*) above with F = Q, t= 2, u, = v2 i, u, = y2(1 — 1), 
n, = 2=n,, and n = Iem(2, 2) = 2. To obtain the tower (**) we must adjoin a 
primitive 2nd root of unity. But in this case Q already contains the primitive 2nd 
root B = —1. So Q = Q(B) and the tower (**) coincides with (*): 


Qc Q(v2i) ¢ Q[v2i, V2(1 - i) =E. 


Furthermore, the dimension [E:Q] is at most 4 because (V2i)? <Q and 


(V2 (1 — i))? © Q(y2i). 


We now show that E is not normal over Q. You can readily verify that 
+ y2(1 —1i) and + y2(1 + i) are the roots of p(x) = x*+ 8, from which it 
follows that p(x) is irreducible in Q[x]. E contains the root u = y2(1 — i) of 
p(x). If E were normal over Q, p(x) would split in E[x] and the root v = v2(1 + i) 


4 
would be in E. Hence E would contain both v/u = i and (v — u)/2i = y2 and we 
would have 


4 
But x* — 2 is the minimal polynomial of ¥2 over Q, and x* + 1 is the minimal 


4 
polynomial of i over Q(y2 ) (all of whose elements are real). Consequently, 


[£:Q] > [Q(12,:): 9] = [e(2. 1): a(72)|[e(i2):Q] =2-4=8. 


This contradicts the fact that [EZ : Q] < 4. Therefore E is not normal over Q. 
Once noted, the sticky point is easily taken care of by starting the proof of the 
Theorem with this! 


LEMMA. If F has a radical extension, then F has a normal radical extension. 


Then one can assume that L is normal over F in (*), from which it follows easily 
that E is normal over F in (**), so that the proof outlined above is valid. 

The Lemma (or an equivalent) appears in the graduate texts [4], [5] and [6], as 
well as in undergraduate texts, such as [1] and [3] (where the proof is a starred 
exercise). The problem is avoided in [2] and [7] by restricting the discussion to 
normal radical extensions. 

But in modern algebra texts published since 1986 (and a few earlier ones), the 
reader should be careful. Even when the Lemma is stated, its proof may be faulty. 
Most of these books present either a fallacious proof of the Theorem or Lemma 
(counterexample above) or a suspiciously incomplete proof. 

The incomplete arguments typically say “adjoin all necessary roots of unity,” but 
give no proof that this guarantees the normality of E over F. In this case, depending 
on the interpretation of “necessary,” the example above may not be a counter- 
example (if you adjoin the 4th root of unity i between Q and Q(y2 i), the resulting 


top field in (**) is QQ, 12), which is normal over Q). 
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A Simple Algorithm for Finding Eigenvalues and Eigenvectors 
for 2 X 2 Matrices 


TYRE A. NEWTON 
Department of Mathematics, Washington State University, Pullman, WA 99164 


The solution of the 2 xX 2 matrix eigenvalue problem is of considerable signifi- 
cance in linear algebra, and is basic in linear systems theory. Moreover, this problem 
can arise whenever you have a nonlinear 2nd order system, whether it is composed 
of algebraic, differential, or difference equations. A technique that is commonly 
used in the nonlinear case is presented in [1, Chap. 9] where we find at each critical 
point of the system an associated matrix with real elements, namely, 


a b 
A= ( *). (1) 
For each such matrix we determine its eigenvalues and corresponding eigenvectors. 

The traditional procedure for solving this eigenvalue problem is to first solve the 
quadratic characteristic equation of A for the eigenvalues and then for each 
eigenvalue solve a homogeneous system of linear equations to get the corresponding 
space of eigenvectors. In the method presented here, we solve a different quadratic 
equation related to A, and from its roots determine immediately both the eigenval- 
ues and corresponding eigenvectors. 

To facilitate this, we say that if A is an eigenvalue and v is a corresponding 
eigenvector of A, then the pair [A,v] is an eigenpaar of A. Since any nonzero 
multiple of an eigenvector is an eigenvector corresponding to the same eigenvalue, 
we agree that [A,v] = [A, av] for all real a # 0. Also associate with the matrix A 
given in (1) the equation 


bm? + (a-—d)m—c=0. (2) 


It simplifies things and there is no inconsistency to state that oo is a root of (2) 
whenever b = 0. The multiplicity of oo as a root of (2) is 1 if b = 0 and a — d # 0, 
and 2if b = 0 and a—d=0, butc + 0. 

Our procedure is given in the following. 


PROPOSITION. Given the matrix A in (1). 


(a) The pair 
E + bm, (1) (3) 


is an eigenpaar of A if and only if m is a finite root of (2). 
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(b) The pair 


(1) ® 


is an eigenpaar of A if and only if 00 is a root of (2). 
(c) If at least one of the coefficients in (2) is nonzero, then the space of eigenvectors 
for each eigenvalue found above is one-dimensional. If all three coefficients are zero, 


° (3) ‘ 


is an eigenpaar of A for any x and y that are not both zero. 


Proof. For ease of notation we will define the vectors 


m= (m) and v= (7): 


If m is real, then m is the slope of v,, while the slope of v,, is infinite. 
For part (a), assume that m is a finite root of (2). When we rewrite (2) as 


m(a+ bm) =c + dm, (6) 


we conclude that 


(alm) = (2+ om)(m) Y 


Consequently (3) is an eigenpaar of A. Conversely, if (3) is an eigenpaar of A for 
finite m, then (7) is true from which (6) follows and m is a root of (2). 
For part (b), consider the equation 


a b\(0\_ ,[0 
(alt) = (a) 
We see that (4) is an eigenpaar of A if and only if b = 0 in which case oo is a root 
of (2). 


For part (c), first assume that (2) has at least one nonzero coefficient so that it 
has at most two roots. When we consider the equations 


(oF\(A)=a(h)-oeo ana (2 S)(2) =a(2) 


we see that if v, or v,, is an eigenvector, then necessarily the corresponding 
eigenvalue is a + bm or d respectively. Since any nonzero vector in R? can be 
written uniquely as a constant multiple either of v,, for some real m, or of v,,, then 
any eigenpaar defined by (3) or (4) defines a one-dimensional space of eigenvectors. 
However, if all three coefficients of (2) are zero, then oo and any finite m will be a 
root. In this case A is diagonal with equal diagonal elements and it follows from (3) 
and (4) that [a,v,,] for any m and [a, v,,] are eigenpaars. That this is the same as (5) 
for any x and y now follows from our statement of equality for eigenpaars. 0 


Comments. In making phase plane plots of the linear system 
dX/dt = AX (8) 


where A is given by (1), we find that m is a solution of (2) if and only if y = mx 
contains linear trajectories of (8). These lines partition the phase plane up into 
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sectors which bound the phase trajectories of (8). Consequently, we are led to call 
(2) the sector equation of A, and its roots the sector roots of A. 
A property common to the characteristic equation of A, namely 


— (a+d)d + (ad — bc) = 0, 
and the sector equation of A is that they have the same discriminant, namely 
A=(a-—d) + 4be. 


It is immediate that AX = a + bm is the transformation that relates the characteristic 
equation and the sector equation. 

The technique presented here facilitates the writing of a simple microcomputer 
program for finding all eigenpaars for a given 2 X 2 real matrix. 

We conclude with some examples. 


Example 1. A > 0, b = 0. Given 


(2 O 
A=(¢ t): 


the sector equation is m — 6 = 0, so that the sector roots are oo and 6. Since for 
m = 6, a + bm = 2, and the eigenpaars are 


a(r)] ae .(6)} 


Example 2. A < 0. Given 
_ 1 1 
a=(_5 3) 


the sector equation is m7 — 2m+ 2 =0, so that m=1+i, and a+ bm=1+ 
(1 +7) = 2 + i. Consequently the eigenpaars are 


pei(.iiJh 


Example 3. A > 0, b # 0. Given 


_ 1 —-—2 
A = ;): 


the sector equation is —2m” — m + 1 = 0s0 that the sector roots are m = 1/2, — 1. 
At m=1/2, a+ bm=0. At m= —1, a+ bm = 3 so that the eigenpaars are 


o(va)}=[o(3)] E(t) 
Note that A is singular here but is nonsingular in Example 1. 
Example 4. Infinity is a sector root of multiplicity two. Given 
A= (4 °), c #0, 


the sector equation is Om? + Om + c = 0 having oo as a double root. Then the only 
eigenpaar is 


60 ARNO VAN DEN ESSEN [January 


REFERENCE 


1. William E. Boyce, and Richard C. DiPrima, Elementary Differential Equations and Boundary Value 
Problems, 4th ed., Wiley, New York, 1986. 


Magic Squares and Linear Algebra 


ARNO VAN DEN ESSEN 
Department of Mathematics, Catholic University of Nijmegen, Nijmegen, The Netherlands 


Magic squares have been the subject of an enormous number of papers. The aim 
of this note is to show how magic squares can be used as examples in a first year 
course in linear algebra. 

Let Q be the field of rationals and n a positive integer greater than 1. An n-by-n 
matrix M with entries in Q is called semimagic, or a semimagic square, if there 
exists an element of Q denoted by o(M) such that the sum of each row and each 
column equals o(M). The matrix M is called magic, or a magic square, if 
furthermore the traces of both M and JM equal o(M) (J is the n-by-n matrix 
satisfying J, =Oifit+j#n+ 1 and J,;=1if i+j =n + 1). The set of magic 
squares of order n (resp., the set of semi-magic squares of order n) we denote by 
Mag_,(Q) (resp., SMag,(Q)). If no confusion is possible we drop the index n. 
Finally J denotes the identity matrix. 

As announced above we now describe four examples where magic squares can be 
used to illustrate several concepts of linear algebra. 


1. The first thing to observe is that both sets SMag,(Q) and Mag, (Q) are 
Q-vector spaces. Then questions like “compute the dimension of both spaces” and 
“find a Q-basis for each of them” (for small n) supply nice exercises [2]. 


2. Obviously Mag, (Q) is a linear subspace of M,(Q), the set of all n-by-n 
matrices with entries in Q. The following proposition, which describes how the 
vector space of magic squares sits in the vector space of semi-magic squares, 
illustrates the notions of trace of a matrix and direct sum decomposition of vector 
spaces. 


PROPOSITION 2.1. SMag(Q) = Mag(Q) @ Q/ @ QV. 


Proof. Denote by £ the n-by-n matrix with entries all equal to 1. Obviously E is 
magic. The crucial property of E is that EM = 0(M)E = ME for all semi-magic 
squares M. In particular E* = o(E)Ee = nE and more generally E? = n?~E for 
all p > 1. 

1) First we show 


Mag(Q) = Mag®(Q) ® QE and SMag(Q) = SMag’(Q) ® QE. (1) 


where Mag®(Q) (resp., SMag°(Q)) denotes the set of magic (resp., semi-magic) 
matrices with trace zero; namely, if M is magic (resp., semi-magic) then 
M — *Tr(M)E has trace zero and is magic (resp., semi-magic). That both sums are 
direct follows by taking traces. 
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ELEMENTARY PROBLEMS 


E 3361. Proposed by William P. Wardlaw, U.S. Naval Academy, Annapolis, MD. 


Prove or give a counterexample to the following conjecture concerning the Euler 
totient function: For every even positive integer n there is an odd positive integer k 
such that 


p(k) = (nn). 


E 3362. Proposed by Bjorn Poonen, Winchester, MA. 


Given a positive integer n, let M denote the set of n by n matrices with complex 
entries. If A € M, let A* denote the conjugate transpose of A. Prove that for each 
positive integer k and each B € M there exists a unique A € M such that 


A(A*A)* = B. 
E 3363. Proposed by N. J. Fine, Deerfield Beach, FL. 


A printer is given three copies of the numeral 4 and an unlimited supply of each 
of the six symbols 


+,:,-,+,V,] |, 
as well as an unlimited supply of left and right parentheses. (Here |¢] denotes the 


greatest integer not exceeding the real number f¢.) Show that he can compose an 
expression for each positive integer N. For example, 


19 = [4 = (VVVVVVV4 - LVV4)) |. 
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E 3364. Proposed by H. W. Lenstra, University of California, Berkeley. 


Determine whether there exist positive integers a,b,n,m such that a # b, 
n>2, m > 2, and 


-b 
a hn mM. 


E 3365. Proposed by Dean S. Clark, University of Rhode Island, Kingston, RI. 


Let S denote the sum of the integer weights that come with a two-pan balance 
scale. Give a necessary and sufficient condition on the weights that assures that all 
integer-weighted objects in the range of 1 to S can be weighed exactly. Use of 
weights on both pans is permitted. 


E 3366. Proposed by the Editors, University of Illinois at Urbana-Champaign. 
For n = 1,2,3,... determine the subset of (0,1) on which 


(=) (oe -log(1 — x)} <0. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Breaking a Rock into Pieces 


E 3094 [1985, 427]. Proposed by Louis Funar, University of Craiova, Romania. 


Let D be a convex body in R?, and let A(D) = sup, area (7 M D) where w isa 
variable plane. Prove that D can be divided into four parts D,, D,, D;, D, so that 
A(D,) < A(D), i = 1,2, 3,4. Is it possible to divide D into three parts with the same 
property? 


Solution by C. A. Rogers, University College, London. It is possible to divide D 
into two parts with the same property. Choose an interior point 0 of D. Choose a 
sequence /,,/,,... of chords of D passing through @, the chords being dense in D. 
We choose a sequence 6€,, €,,... of small positive numbers decreasing rapidly to 
zero. For each n > 1, let C, denote the set of all points of D lying at a distance less 
than ¢, from /,. By choosing «, €,,... suitably we can ensure that A(C,) < 
2~"A(D), for each n > 1. 

Take D, = U%_,C, and D, = D\ D,. Then 


A(D,) < EMC) <\(D). 


It is also clear that 
area (1 ™ D,) < A(D) (1) 


for each plane 7. 
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Suppose it were true that A(D,) = A(D). Then it would be possible to choose a 
sequence 7, 7,,... Of planes so that 


area(7,1D,) > A(D) as n> o. 


Further, these planes could be chosen so that they converge to some plane 7. By (1), 
we would have 


area (a7 D,) <A(D). 
So we could choose a constant p and an open set G in 7 with 7 N D, C G and 
areaG <p <A(D). 


Since D, is compact, points of 7, D, cannot approach points of 7 \ G as n tends 
to infinity. This would ensure that 


area (7, D,) < uw <A(D) 


for all sufficiently large n. Thus our supposition is false and hence A(D,) < A(D), 
as required. 


Editorial comment. Although a partial solution was given by the proposer, no 
complete solution was received before the customary deadline. The above solution 
was prepared by Professor Rogers at the request of the editors. 

The proposer may have intended to place further restrictions on the “parts,” but 
the above solution certainly disposes of the problem as published. 


A Subtle Inequality 


E 3116 [1985, 666]. Proposed by M. Laub, Jerusalem, Israel. 


(a) If x and y are non-negative real numbers, prove that x* + y” > x’ + y*. 


(b) If {x,, x5,..., x, } 18 a finite sequence of non-negative real numbers and 
{ 1, Yo.--+> ¥, } 1S any permutation of this sequence, prove or disprove that 
XP + XZ? res tx D> XP + KF? + +++ +x} 


Solution by Ishai Ilani, Jerusalem College of Technology, Israel. Both parts are 
true. 
(a) Without loss of generality we may assume that x < y, and we shall prove that 


y? —y* — (x? — x*) > 0. 
We shall distinguish two cases: 
Case (1), y > 1. In this case we shall use the identity 


y? —y* = (x” — x*) = f (y"log y — x‘ log x) dt 
and, since y > max{1, x}, we have 
y' log y> x‘ log y > x‘ log x. 
Hence, 


y? —y* — (x” — x*) > 0. 
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Case (i1), x < y < 1. In this case we consider the function f defined by f(t) = 
y’ — y' — (t” — t'), where y is fixed and 0 < t < y. Obviously f(y) = 0. We shall 
prove that f’(t) < 0 for all ¢ such that 0 < ¢ < y, from which follows immediately 
that f(x) > 0. 


Clearly 
f(t) = g(t) + A(t), 
where 
g(t) = t'logt — y'‘ log y 
and 
A(t) = t*— yt?™*. 
Since 


ya y 
t)=—f —(s‘l ds=—|f{ st(14+11 ds, 
g(t) = — f Zs‘ logs) ds = — fos (1 + tlogs) ds 


the condition t <s<y <1 implies that 1+ ¢logs>1+tlogt>1l—e '>0. 
Hence g(t) < 0. In order to see that h(t) < 0, we consider the function k defined 
on (0, 00) by k(z) = t' — zt771, where ¢ is fixed; we wish to show that k(y) < 0. 
Now 0 <t<y<land k(t)=0>t'’-—1=k(1) and k(z) -t' asz > ©. 

If k(y) were non-negative, then k would have a zero in [y,1) and a zero in 
(1, co) in addition to the smaller zero at ¢t. Thus if k(y) > 0, the function k would 
have at least three positive zeros and hence k’ would have at least two positive 
zeros. But, since k’(z) = —t?~+(1 + z logt), we see that actually k’ has only one 
positive zero. Thus we must have h(t) = k(y) < 0. 

(b) It is enough to prove the assertion for cycles, 1.¢., that 


xt + xF2 bores Ake + xX > xP + xFB+ ++ +X, + xB ( * ) 


for all positive integers n and all nonnegative real numbers x,,..., x,. We proceed 
by induction on n. The assertion is true for n = 1,2. So suppose n > 2 and (*) is 
known to be true if n is replaced by n — 1. Then 


XPT A KZ vee HM + xX — (XR? + FH + HM, + XN) =A B, 
where 
A= (x + XZ? Hove Hx + xn) — ( x}? + x73 torts Hx + x1), 
B= (x*»— x) — (x%, — x*,). 
By induction A > 0. To prove B > 0 let us put for brevity 
Hy XX -1 TV, XH = 2, 
so that B may be written 
B=(z?—2*)—(y?—y*). 
Again there are two cases: 


Case I, one of the x, is greater than 1. Without loss of generality we may assume 
that x, = max, .;<, X;, since the cycle may be rotated if necessary. Then 


B=2z7-—2*—-(y’-y*) = f (2! log z — y‘log y) dt > 0, 


1990] PROBLEMS AND SOLUTIONS 67 


since z > max(1, x, y) and, accordingly, 
z'logz > y'logz > y‘ log y. 


Thus B > 0 in case I. 

Case II, none of the x, is greater than 1. Without loss of generality we may 
assume x, = min, .;<,,X;, Since the cycle may be rotated if necessary. Thus, in the 
notation introduced above, z = x, < min(x,, x,_,) = min(x, y). We consider the 
function F defined by F(t) = t' — t* — (y‘ — y*). Clearly 


F(t) = (t'logt — y‘log y) + (t' — xt*~') 
= g(t) + h*(t), 
where g is as in part (a) and h*(t) = t' — xt*~+. As in part (a) we have g(t) < 0 for 


0O<t<y and h*(t) <0 for0 <t< x, so that F(t) < 0 for 0 < ¢t < min(x, y). 
But 


F(min(x, y)) = F(x) = F(y) = 0, 


and thus F(t) > 0 for 0 < ¢ < min(x, y). Since z < min(x, y), we have B = F(z) 
> 0 in case II. Thus the induction is complete and assertion (b) is proved. 


Editorial comment. As noted in Case (i) of the above solution, the assertion of (a) 
is almost immediate if max(x, y) > 1. Thus if x < y and y > 1, we have 


y” —y* = y*(yp*- 1) > x*( yr _ 1) > x*(x?™* _ 1) _ xy — x*. 


However, the first inequality fails when x < y < 1, since then y”~* — 1 is negative. 
Thus a more subtle argument seems to be required in case max(x, y) < 1. Many 
would-be solvers either ignored this case or asserted incorrectly that the preceding 
trivial argument applies. 


Part (a) was solved also by B. Brunson. Part (b) was partially solved by E. Rips and the proposer 
jointly. 


A Polynomial Identity 


E 3207 [1987, 456]. Proposed by Geng-Zhe Chang, University of Science and 
Technology of China, Hefei, Anhui, China. 


If m 1s a positive integer, let 


Show that 
Fy) = Fl) = pay (20m = x)” 
for m > 1. 


Solution I by the University of South Alabama Problem Group, Mobile. For m > 1, 
set G(x) = F,_1(x) — F(x). Using Leibniz’s rule for the differentiation of a 
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product, 


F(0) = Ca ent : at - = \(i)-08 


Hence F,,(0) = 1, F/(0) = —2, and F{(0) = 0 for 2 < i < m. This yields G0) = 
0 for 0O<i<m—1. Also (1—x)”** is a factor of F(x). Thus G,(x) = 
a,,x""(1 — x)” for some constant a,,, since the degree of G,, is at most 2m. If 
H,(x) = G,(x)/Q — x)”, then a,, = H,,(1) is the coefficient of the last nonzero 
term in the sum defining F,,_,(x), so that 


a,, = H,,(1) = [1 — m= }(2" 7 | — aay (2%) 


Solution IT by students of the 1987 Mathematical Olympiad Program, U.S. Mili- 
tary Academy, West Point, NY. Let 


t 


B(t;n,x)= ¥ (7) ~ x) 


j=0 


We claim that the polynomial identity 


B(t;n,x) — B(t+1;n+4+2,x) 


=(F)s7a y= (Pyjamas) 


holds for all nonnegative integers ¢. To see this, suppose that 0 < x < 1, and view 
the identity in the language of probability theory, specifically in terms of n + 2 
Bernoulli trials with success probability x. The identity (*) may be rewritten in the 
equivalent form 


n n— 1 
B(t;n, x) + (, n 1) —x)" 


= B(t+ In +2, x) + (Txt — x) (x *) 


Now B(t; n, x) is the probability of at most ¢ successes in the first n trials 
regardless of what happens in trials n + 1 and n + 2, the second term on the left of 
(* *) is the probability of exactly ¢ + 1 successes in the first n trials followed by 
failures on trials n + 1 and n+ 2, B(t + 1; n+ 2, x) is the probability of at most 
t + 1 successes in the n + 2 trials, and the second term on the right of (* *) is the 
probability of exactly ¢ successes in the first n trials followed by successes on trials 
n+ 1 and n+ 2. Using these interpretations, we see that both sides of (* *) give 
the probability of the following event: There are at most ¢ + 2 successes in n + 2 
Bernoulli trials with the restriction that ¢ + 2 successes can occur only in the case of 
exactly ¢ successes in the first n trials followed by two successes in the last two 
trials. Thus (* *) holds for 0 < x < 1 and hence (* *) and (*) hold for all values 
of x. 
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Now, note that F(x) = B(m — 1;2m, x) — 2xB(m — 2;2m — 1, x) for m> 1. 
Using the identity (*), we find 
F(x) — F(x) = B(m — 2;2m — 2, x) — B(m — 132m, x) 
— 2x{ B(m — 3;2m — 3,x) — B(m— 2;2m—1,x)} 


= (2m =? )xm( —x)"- 2m 2 hm — x)"** 


1 m 
= Xam cay (inn) =" — x) 


for all m > 2. The assertion of the problem can be verified directly for m = 2. 


Solution III and generalization by David G. Robinson, Emory University, Atlanta, 
GA. We define the symbol 


recursively by putting 


(o).,=4 for n>1, (et for n>1l, 


n _{n-1 n—-1 _ 
iden ( k ) tts). for l<k<n-1. 


By induction we may express this symbol in terms of ordinary binomial coefficients 
as follows: 


k k 
n _ (n-1 n—1)\_ _* « (7. 
ida a k }+o(7 24) {a{ ~) oo \(7 
for l<k<n-1. 
In particular 


" 2k \(7 for 0<k 1 
(i) 2 (tli) Or SIS he 


In terms of generating functions we have 


n 


Le) es (ay + bx)(y +x)" (n > 1). 


Defining 
[n/2] 


A,(a,b;x,y)= (i) xty"*, (n> 1) 
k=0 
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we see that F(x) = A,,,(1, — 1; x,1 — x). The recurrence formula for the coeffi- 
cients yields the following recurrence for the polynomials: 


A, (a, b; xX, y) — (x + y)A,_1(a, b; xX, y) 
n{n—l n h 
= (-1) 72 | xl *D/Alyln/21 (n> 1). 
Applying this twice, we obtain (for m > 1) 
A,,,(a, b; xX; y) — (x + Y) Agm—7( 4, b; xX, y) 


_ 2m 2) mim _ om ~ 2) m+1,,m-1 
( m ab > (am ab y ° 


which reduces to the desired recurrence for F when a = 1, b= —1, and y=1--x. 


Solved also by G. E. Andrews, K. L. Bernstein, J. Fitch, T. Hermann (Hungary), Kanji Ikari & 
Shizuka Nakano & Akira Teramoto & Kaori Yagow & Hiroshi Zaizen jointly (Japan), E. Lee, O. P. 
Lossers (The Netherlands), Y. S. Sathe & R. G. Sheno jointly (India), J. H. Steelman, and the proposer. 


Asymptotic Behavior of a Recurrence 
E 3229 [1987, 787]. Proposed by Gary E. Stevens, Hartwick College, Oneonta, 
New York. 
Determine the asymptotic behavior of the sequence defined by the recurrence 
ayg=l, a,=n/%1 for n> 0. 


Solution by Preben Alsholm, Danmarks Ingeniorakademi, Lyngby, Denmark. We 
will show 


log k)* 
a5, = 2k — (log2k)’ + O “|, 


k 


log(2k + 1) (log k)° 
Geri ht Get Ola J 


Let b,, = log a,,/log2k for k > 1. Applying the recurrence twice yields 


by, 49 _ oe log(2k t+ l)exp(— bo, log 2k) (1) 


We will show that the sequence { b,,} is increasing for k > ko, provided ky > 2 and 


(2k, a 2)’ | 


(Qk, — 3)(2k, — Ilog(2k, — 1) (2) 


box, > Max) ba, 9; 


Using a calculator, one finds that k, = 15 satisfies (2). Suppose then by induc- 
tion that by, > b,,_4 > b.,,-. for some k > ko. Using (1), we see that b,,,5 > 5, 
if and only if 


log(2k + 1) 
log(2k — 1) 


e (bo, — 52,2) log(2k — 2) + bo, log {2k /(2k—2)} > 
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Let L denote the left side of this desired inequality. By the induction hypothesis, we 
find 
2k 


L > er 108(2k/2K-2) 5 pbragloB(2k/2k-D) 3 1 4 b. lop——— 
2k BD 2 


1 
1+ . 

Ft) 
Since log(] + x) > x — x*/2 >x(1—x,/2) for 0< x <x, we set x)= 
1/(k, — 1) to obtain 


k-1 2k,-2° 


L>1 + by, 


On the other hand, the inequality log(1 + x) < x for positive x yields 


log(2k + 1) 2 
i yp 
log(2k — 1) (2k — 1)log(2k — 1) 
2k, — 2 1 
< 


1+. . —_, 
(2k, — 1)log(2k, -1) k-1 


since (a — 1)/(a log a) is a decreasing function of a for a > 1. Applying (2) to this 
inequality, we obtain 


log(2k + 1) 2kj-3 1 


———— <1+b,, -——~ - —— < L. 
log(2k — 1) *ko Vk, -2 k-1 


Thus, {5,,} is an increasing sequence for k > ky. Since a, > 1 for all n > 0 and 
thus a, = n'/*"-1 <n, it follows that b,, <1 for all k >1, and hence {b,,} 
converges to some a in (0,1]. This implies b,, exceeds a/2 for sufficiently large k, 
at which point (1) yields 


log(2k +1) log(2k + 1) 
by, S a/2 
(2k) (2k) 


Hence log b,, converges to 0 and a = 1. It follows immediately from a,,_, = 55, 
that a,,_, decreases to 1 as k — oo (for k > k,). Since 6), increases to 1, we find 
for any e > 0 that 1-—e <b,, <1 for k > k(e). Thus (2k)'"* < ay, < 2k for 
k > k(e). By using 


— log b3,.4. = 


a, = eB/ay- (3) 


with n = 2k + 1, we find 


are + 1) 
exp| ————— 


log(2k + 1) 
2k 


S Agx41 S oo (Qk) 


Since a,,,, does not equal the preceding lower bound, we cannot get an expansion 
for ay,,, Solely by expanding the series for the exponential. However, since 
log x = o(x°) for all & > 0, we obtain the approximation 


Aop4, =1 t+ o(k%~*) for € >. (4) 
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We can now iterate (3) successively for n = 2k and n = 2k + 1, using successive 
formulas for a,,,, and a, and the fact log x = 0(x°), to generate asymptotic 
expansions of a, for odd n and even n. We will derive the first several terms. The 
first two steps yield 


a,,=2k+O(k") for e > 8, (5) 
and 


log(2k + 1) 


Arp4, = 1+ ska + O(k87*) for e; > &. (6) 


Having obtained two terms for a,,,,, we can replace (4) by the more accurate 
formula a,,,, = 1+ O((logk)/k). Resuming the iteration from that point, we 
replace (5) by a,, = 2k + O((log(2k))*) and (6) by 


log(2k + 1) (log k)” 
fae ORT ie 


Finally, one more iteration yields 
_ 2 —1 4 
ay, = 2k — {log(2k)}* + O(k7(log k)’). 


Editorial comment. Using the Maple symbolic computation package, Gaston H. 
Gonnet of the Symbolic Computation Group, University of Waterloo, Waterloo, 
Ontario, obtained the following more precise results. For n odd we have 


logn 2logn+ (logn)* + 2(logn)’ 
—— + ———__ 


=1+ 
On 2n? 


6 log n — 12(logn) + 25(logn)° + 3(logn)* + 9(log n)” 1 

, Sean = ion) + lea) + seen)" + Moen), (2) 
6n n 

while for n even we have 


2logn — 4(logn)’ + (log n)* — (logn)* 


a, =n— (logn)” + - 


. 15 log n — 30(logn)” + 42(log n)’ — 25(log n)* + 6(log n)” — 4(log n)° 


6n? 
0 1 
+ O|—— |}. 
[| 


All other solvers showed only that a, approaches 1 for n tending to infinity 
through odd values and a,/n approaches 1 for n tending to infinity through even 
values. 


Solved also by L. Erdés (Hungary), J. Marengo, P. Tracy, and the editors. Partially solved by the 
proposer. Four incorrect solutions were received. 
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On Integer Triangles 


E 3256 [1988, 259]. Proposed by John Isbell, SUNY at Buffalo. 


(a) Let T be the set of triangles in the plane whose vertices have integral 
coordinates and whose sides have integral lengths. Certain isosceles triangles in T 
can be constructed by fitting together two congruent right triangles in 7, e.g., the 
isosceles triangle with vertices (— 12, — 9), (12,9), (—12, 16) arises in this way. Are 
there any other isosceles triangles in T? 

(b) Consider the set V of triangles in 3-space whose vertices have integral 
coordinates. Does V contain any equilateral triangles with integral side-length? 


Composite solution by Jesus Ferrer, Oliva, Valencia, Spain, and Stephen M. 
Gagola, Jr., Kent State University, Ohio. (a) The answer is no. Let A, B,C be the 
vertices of a triangle in T with AC = BC. By translation, we may take C = (0,0). 
Let A = (a, a,), B = (b,, b,), x = AC = BC, and y = AB. Then 

x? =a? + a3 = b? + b (1) 
and 


2 2 
y= (a, — b,)° + (a, — by)” = 2( x? — a,b, — ayb,), (2) 
Thus, y is even, so the segment of AB from A to its midpoint M has integer 


length. If y is even, then y” is Omod 4, and congruence mod 4 implies that a, — b, 


and a, -— b, are even. Hence M = ((a, + b,)/2,(a, + b,)/2) has integer coordi- 
nates. 


Finally, we compute CM. We may assume a, # b,. By rearranging terms in (1), 
we obtain (a, + b,)/(a, + b,) = —(a, — b,)/(a, — b,). Using this and then (2), 


we have 
2 

a, + b,\’ a,+ b, 

M)? = |———] + ——— 
(C ) | 2 | 2 2 al 
naa y? 


2 
= |——— | {1+ = —— - 
2 ay —_ b, 2 (a, _— b,) 
This is an integer and is the square of a rational number, so its square root must be 
an integer. 


(b) The answer is no. Let A, B,C be the vertices of an equilateral triangle in V 
with side length x. By translation, we may take C = (0,0, 0). Let 
A= (a,, Ay, 43), B= (b,, b,, 53). 


Without loss of generality we may assume that a), a,, 4a, b,,b,,b, have no 
common factor greater than 1 and in particular that at least one of them is odd. 
The equations used in (a) become 


a+b) (51) 14 


xv=artaptap=bi+b54+ b3 
= (a, - bi)” + (a, —- by)” + (a, —- b;)” = 2( x? — a,b, — ayb, — ab;). (I) 
Again we conclude that x is even, which implies 
0=a?+a5+ az = be + b5 +b; (mod 4), 


which is impossible when at least one of a,, a5, a3, b,, b5, b; is odd. 
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Editorial comment. Several solvers noted that in higher dimensions the answers 
are different. In particular, the points (0,0,0,0) (2,0,0,0), and (1,1,1,1) form the 
vertices of an equilateral triangle with integral side-length. If n > 4, we may append 
n— 4 zero coordinates to give three points in Z” forming the vertices of an 
equilateral triangle with integral side-length. 

If F is a subfield of the reals, Jim Delany proved (a) there is an equilateral 
triangle with vertices in F* having side-length in F if and only if F contains V3, 
and (b) there is an equilateral triangle with vertices in F? having side-length in F if 
and only if 3 is a sum of two squares of elements of F. 

I. J. Schoenberg’s paper, “Regular simplices and quadratic forms,” J. London 
Math. Soc. 12 (1937) 48-55 determines those values of n for which there exists an 
equilateral n-simplex whose vertices are in Z”. 


Also solved by J. Anglesio (France), J. Delany, K. G. Fischer, W. Janous (Austria), A. Pedersen 
(Denmark), A. J. Schwenk, and the proposer. Partially solved by J.-M. Monier (France). 


A Special Collection of Permutations 


E 3260 [1988, 350]. Proposed by Peter Andrews and Edward T. H. Wang, Wilfrid 
Laurier University, Waterloo, Ontario, Canada. 


In how many ways can n white squares and n black squares be chosen from a 2n 
by 2n chessboard in such a way that no two of the chosen squares lie in the same 
row or the same column? 


Solution and generalization by John Henry Steelman, Indiana University of Penn- 
sylvania, Indiana, PA. If n is odd there are no such permutations, but if n is even 
the answer is [n! /(n/2)!]*. 

Translate a permissible choice into a permutation 7 such that choosing position 
(i, 7) corresponds to w(i) = 7. The squares of one color are those with i = 7 mod 2; 
the other color have i # j mod 2. Thus the desired set corresponds to the permuta- 
tions of {1,...,2n} such that i # w(i)mod2 for exactly n values of i. More 
generally, let f(m, k) denote the number of permutations of [m] = {1,..., m} such 
that i # 7(i)mod 2 for exactly k values of i; we compute f(m, k) for allO < k < m. 

Note first that in any permutation 7 of [m], the number of odd values that have 
their parity changed must equal the number of even values that have their parity 
changed, because the total number of odds must be the same before and after the 
permutation. Hence f(m, k) = 0 if k is odd. 

If k is even, the desired permutations can be constructed by choosing the 
numbers whose parities will be reversed and then choosing a permutation for the 
[m/2] numbers that will be mapped to odd numbers and a permutation for 
the | /2]| numbers that will be mapped to even numbers. The number of ways to do 
this is 


j(m,k) = | Pea! imam 2} 


When m = 2n and k = n, this reduces to the formula claimed. 
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Editorial comment. The proposers remarked that, since the total number of 
permutations is m!, the explicit determination of f(m, k) yields the interesting but 
well-known combinatorial identity U7"_ )f(m, k) = m!. 


Solved also by the proposers and 24 others. 


Simplices with Integer Edges and Volume One 


E 3279 [1988, 655]. Proposed by Stanley Rabinowitz, Westford, MA. 


For some n (n > 1), find a simplex in E” with integer edges and volume 1. (Cf. 
Problem 1261 in Mathematics Magazine {1987, p. 40; 1988, p. 118].) 


Remark. The condition n > 1 was inadvertently omitted from the published 
statement of the problem. 


Solution by the proposer. In E+, the simplex with vertices 
(0,0,0,0), (7,0,0,0), (33/14, 1573 /14,0,0), 
(81/14, — 19973 /70, y582 /5,0), 
(53/7, 65273 /315, 12887582 /4365, 8/194 /291) 
has the desired property. 


Editorial comment. As Walther Janous (Austria) remarked, Heron’s formula 
shows immediately that a triangle with integer edges cannot have area 1. Actually it 
is known that if a triangle has integer edges and integer area, then the area must be 
a multiple of 6. (See the above-mentioned problem 1261 in the Mathematics 
Magazine.) In any event, a simplex of the type mentioned in the statement of the 
problem does not exist for n = 2. 

In a paper, “Tetrahedrons with integer edges and integer volume,” to appear in 
Mathematics Magazine, Kevin L. Dove and John S. Sumner show that there is no 
tetrahedron with integer edges and volume 1. Thus n = 4 is the smallest value of n 
for which the phenomenon of the problem occurs. 

It is conceivable that a numerically simpler example than that given above can be 
found for some larger value of n. However, the editors know of no such configura- 
tion. 


No other solutions were received. 
ADVANCED PROBLEMS 
6619. Proposed by Arthur L. Holshouser, Charlotte, NC, and Benjamin G. Klein, 
Davidson College, NC. 


Suppose that N = 4"(8k + 7) for non-negative integers h and k. 
(a) Show that there exist infinitely many distinct 5-tuples (a, b,c, d,e) of 
positive integers such that 


N=(a*+b?+c?+d7+ e?)/( abcde +1). 
(b) Show that there is no 4-tuple (a, b, c, d) of positive integers such that 
N =(a?+b?4+ c? + d?)|( abed +1). 
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6620. Proposed by W. Allegretto and A. Meir, University of Alberta, Edmonton, 
Canada. 


Suppose f is locally integrable on R, but 


J fC) dx = 00. 


Show that for every p > 0 there exists ¢ € L?(R) such that it is impossible to have 
f(x)o(x) = g(x)A(x) (x ER), 
where g € L'(R) and h € L?(R). 


6621. Proposed by William C. Waterhouse, Pennsylvania State University, Univer- 
sity Park. 


Which numbers can occur as a ratio of roots of an irreducible quadratic 
equation? More precisely, suppose K is an algebraically closed field. For each 
subfield E of K and each separable quadratic extension F of E with conjugation , 
certain elements of K occur as ratios ¢/c with c in F. Which elements of K (besides 
1) do arise in this way when we allow all possible choices of E and F? 


SOLUTIONS OF ADVANCED PROBLEMS 
Ratios of the Terms of a Sequence 


6559 [1987, 885]. Proposed by Benjamin G. Klein, Davidson College, and John 
Layman, Virginia Polytechnic Institute and State University. 


Let Cy, C1, C),... be a sequence of real numbers determined by the values of 
Co, C1, Co together with the recursion 


Cn+1 — (C,Cq—1 + 1)/c,_2; 
for n > 2. Further, suppose that cy = c, = a, where a > 0. 
(i) For what values of c, does the sequence of ratios c,,,,5/c,, converge? 
Determine the value of the limit for each such value of ¢,. 
(ii) Show that for each value of a there is exactly one value of c,, such that the 
sequence of ratios c,,,/c, converges. Find this value of c, and then determine the 
associated value of the limit of the sequence of ratios. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
We shall show that c,,,,5/c, has a limit only in the following three cases: 
Ico=-l1/(2a). WW. -1/2a)<c, <0. Ill. ¢,>0. 
In case I the limit is —1, in case II the limit is (B — VB* — 4)/2, where 
B =2+2/(c,a), and in case III the limit is (B + VB* — 4)/2. 
The ratio c,,,/c, has a limit only if c. = (—1 + ¥1 + 16a*)/(4a) and then the 


value of the limit is 
(ac, + 1+ /2ac, +1 \/(2a?). 
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We now proceed to prove these assertions. Note that the recursion breaks down 
if any of the numbers c, vanishes. So let us assume that at every stage of the 
recursion the number c, in the denominator does not vanish and check afterwards 
whether all c, do not vanish. From the initial elements (Cp, c,, c,) = (a, c, a) and 
the recursion it follows that 


c,=(ca+1)/a and c,=(ca+2)/c. 


Claim. If c, = Bc,_,—c¢,-4 for some B, then c,,, = Bc,_1 — ¢,_3- 


Proof. 
Cnt+1°n-2 Chen -1 +1= (Be, — Cha) Cn—1 +1 
— Be, 2ly—1 a (C,~3Cn—2 + 1) + 1 
= (Be,_, C,-3) Cn—2° 
Clearly in our situation B = (c4 + Co)/c, = 2+ 2/(ca). We now distinguish 
three cases. 
(1) B = 2 is impossible. 
(2) B= —2 yields ca= —1/2, so (Co, C, C4) = (a, a, — 3a), hence c,, = 
a(1 — 2n)(—1)”, and (c,, c;) = (c, — c), hence c,,,, = c(—-1)”. 
Clearly all c, # 0, the ratio c,,5/c, tends to —1 (in both cases), and c,,,,/c,, has 
no limit. 
(3) If |B| # 2, then 
Co, = aA" + Bu"; Conga = YA" + Ou", 


where A and up are the roots of the equation 
x?-— Bx+1=0. (*) 


Using the starting values we obtain 


Con — 


and (* *) 


1 
Con+1 = aac + mw”), 


We now subdivide case (3) into three sub-cases. 
a) If |B| < 2 then A and p are complex conjugates, so c,,5/c,, has no limit. 
b) If B < —2 then (*) has two distinct negative roots, so c,,,5/c, tends to the 


smaller root, 1.e., 
1 
Cn42/Cn > 5 (B- B?- 4) <0, 


and c,,,/c, has no limit. 
c) If B > 2 then (*) has two positive roots, so 


1 
C49/l, > 5(B + VB*-—4)>0. 
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We now verify that for |B| > 2 none of the numbers c, can be zero. Since 
A = p', and without loss of generality |A| > |u|, we have 


1C2,| - "+ Ap"| 2 eli (\Aj"~* + |u|") > 0 
an! A+ 1 PTA NET " 
and, trivially, 
C 
Jeeval >| AI" = Lal”) > 0. 


For (ii) we may, because of the foregoing arguments, restrict ourselves to the case 
B > 2, and assume A > 1 > p. From (**) it follows that 


\ 2a xX 
mo y")—o3 — . 
( B") c A+] 


ie prtt 


Cans2 _ 24 n 
A+1 pti 


Con+1 c 


In the same way 
Con+1 


c 
—(A+1 ; 
c, > 57! ) asn—> 


n 
In order for c,,,/c,, to have a limit as n — oo we, therefore, must require that 
2a dX 

c A+1 


simultaneously with (*), that is, 


“(N41 


2 
v- [2+ =|a+1=0 


ac 
These equations have a root A in common (and are even identical) if 
2 -— 4a*/c* = -2 -2/(ac). 


This means, since c must be positive, that 


—~1+yl1+ 16a‘ 


c= Ao (* * *) 
and 


Cn+1 


( 1 
> —-(A 41) = =5(2ac +: 14+ y2ac4+1), 
( 2a 2a 


n 


where c is determined by (***). 


Also solved by C. K. Bailey and R. B. Richter, Robert J. Currier, Lorraine L. Foster, Ha-seo Ki 
(Korea), D. Hensley, Kim Jun Kyo (Korea), R. J. Tomkins, and the proposers. 


Limit Involving a Power of the Sine 


6564 [1988, 60]. Proposed by Gerald Weinstein, City College, City University of 
New York. 


Define S(@) = lim |sin n|\". 


n-— +00 
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(1) Show that $(2) does not exist. 

(ii) If C is a constant such that |p/q — 7| > q~© for all pairs p, g of sufficiently 
large positive integers, prove that S(@) exists and is zero for all 6 > 2C — 2. 

Mahler [ Indagationes Math, 15 (1953) 30—42] showed that we can take C = 41.09. 
For the best known value of C see D. V. Chudnovsky and G. V. Chudnovsky, 
Springer Lecture Notes 1052, (1984), 37-84. 


Editorial remark. The absolute value symbols around sin @ were unfortunately 
omitted from the definition of S(@) in the published statement of the problem. 


Solution by Ha-seo Ki, student, Yonsei University, Seoul, Korea and the proposer 
(combined ). It is well known that for any positive irrational number £ there are 
infinitely many pairs of relatively prime positive integers n and k such that 


<— 0<c <1 (*) 


where Co is an absolute constant. Hence 
2rk — t 
nj < 
| | k 


infinitely often, and we see that for 8 > 0 the limit (if it exists) must be 0. We also 
note that for sufficiently small angles @ we have 
2 
1— $' <cosp<1l—- =. 
Fix e > 0. We now ask if there are infinitely many pairs of integers N and K 
(call them “epsilon” pairs) such that 


T 9.1/2 
Ink + 5 —N] < (Ie) 
If so, for large N we would have 
9.1/2 
sin N = cos(A(1 — &% \ (0<A <1) 


>1-(1- eN””) = N° 
and the limit defining S(@) could not exist. 

Now let 8 be a positive irrational number, o any real number, and n, k a typical 
pair approximating £ as in the first paragraph. Let ¢ be the integer nearest to ko, so 


C 
o= Fa lal? 
For integers K and N we have 
no Cy Cy 
BK — Nol -(=+ B\K-N-7-s] als 


eed ——saawes 


Since n and k are relatively prime we may choose K and N so that the first fraction 
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above is 0, and in fact with k < K < 2k. Thus, if k is sufficiently large, 


K_N 2k 1 5 5 ¢3(B,0) 
-N-ol<>a4t+aF=a< Ts < —. 
6 a) ay a N (**) 
Since there are infinitely many pairs k, n satisfying (*), it follows that (* *) holds 
for infinitely many pairs K, N. (A more careful estimate due to Khintchine shows 
that the 5 may be replaced by 5~!/* + y for any 7 > 0). 

For 8 = 27 and o = —7/2 it follows that there are infinitely many “epsilon” 
pairs if 


C3 1/2 1\'”? 
— <(1—exp(N~® loge))“° ~ n-*7 (og 
N E 
for all large N. Clearly for 6 = 2 and «e a fixed but sufficiently small positive 
number this is valid. Thus (i) is established. 
For (ii) the simple idea is to show that n cannot get too close to 7/2 or 37/2 


modulo 27 infinitely often; we give the details only for 7/2. For n large, 


a(n)= = [2k + 5) an 


4k +1 
1 —-C 
> [2k + 5|(4k +1) = b, 


7 


vii 
amk+ 7 —n 


where b is very small. We may confine our attention to the case a(n) < 1/10. It is 
then obvious that 


jsinn| < cosb 
and thus, that there are positive constants c,, c;, and c, such that 
jsin n| < cos(c,k!~°) < cos(c4n'~©) 
C6 


Hence, 


@—(2C-2) 


2c-2] Nn 
1D Ce \" 
| sin n| < j{1—- 76-2 
n 


< exp( — cen? CC), 


which, for 9 > 2C — 2, tends to 0 as n tends to infinity. In this way (ii) is 
established. 


No other solutions were received. 
Sequences Containing Arbitrarily Long Arithmetic Progressions 


6565. Proposed by L. A. Rubel, University of Illinois at Urbana-Champaign. 


Suppose A = {a), a), a;,...} is a sequence of distinct positive integers and 
B = {b,, b,, b3,...} is another. Suppose A contains arbitrarily long arithmetic 
progressions. 
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(a) If |b, — a,| < K for all n, where K is a fixed positive number, prove that B 
must contain arbitrarily long arithmetic progressions. 
(b) If L°°_,1/{1 + |b, — a,|} diverges, does the same conclusion hold? 


Solution by Joseph E. McCanna, University of New Mexico, Albuquerque, NM 
and Douglas Hensley, Texas A & M University, College Station, TX (combined). We 
shall prove (a) and then show that no such conclusion is valid for (b). Fix M > 0. 
Let g, < g, < -:: <g, be an arithmetic progression of length L in A. “Color” 
each g,= a, with the number b, — a, . By hypothesis we are only using a finite 
number of colors. By van der Waerden’s theorem there is a monochromatic 
arithmetic progression of length N > M among the g, if L is large enough. But 
clearly this means there is also an arithmetic progression of length N in the 
sequence B. Next, let {a,} be the sequence 


10, 11, 1000, 1001, 1002, 10°,..., 10° + 3,10°°,...,10° + 4,... 
and {b,} the sequence 
10,107, 10°, 107, 10°, 10°,... 


Since a, = b, for n = 1,3,6,10,15,... it is clear that the series diverges. Clearly A 
has arbitrarily long arithmetic progressions, while B has none. 


Also solved by Alan H. Forrest (student), Hillel Gauchman, Robert B. Israel (Canada), Bruce 
Reznick, Lior Tzafriri, and Daniel Ullman. 
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A basic hypergeometric series is written as 


6 oe Ams Ys / -y (41) ,( G2), °° (Gn) pz" (1) 
“ by, -++5 By. k>0 (b,) (2), oo (bi, ) Ca) 

where (a), = 1 and (a), = (a; q), = (1 — a) — aq) ---(1 — ag*"') for k > 1 
and |z| <1, |q|< 1, b,#q~" for any nonnegative integer n. We get a bilateral 
series if we allow k to assume negative values as well: 


Oy,1115 4m G5 Zz) & (a1), (45) p++ (Qn) ,2" 
rial by... +5 By. |- =. (by) (bo), -+- (Onde 


where (a)_, = (1 — a/q*)"'(1 — a/q*™')"! --- (1 — a/q)™1. We can obtain Ja- 
cobi’s theta series from (2), for example, 


y gizk (3) 


k=-— 0 


(2) 


is found by setting n = 0, m = 1, a, = (1/2) and replacing g by q” and z by —tqz 
and then setting ¢ = 0. Basic or qg-series have important applications to the theory of 
partitions, the representation of numbers as sums of squares, the classification and 
proofs of binomial identities, statistical mechanics, and other areas of mathematics. 

The earliest examples of these series can be found in the work of L. Euler. He 
first conjectured in 1741 and later proved in 1750 the beautiful formula: 


TTQ-a@")= Le (-1igenren*, (4) 
n=0 n=— oo 

where the numbers (1/2)n(3n + 1) are called pentagonal numbers. The series in (4) 
is a particular case of (3) obtained by replacing q* by q? and z by —q’/’. Euler was 
led to consider the product in (4) after his discovery that 


Ta —q") = E p(n)a" (5) 


where p(n) denotes the number of unrestricted partitions of n, that is, the number 
of ways in which n can be written as a sum of numbers less than or equal to n. 
A. M. Legendre discovered the following combinatorial interpretation of (4): Let 
p(n) (resp. p,()) denote the number of partitions of n into an even (resp. odd) 


82 
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number of distinct parts. Then (4) implies that 


p.(n) — p(n) = (-1)" ifn= =m (3m + 1) 


0 otherwise. 


Euler was able to use (4) and (5) to obtain a formula which provides an efficient 
algorithm for computing p(n), namely, 


p(n) — p(n— 1) — p(n — 2) + p(n — 5) + p(n — 7) 


foes. +(-1)"p(n ~ =m (3m ~ 1) 


+(=1)"p[n = 5m(3m + 1)} 4 +. =, (6) 


where it is understood that p(k) = 0 for k negative. 

C. F. Gauss encountered the very important basic series 1 + 2°, gq” and 
1 + 2E°_.(-1)"q", which are particular cases of (3), in the earliest stage of his 
mathematical career. There is an oral tradition that he had observed the connection 
of these series with the arithmetic-geometric mean in 1791 when he was only 
fourteen. It was somewhat later in the period 1797-1800 that he found the relation 
of these topics with the theory of elliptic and modular functions and the division of 
an arc of a lemniscate into equal parts. These were very great discoveries but he 
published almost nothing of them. He also summed the above mentioned series as 
infinite products: 


oO oO 
1420 q"= [](-ag™")(1tqy, (7) 
n=1 n=1 
oO 5 oO ) 
1+2¥ (-1"q" = TH - 49") - ae (8) 
n=1 n= 
After he found these results Gauss set himself the problem of expressing 1 + g + 
g?> + q°+q'° + ---, a series whose powers are triangular numbers rather than 
squares, as an infinite product. He derived the answer that 
CO fore) 1 _ gq?” 
1+ Degree = [] To geei (9) 
n=1 n=1 q 


from his theory of elliptic modular functions. However, in 1811 he published 
another derivation independent of elliptic functions. The ideas of this paper are 
important in the study of q-series. It may be of interest to note that this paper also 
contains his evaluation of the sum 1 + L7_,e?*""'/", which was of crucial impor- 
tance in his fourth proof of the quadratic reciprocity law. The problem of its 
evaluation had worried Gauss for almost four years (1801-1805). A history of 
Gauss’ work on elliptic and modular functions is contained in an interesting paper 
by D. Cox [9]. 

In a 1750 letter to Goldbach, Euler suggested that the most natural way of 
proving the four-square theorem would be to expand (1 + g+ q*+q°+---)* as 
a power series 1° _,c,g” and show that c, #0. In fact c, would also give the 
number of representations of a positive integer as a sum of four squares. Euler had 
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no clue as to how this could be done but in 1828 C. Jacobi succeeded in proving the 
following identities: 


c ia 2n+1 


(1+ 2q+2q*+--- ers4ay pil (10) 


n 


(l+2qg+2g4+---)J’=14+8 ¥ a 


n#mod 4 | 
n>1 


7: (11) 


It is an easy consequence of the first identity (10) that the number of ways of 
expressing a positive integer n as a sum of two squares, (counting order and sign) is 
four times the difference of the number of divisors of n congruent to 1 modulo 4 
and the number of those congruent to 3 modulo 4. The second identity (11) gives 
the number of representations of n as a sum of four squares to be 


8 yd. 
d/n 
d#0 mod4 
Jacobi derived these identities from his theory of elliptic functions. As another 
consequence of his theory, Jacobi found the famous triple product identity 
oO oO 
y g”z" _ Il (1 _ q?nt?\(1 4 zq?""1)(1 4 zo lg?ntl) (12) 
n=-~—0o n= 
which contains (4), (7), (8), and (9) as particular cases. Gauss too had discovered 
(12) independently. There are several elementary proofs of (12) which do not use 
elliptic functions. One such proof is in the book under review. M. Hirschhorn [13, 
14] has recently shown that Jacobi’s identities (9) and (10) on sums of squares can 
be directly obtained from (12) by the elementary processes of differentiation and 
algebraic manipulation of series. An excellent treatment of these matters is con- 
tained in [7]. G. Andrews [2] has found that there is an identity similar to (10) and 
(11) for the cube of the series in (9) from which one can obtain Gauss’ theorem that 
every number is the sum of three triangular numbers. 

The first person to study basic hypergeometric series in a general way was 
E. Heine. Euler, Gauss and Jacobi had studied only those series which arise in 
modular function theory. In 1847, Heine published the following transformation 
formula: 

For |q| < |, |z| < |, and |b] < 1 


2 (4), (5) n — (Bol 4Z en & (¢/b) n(Z)» 
~ Ae. (q), (c), ~ (€)co(Z) eo (Z)oo no (4) (az), (gaz), 


where (x),, = [1%_,)(1 — xq”). A particular case of this is the “g-analog of the 
binomial theorem”: 


(13) 


= (a), (az) 
2. Oe (14) 


a fundamental result in the theory of basic series, which was known to Gauss and 
Cauchy though it appears that H. Rothe was the first to publish it in 1811. The 
binomial theorem follows from it if we set a = q* and let g — 1. Particular cases of 
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(14) were known to Euler: 


oo gh 


» 


1 
—— =-—_, (15) 
n=0 (q), (Zz). 
00 zngnin—D/e 


» 


n=0 (qd), 


From the general theorem of Heine one can easily obtain as a particular case the 
following identity of Cauchy: 


= (~2)oo- (16) 


n*—n n 


oO 6g" Z 1 
\) —_— = —., (17) 
n=0 (A) nlZ)n (Zoe 

and if we set z = g we get a formula of Euler: 


n2 


1+ ) > eseFa = TD a"). (18) 
nai (l—-q) ss (l- 4") mai 
We also note that if we replace g by q” and set z = q in (16) we obtain 


n2 


oO q oO 
1+ ) — 3 ns = «TT 0 t+ ?""?). (19) 
20-8) 0) | 
These identities have interesting combinatorial interpretations. For this as well as 
for recent developments in this area the reader may enjoy seeing Andrews [3]. 
The identities (18) and (19) are not very difficult to prove but the following which 
are superficially similar are extremely hard: 


2 


— q" _ ~ — ~omtl —1 — ~omt+4 —1 

I+ Xu (l—q)---(Q-q") = I1G gry (1-4), (20) 
— grey) = 5m+2\71 5m+3\71 

I+ harp aay 7 thos ) (ae) 1) 


These are due to L. J. Rogers and S. Ramanujan. They have an interesting history 
which has been told many times but may still be worth repeating. Ramanujan 
discovered these identities around 1910 but had no proof. He communicated them 
to G. H. Hardy in February, 1913 but neither Hardy nor others consulted by Hardy 
could find a proof. P. MacMahon expanded both sides as a series and verified the 
equality up to 89 terms. This convinced him of the truth of the results which he 
included without proof in his book on combinatorics [15]; a book which is even now 
of more than historic interest. MacMahon found the following interpretation of the 
first identity: The number of partitions of n with minimal difference 2 is equal to the 
number of partitions into parts of the form 5m + 1 or 5m + 4. The second identity 
has a similar interpretation. 

In 1916, Ramanujan accidentally came across Rogers’ 1894 paper in the Proceed- 
ings of the London Mathematical Society which contained the statements and proofs 
of the identities. As a result Rogers and his work theretofore completely neglected 
became known. However, it was only recently that the greater significance of 
Rogers’ work was understood. The reader may wish to consult [2] for details of this 


86 RANJAN ROY [January 


interesting story. The two identities were also rediscovered by I. Schur in 1917 and 
then in 1979 by R. Baxter who utilized them in his solution of the Hard Hexagon 
Model [6]. Some interesting mathematics has arisen from reproving and generalizing 
these identities. A purely combinatorial proof involving a new convolution principle 
was found by A. Garsia and S. Milne [10] though the simplest proof is perhaps the 
one due to D. Bressoud [8]. No proof depending on modular functions is known. 

Ramanujan used the two identities as the basis for his theory of continued 
fractions of the form 


1 x= x? 
1+1+#41+ 
It is not difficult to show that this continued fraction is the ratio of the two series in 
the identities. He used this to obtain some remarkable formulae, such as 


1 ee 5+ 5 5 +1 
l+1+1+ © 


2 2 
Hardy’s comment on these results was, “I had never seen anything in the least like 
them before. A single look is enough to show that they could only be written by a 
mathematician of the highest class.” A coverage of this aspect of Ramanujan’s work 
is contained in [1]. 
Ramanujan extended the theory of basic series in many directions. Apparently he 
was the first to sum the bilateral series 


Spey Fe nz" _ (B74) co 42 Joo 4/42 Joo 4) 29 
alas Bs 2) . (db), (G/8) oo (B/0Z ) (BD) o(Z) 0 


This result was somehow missed by Gauss and Jacobi. It contains the triple product 
identity and the g-binomial theorem as special cases. It arises sufficiently often in 
mathematics to be considered an important result. Ramanujan’s identity was 
extracted from his notebooks by Hardy (see [12]), who refers to it as “a remarkable 
formula with many parameters.” Several proofs of this formula have appeared and 
one is contained in the book under review. R. Askey’s [5] observation that (23) and 
some other formulae are in fact generalizations or analogs of Euler’s beta integral 


1 xeol _ I'(a)T(B) 
Game?" Tat py 


eer? (22) 


pad 


(23) 


throws new and fruitful light on them. One gets some conception of the origin of 
such formulae and what can be done with them. 

In the last year of his life, Ramanujan introduced a new kind of q-series into 
mathematics which he named “mock-theta functions.” He wrote about them in a 
letter to Hardy two months before his death. G. N. Watson developed the theory of 
these functions on the basis of that letter and apparently it even motivated the 
young A. Selberg to do some early work in this area. However, it was not until very 
recently that mock-theta functions began to play a greater role in mathematics, 
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inspired by the dramatic discovery of Andrews of a lost notebook of Ramanujan. 
Mock-theta functions may yet be recognized as Ramanujan’s greatest contribution 
to mathematics. 

If in (1) we set m =n + 1 and substitute g* for a, and q® for b, and let g > 1 
we get the ordinary hypergeometric series 


(a,), (42), ms (4y,) 42" 
a (b,) (2), aS (b,) <Q) x 


where (a), = a(a + 1):-- (a+ k — 1). These series have played an extraordinarily 
important role in mathematics and physics and were studied extensively by Euler, 
Gauss, Kummer, Riemann, Ramanujan, and numerous other mathematicians. Re- 
cently they played a significant part in L. DeBrange’s solution of the Bieberbach 
conjecture. It is also now generally becoming known that they can be used to bring 
some order into the problem of classifying binomial identities. For this, the reader 
may consult [4], [11], and [16]. 

N. J. Fine’s book provides a fascinating introdution to the subject of basic or 
g-hypergeometric series. We see that it is possible to obtain beautiful and deep 
mathematical results without the development of a heavy and elaborate machinery. 
In fact, this book can be studied very profitably by a talented undergraduate. From 
the introduction to the book we learn that in 1948 Fine himself embarked upon an 
investigation of basic series with only a modicum of casual information about the 
subject. He was led to several elegant new results on partitions and a rediscovery of 
many important old results, among them the evaluation of Ramanujan’s ,W, sum. 
Over the years the author has added to his results and polished his presentation and 
this book is a consequence of this labor of love. Topics covered in the book include 
partitions, representation of numbers as sums of squares, mock theta functions and 
modular equations, all of which are subjects of great research interest today. 
G. Andrews has added notes at the end of each chapter giving references to some of 
the most recent developments. Almost every mathematician should find something 
to attract or charm her/him in this book. It is highly recommended. 


(24) 
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There is an old adage they tell to cub reporters: Never give the circus a bad 
review. The same underlying principle provides a cautionary motive for this re- 
viewer. What reader of this journal, including this reviewer, could possibly do other 
than applaud Professor Paulos for giving us a book on such an important and timely 
subject? Nonetheless, having offered my applause I feel obligated to state that I do 
not think that this book lives up to the expectations that the title creates. I surely 
agree with Professor Paulos that there is a problem of innumeracy but I do not 
know the audience to whom this book is addressed, I do not think that the author 
has a clear understanding of the problem he is discussing, and I do not understand 
how he thinks that this book can help with the problem of innumeracy. 

Surely all of the readers of this journal will agree with Professor Paulos that there 
is a problem of innumeracy and that, roughly, innumeracy refers to numbers and 
arithmetical manipulations in the same kind of way that illiteracy refers to letters 
and reading. The problem of innumeracy, then, is that there seem to be many people 
who are unable to cope with numbers, in a manner quite strikingly analogous to the 
way in which there are quite a few people who are unable to read. 

If this be temporarily granted, subject to subsequent discussion, then we may 
begin by outlining the approach taken by Professor Paulos in his book. His first 
chapter, “Examples and Principles,” is devoted to examples of arithmetical calcula- 
tions involving, often, large numbers. He offers a one-paragraph explanation of 
scientific notation and then tells us that human hair grows at 107° miles per hour 
and that 5 x 10’! cigarettes are smoked each year in the United States. He 
calculates the total volume of human blood in the world and concludes that if walls 
were put around Central Park in New York City then all the blood in the world 
would cover the park to a depth of about twenty feet. He cites Archimedes and 
approximates his calculation of the total number of grains of sand required to fill up 
the earth and the heavens. Next he calculates some combinations, including the 
number of five-card hands dealt at poker. Then he estimates the probability of 
acquiring AIDS heterosexually. Finally, he calculates the probability that anyone 
who takes a deep breath inhales a molecule exhaled by Caeser in his dying breath. 

Chapter 2, “Probability and Coincidence,” gives a number of probabilistic 
calculations. We are told that “A tendency to drastically underestimate the fre- 
quency of coincidences is a prime characteristic of innumerates... .” After some 
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examples, Professor Paulos proceeds to the calculations. Anyone who teaches a 
course in basic statistics will be familiar with many of the examples: the birthday 
problem, the game of chuck-a-luck, the 1964 mugging in Los Angeles involving a 
blonde woman with a ponytail, and Pete Rose’s National League record of hitting in 
44 consecutive games. Chapter 3, “Pseudoscience,” briefly discusses biorhythms, 
parapsychology, dreams, astrology, numerology, and UFO’s. These subjects are, at 
best, only marginally related to innumeracy and one might well wonder why they are 
included. The author does, however, include a section on conditional probabilities in 
this chapter. 

Chapter 4, “Whence Innumeracy,” presents the author’s ideas concerning the 
basic question he raises: “Why is innumeracy so widespread even among otherwise 
educated people?” His answer, in summary form, is: “The reasons, to be a little 
simplistic, are poor education, psychological blocks, and romantic misconceptions 
about the nature of mathematics.” The author’s comments here are both interesting 
and insightful. “Romantic misconceptions” include the belief that a concern for 
numbers is antithetical to an appreciation of the grand questions, that mathematics 
is low-level and technical, and that it is limiting and opposed to human freedom. In 
this chapter the author also includes a digression on the use of a logarithmic safety 
index for evaluating risks. Chapter 5, “Statistics, Trade-offs, and Society,” is the 
final chapter. It is pretty much of a grab-bag: voting paradoxes, the prisoner’s 
dilemma, death days, Type I and Type II errors, polling, correlation, and other 
topics. The author claims that most of these examples share the general problem of 
the balancing of conflicting concerns, but for me this common denominator was not 
much in evidence. 

With this short summary in hand, let us return to the questions asked in the first 
paragraph of this review. To whom is this book addressed? It is fairly obvious that 
no innumerate person is going to read it. In fact, probably only quite highly 
numerate people are likely to find the book interesting. If the book is viewed as a 
compendium of often counterintuitive, attention-gaining numerical examples for 
teachers, then it can be given high praise since Professor Paulos has missed very few 
such examples. But the author’s telegraphic style and encapsulated discussions will 
be incomprehensible to any “innumerate” who happens to pick up the book. 

More generally, who is interested in the kinds of numerical calculations the 
author exemplifies in his Chapter 1? Archimedes calculated the number of grains of 
sand it would take to fill the universe because he wanted to show that such a 
calculation could be done. Granted the Greek system of numeration, it was no mean 
feat. But apart from the transient sense of triumph, did anyone really care what 
number he came up with? The same thing is even more obviously true with 
Professor Paulos’ own example: the depth of all of the human blood in the world if 
it were stored in a reservoir based in Central Park. Many numerates would have to 
agree with the innumerates: Who cares? Nonetheless, some of us definitely enjoy 
working such things out. But the key phrase is “some of us.” In fact, it is very few of 
us, and most mathematicians are not included. My observation is that the enjoy- 
ment of such calculations is not a common characteristic, even among numerate 
people. The reason for the enjoyment seems to be partly that of the mountaineer: It 
is there! But the other part may be more a matter of humor than of anything 
else—and Professor Paulos is the author of a well-known book on mathematics and 
humor. Thus, if someone asks whether a ballpoint pen costs more per mile than 
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does a typical car there is, for some of us, a certain entertainment value to the 
question and, hence, to the answer. Nonetheless, all of this suggests that in the 
present book Professor Paulos has undertaken the easy task of preaching to the 
already converted. The hard task is converting the unbelievers. 

More importantly, it is folly and false to assume that there is any relationship 
between these kinds of calculations and the problem of innumeracy. It is false for 
the reasons already noted, namely, that highly numerate people do not find such 
calculations of any interest. It is folly because it misconstrues the problem of 
innumeracy. The analogy with illiteracy is helpful here. In the area of reading, the 
social problem is generally recognized to be that of what is called functional 
illiteracy. This includes all of those persons who are unable to read signs and the 
most simple instructions. We can all agree wholeheartedly that such persons do 
represent a social problem because they are undergoing hazards that most of us 
avoid, and because they are unable to participate in most socially desirable activi- 
ties. However, we surely do not argue that anyone has some degree of illiteracy that 
is of social concern simply because that person does not ever read Shakespeare or 
Thomas Mann. Some of us may privately believe that life without reading Shake- 
speare is life misspent but we do not establish scales of illiteracy based on this 
belief. This, it seems to me, is what Professor Paulos is doing when he equates, at 
least by implication, innumeracy with the inability to perform or follow the 
relatively intricate calculations he presents. 

Another pervasive feature of the author’s approach can be illustrated by one of 
his probability examples, the birthday problem. There are many informed people 
who argue that one of the most important subjects that ordinary individuals, 
including innumerates, would find useful to know is elementary probability theory. 
Unfortunately, as any teacher of beginning statistics can testify, many highly 
intelligent students find the subject distressingly difficult. One reason for this is 
undoubtedly that the methods of solution of elementary probability problems often 
seem to the student to smack of ad hoc conjuring tricks. There are not at all the 
kind of stable procedures that the student finds, say, in algebra or differential 
calculus. This fact ought to go far towards defining what a book like this one ought 
to say about probability theory. It seems to me that Professor Paulos’ approach is 
quite otherwise. Consider, then, the birthday problem. It is, of course, quite 
counterintuitive that for a randomly selected group of 23 persons the probability is 
a little more than one-half that at least two will have the same day of the year for 
their birthday. But how does working the argument out benefit an innumerate? 
Does knowing the answer, or the argument, help the reader to the slightest degree in 
estimating or calculating the answer to any variation of the problem? Absolutely 
not. Let the reader of this review try these variations as examples. There are 12,000 
taxicabs in New York City. How many riders are required to make the probability 
one-half that the same taxicab has been used at least twice? There are about 2.6 
million five-card poker hands. How many hands must be dealt to make the 
probability one-half that at least one hand has occurred twice? Return to the 
birthday problem itself. How many persons must be randomly selected so that the 
probability is one-half that at least three have the same birthday? At least five? At 
least ten? How many persons so that the probability is .9 that at least two have the 
same birthday? (The answers are at the end of this review.) I think the reader will be 
convinced that neither the answer nor the method of the birthday problem is easily 
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generalizable. What, then, is the advantage of presenting the argument to an 
innumerate reader? 

My hypothesis is that Professor Paulos has been misled by his own definition of 
the problem of innumeracy. By defining numeracy in terms of the ability to 
duplicate the solutions of his examples he has put it beyond the reach of most 
innumerates. In fact, I suspect that almost all of the readers of his book will be 
persons who already share the author’s opinions and his enjoyment of calculation 
for its own sake. An innumerate person who chances to pick up this book will 
probably recoil in fear and trembling. It is most unlikely that such a person would 
be able to negotiate the arithmetical difficulties presented by the author. 

With a more nearly correct definition of the problem of innumeracy there is, I 
believe, some hope for its solution. We need to start with a definition of functional 
innumeracy. This would surely mean such things as inability to calculate an 8.25% 
sales tax on a purchase, inability to calculate the price that results from a 20% 
reduction in price, inability to fill out the income tax forms, inability to calculate the 
paint required for an apartment given the square feet covered per gallon, and so 
forth. These are the kinds of things that everyone needs to be able to do—and can 
learn to do. Whether these things can be self-taught from a book is not clear and is 
perhaps doubtful. But it is certainly clear that these are desirable and achievable 
goals. 

The problem of finding the minimal kind of knowledge of ‘probabilities that is 
needed to escape functional innumeracy is more vexing. Perhaps the most important 
kinds of knowledge have nothing to do with probabilistic calculations. There is a 
fair amount of evidence that living in accordance with rational prescriptions 
requires ability to estimate probabilities based on one’s own knowledge (“ probabil- 
ity assessing,” in the literature), an ability to behave rationally regarding risks, and 
an awareness of the general overestimation of small probabilities (the “over-under 
effect,” in the literature). There is a considerable body of experimental evidence that 
these are all learnable skills. Something clearly needs to be done in this area in any 
general assault on the problem of innumeracy. 

As already mentioned, it is not clear whether any or all of this might be 
accomplished through self-study from an appropriate book. But it is surely worthy 
of note that Professor Paulos’ section on a logarithmic safety index is a very good 
initial statement on how to think about risks. One can easily imagine that if such a 
book can be successful then perhaps Professor Paulos is precisely the person to 
write it. 


(Answers to questions: 128 rides; 1900 poker hands or nine hours of play for seven 
players at the rate of two minutes per hand; 87 persons for > 3; 310 for > 5; 1170 
for > 10; 41 persons for a probability of .9.) 
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General, S(13-16). Higher Mathematics in Prob- 
lems and Ezercises. P.E. Danko, A.G. Popov, T. 
Ya. Kozhevnikova. MIR (US Distr: Imported Pub), 
1988, $14.95 each. Part 1, 400 pp, (ISBN: 0-8285- 
3957-X]; Part 2, 480 pp. [ISBN: 0-8285-3958-8] The 
Russian equivalent of the Schaum Outline Series: a 
collection of specially selected exercises and solutions 
aimed at revealing the essential features of the main 
concepts and theorems of undergraduate mathemat- 
ics (especially engineering mathematics). Each sec- 
tion begins with a brief summary of relevant theo- 
retical underpinnings, then proceeds to sample prob- 
lems and solutions, and concludes with exercises for 
the student; numerical answers in the back. Part 
1 covers analytic geometry (with elements of vector 
algebra), linear algebra, calculus, elementary analy- 
sis, linear programming. Part 2 includes multivari- 
able calculus, differential equations, probability the- 
ory, complex variables, operational calculus, numeri- 
cal analysis, calculus of variations. The two volumes 
contain over 2800 examples and exercises; you can’t 
beat the price! Especially appropriate for the college 
bookstore. LCL 


Education, S, P. Mathematics Projects Handbook, 
Third Edition. Adrien L. Hess, Glenn D. Allinger, 
Lyle E. Andersen. NCTM, 1989, v + 52 pp, $5 (P). 
[ISBN: 0-87353-283-X] Designed as a guide for teach- 
ers and students in choosing mathematics projects, 
from classroom learning experiences to exhibits for 
entrance in competitions. This edition contains stu- 
dent guides and up-dated references, and places more 
emphasis on topics such as discrete mathematics, 
probability, and statistics (First Edition, TR, Jan- 
uary 1978; Second Edition, TR, October 1982). SB 


Education, P, L. Results from the Fourth Math- 
ematics Assessment of the National Assessment 
of Educational Progress. Ed: Mary Montgomery 
Lindquist. NCTM, 1989, viii + 173 pp, $15 (P). 
[ISBN: 0-87353-274-0] Examples of test items from 
the 1986 assessment for objectives in seven content 
areas: mathematical methods, discrete mathemat- 
ics, data analysis, measurement, geometry, relations 
and functions, and numbers and operations. Sum- 


marizes and interprets results on individual items 
for grades 3, 7, and 11. Analyzes performance dif- 
ferences among minority and gender groups. More 
detail than has appeared in NCTM journals. MW 


Education, S(16-17). How to Use Conjecturing 
and Microcomputers to Teach Geometry. Daniel 
Chazan, Richard Houde. NCTM, 1989, vii + 56 pp, 
$5 (P). [ISBN: 0-87353-279-1] Answers questions 
posed by teachers over five years of workshops on 
using Geometric Supposer software to teach geom- 
etry and engage students in behavior “like working 
members of the community of mathematicians.” De- 
tailed suggestions for planning, teaching, and testing. 
Excellent for teachers planning to use the Supposers; 
too specific for transfer to other situations. MW 


History, S, P. Charles Loewner, Collected Papers. 
Ed: Lipman Bers. Birkhauser Boston, 1988, xii 
+ 517 pp, $79. (ISBN: 0-8176-3377-4] The pub- 
lished papers of Charles Loewner. His contributions 
in the areas of complex analysis and differential ge- 
ometry are part of the classical literature in these 
subjects. LCL 


Logic, T(15-16: 1), L. Notes on Logic and Set The- 
ory. P.T. Johnstone. Cambridge U Pr, 1987, x + 111 
pp, $34.50; $12.95 (P). (ISBN: 0-521-33502-7; 0-521- 
33692-9] Text aims to present “everything that the 
well-educated mathematician in the late twentieth 
century needs to know about the foundations of his 
subject.” Covers first-order logic through complete- 
ness theorem, recursion theory, Zermelo-Fraenkel ax- 
ioms, ordinal and cardinal arithmetic, incomplete- 
ness theorems. Assumes knowledge of abstract alge- 
bra. KES 


Logic, P. Cartesian Closed Categories of Domains. 
A. Jung. CWI Tract, V. 66. Mathematisch Centrum, 
1989, 110 pp, Dfl. 17,80 (P). [ISBN: 90-6196-376-1] 
A study of objects which have no finite description 
and functions defined on them. The objective is to 
strike a balance between the idealistic and the realis- 
tic; the framework for satisfying these requirements 
is that of algebraic directed-complete partial orders 
(called domains). This tract presents the author’s 
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thesis results. LCL 


Logic, S(16-18), P. Proofs and Types. Jean-Yves 
Girard. Transl: Paul Taylor, Yves Lafont. Tracts 
in Theoret. Comput. Sci. Cambridge University Pr, 
1989, xi + 176 pp, $34.50. (ISBN: 0-521-37181-3] 
A short course on typed A-calculus (Prawitz’ natu- 
ral deduction, sequent calculus, Godel’s System T,, 
System F) given af the Université Paris VII in 1986- 
87. LCL 


Foundations, P. Toposes and Local Set Theo- 
ries, An Introduction. J.L. Bell. Logic Guides, 
V. 14. Clarendon Pr, 1988, xii + 267 pp, 
$75. (ISBN: 0-19-853274-1] Toposes (categories 
with features—products, subobjects, terminal ob- 
ject, power objects—which enable them to be con- 
sidered generalizations of the category of sets), and 
local set theories (where types replace sets as prim- 
itives) serve as alternatives to ordinary set theory 
as foundation for mathematics, yielding intuitionis- 
tic logic of types. Toposes are presented as models 
of theories, “domains of discourse” in which mathe- 
matical concepts may be defined, interpreted. RM 


Combinatorics, P. g-Series and Partitions. Ed: 
Dennis Stanton. IMA, V. 18. Springer-Verlag, 1989, 
xi + 212 pp, $24. [ISBN: 0-387-97086-X] Proceed- 
ings of a workshop in q-series and partitions held at 
the IMA, Minneapolis, March 1988. Includes papers 
on identities, unimodality of Gaussian polynomials, 
constant term problems, and orthogonal polynomi- 
als. LC 


Number Theory, P. Lecture Notes in Mathemat- 
tcs-1380: Number Theory. Eds: H.P. Schlickewei, E. 
Wirsing. Springer-Verlag, 1989, v + 266 pp, $24.30 
(P). [ISBN: 0-387-51397-3] Proceedings from the 
15th Journées Arithmétiques held at the University 
of Ulm in 1987. Contains only papers not appear- 
ing elsewhere ranging over many areas of current re- 
search. Included in the paper of G. Frey is progress 
on a promising approach to proving Fermat’s Last 
Theorem. GG 


Algebra, T(14-15). The Number Systems: Foun- 
dations of Algebra and Analysis, Second Edition. 
Solomon Feferman. Chelsea, 1989, xii + 418 pp, 
$23.95. [ISBN: 0-8284-0333-3] A course in the foun- 
dation of the number system. The author begins 
with elementary set theory, assumes the existence 
of the positive integers, and then develops its prop- 
erties and properties of the integers, rationals, the 
reals (via cuts), and the complex numbers. Also dis- 
cussed are integral domains, polynomials, and ex- 
tension fields. Worthy of consideration as a text for 
the “transitional” course for mathematics majors. A 
reprinting of the 1964 edition. SG 


Algebra, L. A Survey of Lie Groups and Lie Al- 
gebras with Applications and Computational Meth- 
ods. Johan G.F. Belinfante, Bernard Kolman. Clas- 
sics in Appl. Math., V. 2. SIAM, 1989, iii + 164 
pp, $24.50 (P). [ISBN: 0-89871-243-2] Corrections 
of minor misprints, but no major changes from the 
1972 original edition (TR, January 1973). LC 


Algebra, P, L. Lezikon der Algebra. Gunther 
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Eisenreich. Akademie-Verlag, 1989, 677 pp, 80 DM. 
[ISBN: 3-05-500231-8] A modern, carefully orga- 
nized and extensive German dictionary of algebra. 
Statements of important theorems; many references 
to literature. JD-B 


Algebra, T(15-16: 1). Algebra for Computer Sci- 
ence. Lars Garding, Torbjorn Tambour. Universi- 
text. Springer-Verlag, 1988, ix + 198 pp, $29.80 (P). 
[ISBN: 0-387-96780-X] Very concise presentation of 
number theory and abstract algebra and their appli- 
cations to computer science. Proofs of many theo- 
rems left as exercises. Assumes mathematical matu- 
rity and elementary linear algebra. KES 


Algebra, P. Galois Groups over Q. Eds: Y. Ihara, 
K. Ribet, J.-P. Serre. Math. Sci. Res. Inst., V. 16. 
Springer-Verlag, 1989, x + 449 pp, $45. (ISBN: 0- 
387-97031-2] Inspired by a week-long conference at 
the Berkeley Mathematical Science Research Insti- 
tute, March 23-27, 1987. Principal themes of the 
conference are extensions of Q with finite simple Ga- 
lois groups; Galois actions on fundamental groups, 
nilpotent extensions of Q arising from Fermat curves, 
and the interplay between Gauss sums and cyclo- 
tomic units; and representations of Gal (Q/Q) with 
values in GL(2); deformations and connections with 
modular forms. SB 

Algebra, T*(18: 1-4), S, P, L. Basic Algebra II, 
Second Edition. Nathan Jacobson. WH Freeman, 
1989, xvii + 686 pp, $47.95. [ISBN: 0-7167-1933-9] 
A number of changes occur in the segment of the 
book devoted to commutative algebra, plus a num- 
ber of other improvements and corrections, in addi- 
tion to some new exercises and an updated index. 
(1980 edition, TR, January 1981.) LCL 


Calculus, T(13: 1). Applied Calculus: A First 
Course, Second Edition. Shirley O. Hockett, Mar- 
tin Sternstein. Robert E Krieger, 1989, x + 654 
pp, $44.50. [ISBN: 0-89464-351-7] Intended for 
business, biology, and social science students. No 
trigonometric functions. Lots of applied examples 
and exercises, generally routine and artificial. Brief 
sections on separable first order differential equa- 
tions, multivariable functions, and Lagrange multi- 
pliers. “Things to Remember” sections and extra 
exercises added for this edition. (1979 Van-Nostrand 
Reinhold edition, TR, June-July 1979.) JO 


Calculus, T*(13-14: 1-4), S*, L. Introductory 
Mathematics Through Science Applications. John 
Berry, Allan Norcliffe, Stephen Humble. Cam- 
bridge University Pr, 1989, xii + 547 pp, $79.50; 
$29.95 (P). [ISBN: 0-521-28446-5; 0-521-24119-7] 
An applications-based introduction to the first two 
years of college mathematics. Designed to “intro- 
duce the mathematics from a physical point of view” 
without overwhelming the student with mathemat- 
ical rigor. Stressing the application of calculus to 
the sciences, each chapter begins with several exam- 
ples “setting the new techniques to be introduced in 
the context of the physical world,” and includes two 
more “worked examples” sections and an extensive 
exercise set (with answers in the Appendix). First 
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five chapters provide a nice introduction to functions 
and their uses, then three chapters each on differen- 
tiation and integration. Introduces numerical meth- 
ods in context rather than as separate topics. Also 
includes substantial sections on ordinary and par- 
tial differential equations, probability, and statistics. 
Could be a good source for problems or as a text 
(note paperback option). Give it a look. SB 


Real Analysis, P. Regular Variation. N.H. Bing- 
ham, C.M. Goldie, J.L. Teugels. Encyclop. of Math. 
& Its Applic., V. 27. Cambridge University Pr, 
1989, xix + 494 pp, $34.50 (P). (ISBN: 0-521-37943- 
1] Changes from the 1987 hardback edition (TR, 
May 1988) include recent progress in the field (e.g., 
R. Trautner’s covering principle) and an expanded 
reference section. MLR 

Complex Analysis, T(17: 1), S, L. Elliptic 
Functions and Applications. Derek F. Lawden. 
Appl. Math. Sci., V. 80. Springer-Verlag, 1989, xiv 
+ 334 pp, $59. [ISBN: 0-387-96965-9] An introduc- 
tory book intended by the author to serve as a text 
for (British) undergraduates and “practicing applied 
mathematicians and engineers.” Very readable any- 
time after a first course in complex analysis. Focuses 
on geometrical and physical, as opposed to number- 
theoretical aspects. Includes numerous exercises and 
several tables of computations. GG 


Partial Differential Equations, P. Heat Kernels 
and Spectral Theory. E.B. Davies. Tracts in Math., 
V. 92. Cambridge U Pr, 1989, ix + 197 pp, $49.50. 
[ISBN: 0-521-36136-2] A concise, yet comprehen- 
sive, study of a classical subject—second-order lin- 
ear elliptic differential equations—with special em- 
phasis on modern techniques arising from theories 
of quadratic forms and Sobolev spaces. Such tech- 
niques lead to sharp, computable bounds on the in- 
tegral kernels in question. Most results are obtained 
for regions in Euclidean space, but a final chapter 
covers operators on Riemannian manifolds. PZ 


Operator Theory, S(18), P. Variational Prin- 
ciples for Nonpotential Operators. V.M. Filippov. 
Transl. of Math. Mono., V. 77. AMS, 1989, xiii + 
239 pp, $99. [ISBN: 0-8218-4529-2] Written to fill 
a perceived need for a monograph on variational prin- 
ciples for equations with nonsymmetric operators. 
Chapters I and II develop the theoretical founda- 
tions; Chapter III treats variational principles for lin- 
ear boundary value problems; Chapter IV concerns 
nonlinear equations. Chapter commentaries; bibliog- 
raphy. JS 

Functional Analysis, P. Lecture Notes in Mathe- 
matics-1382: Metaplectic Groups and Segal Algebras. 
Hans Reiter. Springer-Verlag, 1989, xi + 128 pp, 
$13.50 (P). [ISBN: 0-387-51417-1] Explores a con- 
nection between Weil’s metaplectic group and a cer- 
tain Segal algebra. MLR 


Functional Analysis, P. Characterizations of Ba- 
nach Spaces Not Containing £'. D. van Dulst. CWI 
Tracts, V. 59. Mathematisch Centrum, 1989, 163 
pp, Dfl. 25.30 (P). [ISBN: 90-6196-366-4] Covers 
the problem of determining whether or not a Banach 
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space contains an isomorphic copy of £1. LC 


Analysis, P. Commutative Harmonic Analysis. Ed: 
David Colella. Contemp. Math., V. 91. AMS, 1989, 
xi + 305 pp, $36 (P). Proceedings of an international 
conference on harmonic analysis held in July 1987 at 
St. Lawrence University, sponsored by GTE. Volume 
includes twenty-three papers on a wide range of top- 
ics. PZ 


Analysis, T(18), P. Orthogonal Series. B.S. 
Kashin, A.A. Saakyan. Transl. of Math. Mono., V. 
75. AMS, 1989, xii + 451 pp, $146. [ISBN: 0-8218- 
4527-6] Presents the main idea and methods of the 
theory of general orthogonal series. Includes chap- 
ters on the Haar system, trigonometric and Walsh 
systems, the Faber-Schauder and Franklin systems, 
and results on convergence, divergence, and repre- 
sentations of orthogonal series. Aimed at the begin- 
ning mathematics student with some background in 
functional analysis and complex analysis. LC 
Algebraic Geometry, T(16-18: 1), S, P, L. In- 
troduction to Algebraic Curves. Phillip A. Griffiths. 
Transl. of Math. Mono., V. 76. AMS, 1989, x + 221 
pp, $93. (ISBN: 0-8218-4530-6] Translation from 
the Chinese text of notes based on a course taught by 
the author at Beijing University in 1982. Chapters 
include fundamental concepts (curves in the com- 
plex projective plane, Riemann surfaces, meromor- 
phic functions, differential forms, complex manifolds, 
algebraic varieties), the Normalization Theorem, the 
Riemann-Roch Theorem, and Abel’s theorem. Good 
first reference for non-specialists; nice use of graph- 
ics; numerous exercises within the text; annotated 
bibliography, and test problems in the Appendix. 
Assumes working knowledge of elementary complex 
function theory and algebra. SB 

Algebraic Geometry, P. Regular Differential 
Forms. Ernst Kunz, Rolf Waldi. Contemp. Math., 
V. 79. AMS, 1988, ix + 153 pp, $20 (P). [ISBN: 
0-8218-5085-7] Studies, in a general algebraic set- 
ting, local properties of sheaves of regular differen- 
tial forms on a quotient of Noetherian schemes. This 
work extends to the “relative” case a variety of re- 
sults already known in the “absolute” case, i.e., for 
varieties over fields. PZ 


Algebraic Geometry, S(18), P. Torsion Theoretic 
Algebraic Geometry. Frederick W. Call. Papers in 
Pure & Appl. Math., No. 82. Queen’s U, 1989, 118 
pp, (P). Brief lecture notes which could reasonably 
supplement an algebraic geometry course (categories, 
schemes, cohomology, etc.). GG 

Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1389: Algebraic Curves and Projective Ge- 
ometry. Eds: E. Ballico, C. Ciliberto. Springer- 
Verlag, 1989, 288 pp, $24.30 (P). [ISBN: 0-387-51509- 
7] Proceedings of a 1988 conference held in Trento, 
Italy containing over twenty papers on the topic. SG 


Differential Geometry, P. Lecture Notes in 
Mathematics-1377: Singularity Theory, Rod Theory, 
and Symmetry-Breaking Loads. John F. Pierce. 
Springer-Verlag, 1989, iv + 177 pp, $16.20 (P). 
[ISBN: 0-387-51304-3] Uses methods of modern 
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analysis and singularity theory to examine how orbits 
for rods subject to symmetric loads and in equilib- 
rium change when the loads are perturbed so that 
the symmetry is broken. For applied mathemati- 
cians and researchers in mechanics. Poses new prob- 
lems. DFA 


Geometry, T*(15-16: 1), L*. A Course in Mod- 
ern Geometries. Judith N. Cederberg. Undergrad. 
Texts in Math. Springer-Verlag, 1989, xii + 232 pp, 
$42. (ISBN: 0-387-96922-5] For junior-senior level 
survey course in geometry. Seeks to develop geomet- 
rical insights into abstract-appearing results. Brief 
history of Euclid’s Elements with major emphasis on 
geometries invented after Euclid. Synthetic approach 
followed by analytical treatment in later chapters. 
Axiomatics, finite geometries, transformations, and 
an introduction to projective geometry. Excellent 
reading and viewing lists. Exercises, many of which 
complement or extend the text. Prerequisites are 
some high school geometry and a bit of background 
in linear algebra. Nicely paced. JDEK 


Geometry, S*(16), P*, L*. The Ethics of Ge- 
ometry: A Genealogy of Modernity. David Rapport 
Lachterman. Routledge, Chapman & Hall, 1989, xiv 
+ 255 pp, $17.95 (P); $49.50. [ISBN: 0-415-90141- 
3; 0-415-90053-0] Scholarly treatment of the rela- 
tionship between philosophy and mathematics. “Es- 
sential differences in the ethos of mathematic3, i.e., 
the customary ways of undertaking and understand- 
ing mathematical procedures and their objects, pro- 
vide insight into the fundamental issues in the quar- 
rel of moderns with ancients.” Compares the roles 
of constructions in Euclid’s geometry and in that of 
Descartes. Extensive notes with references to both 
primary and secondary sources. Invites comparison 
with Fowler’s The Mathematics of Plato’s Academy 
and Knorr’s The Ancient Tradition of Geometric 
Problems. Heavy going. JDEK 


Geometry, S(18), L. Global Differential Geome- 
try. Ed: S.S. Chern. Stud. in Math., V. 27. 
MAA, 1989, ii + 354 pp, $36.50. [ISBN: 0-88385- 
129-6] Updated version of V. 4 of same series. Ten 
essays on fundamental topics, including four new 
chapters on vector bundles with connection, mini- 
mal surfaces, Riemannian comparison constructions, 
and manifolds of non-positive curvature. JO 


Geometry, P. Lecture Notes in Mathematics- 
1355: Computational Synthetic Geometry. Jurgen 
Bokowski, Bernd Sturmfels. Springer-Verlag, 1989, 
168 pp, $16.20 (P). [ISBN: 0-387-50478-8] Dis- 
cusses algorithmic aspects of certain realizability 
problems in discrete geometry. Problems covered in- 
clude those of constructing polytopes from simplicial 
complexes, vector geometries from incidence struc- 
tures, and hyperplane arrangements from oriented 
matroids. LC 


Algebraic Topology, P. Algebraic Topology. Eds: 
Mark Mahowald, Stewart Priddy. Contemp. Math., 
V. 96. AMS, 1989, xiii + 350 pp, $41 (P). [ISBN: 0- 
8218-5102-0] Proceedings of the International Con- 
ference on Algebraic Topology held at Northwest- 
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ern University, Evanston, Illinois from March 21-24, 
1988. Contains papers either presented or solicited 
at the conference. SB 


Algebraic Topology, P. Groups Acting on Graphs. 
Warren Dicks, M.J. Dunwoody. Stud. in Adv. Math., 
V. 17. Cambridge U Pr, 1989, xvi + 283 pp, 
$69.50. (ISBN: 0-521-23033-0] Research mono- 
graph on groups acting on low-dimensional topolog- 
ical spaces. Includes standard results on groups act- 
ing on trees, Boolean rings of graphs, characteriza- 
tion of Poincaré duality groups, and the equivalent 
loop and sphere theorems. Assumes familiarity of 
homological algebra and algebraic topology. LC 
Dynamical Systems, S(15-17), P*, L**. 
Chaos and Fractals: The Mathematics Behind the 
Computer Graphics. Eds: Robert L. Devaney, 
Linda Keen. Proc. of Symposia in Appl. Math., 
V. 39. AMS, 1989, ix + 148 pp, $31. [ISBN: 0- 
8218-0137-6] Expanded lectures from the August 
1988 AMS Short Course (at the Providence Centen- 
nial meeting): Smale’s horseshoe map, chaotic at- 
tractors, Julia and Mandelbrot sets, fractals, and ap- 
plications of iterated functions to data compression. 
Provides substantive introductions to many related 
topics revealing the mathematics behind the glitz. 
Many color plates enhance the appeal of this attrac- 
tive volume. LAS 


Dynamical Systems, S(17-18), P. Chaos in Dis- 
sipativen Systemen. Ronald W. Leven, Bernd-Peter 
Koch, Bernd Pompe. Friedr. Vieweg & Sohn, 1989, 
189 pp, DM 32 (P). [ISBN: 3-528-06356-4] 


Probability, T(17: 2), S. The Theory of Probabil- 
ity. B.V. Gnedenko. MIR (US Distr: Imported Pub), 
1988, 392 pp, $12.95. Translated from Russian. Cov- 
ers the fundamentals of probability theory including 
the concept of probability, independent trials, ran- 
dom variables and distribution functions, expecta- 
tion, limit theorems, laws of large numbers, charac- 
teristic functions, stochastic processes, and queueing 
theory. Good use of graphs; teaches through ex- 
amples; emphasizes limit theorems; does not cover 
combinatorics; notation complicated; organization 
of text hard to follow for undergraduates. Good 
theoretical resource text. (1967 Chelsea edition, 
TR, February 1968; 1976 MIR edition, TR, August- 
Spetember 1976; 1978 MIR edition, TR, November 
1981.) JJ 


Probability, S(17-18). Generalized Poisson Dis- 
tributions: Properties and Applications. P.C. Con- 
sul. Stat.: Textbooks & Mono., V. 99. Mar- 
cel Dekker, 1989, x + 302 pp, $89.75. [ISBN: 0- 
8247-7863-4] Complete treatment of the general- 
ized Poisson distribution and generalized Poisson 
process. Covers both applications and theoretical 
results. Develops confidence intervals and statistical 
tests for the two parameters, A and 6, using maxi- 
mum likelihood estimation. Includes computer pro- 
grams for calculating probabilities, confidence inter- 
vals, etc. Excellent reference text. JJ 


Elementary Statistics, T(13-14: 1). Principles 
of Statistics. Paul A. Herzberg. Robert E Krieger, 
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1989, xvi + 513 pp, $48.50, (ISBN: 0-89464-374-6]: 
Study Guide, Elke U. Weber, 343 pp, $16.50 (P). 
[ISBN: 0-89464-409-2] Introductory text emphasiz- 
ing concepts and requiring only algebra. Large por- 
tion devoted to hypothesis testing, including tests 
for categorical and ranked variables. Careful atten- 
tion given to requirements for procedures to be valid. 
Discusses elementary probability, descriptive statis- 
tics, several measures of association, regression, and 
ANOVA. Study Guide contains objectives, tips and 
reminders, and ten multiple-choice questions for each 
chapter of text. RWJ 


Elementary Statistics, S*, P, L**. Scien- 
tific Reasoning: The Bayesian Approach. Colin 
Howson, Peter Urbach. Open Court, 1989, xiii 
4+ 312 pp, $16.95 (P); $34.95. (ISBN: 0-8126- 
9085-0; 0-8126-9084-2] A comprehensive apologia 
for Bayesian methods as providing a firmer logi- 
cal foundation for scientific inference than the more 
fashionable classical theories of inference (hypoth- 
esis testing, randomized trials, confidence inter- 
vals). Explains and defends Bayesian methods— 
especially against the charge of subjectivity—and 
also criticizes classical statistics as inconsistent, il- 
logical, and contrary to the actual practice of sci- 
ence. Clearly written; carefully referenced; accessible 
to non-specialists. LAS 


Computational Statistics, T(13-14: 1). Guide 
to Mathematical Modelling. Dilwyn Edwards, Mike 
Hamson. Math. Guides. Macmillan Education (US 
Distr: Scholium Intern), 1989, x + 277 pp, $23.50 
(P). (ISBN: 0-333-45935-0] A first-year course in 
undergraduate mathematics, statistics, or comput- 
ing. Provides an approach to formulate and solve 
real problems in terms of mathematics and statis- 
tics. Modelling problems presented include those 
involving random features which demand statistical 
analysis, and situations which give rise to differen- 
tial equations. Communication skills addressed in 
one full chapter, and ample interesting example mod- 
els are presented in another. Many problems posed 
throughout. MSS 


Statistics, P. Lecture Notes in Statistics-57: Statis- 
tical Modelling. Eds: A. Decarli, et al. Springer- 
Verlag, 1989, ix + 343 pp, $37.80 (P). [ISBN: 0-387- 
97097-5] Proceedings of the July 1989 joint meeting 
of the GLIM 89 Conference and the Fourth Inter- 
national Workshop on Statistical Modelling held in 
Trento, Italy. Most of the thirty-seven papers deal 
with the use and/or development of GLIM software 
for general linear models. RWJ 


Statistics, S(17-18). Transformation and Weight- 
ing in Regression. Raymond J. Carroll, David Rup- 
pert. Mono. on Stat. & Appl. Prob. Chapman & 
Hall, 1988, x + 249 pp, $45. [ISBN: 0-412-01421-1] 
Examines regression methods for data in which the 
classical regression assumptions of constant variance 
and normally distributed data are violated. Methods 
discussed fit into two groups: weighting and weighted 
least squares, and data transformations. Discusses 
maximum likelihood estimators and transformations 
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on both sides of regression equation. Many examples 
with data; good use of graphs. JJ 


Programming, P. A Guidebook to Fortran on 
Supercomputers. John M. Levesque, Joel W. 
Williamson. Academic Pr, 1989, vii + 218 pp, 
$39.95. [ISBN: 0-12-444760-0] Guide to develop- 
ing and restructuring Fortran programs for super- 
computers (those that exploit parallelism) so that 
optimizing compilers can translate code to take full 
advantage of the underlying architecture. Discussion 
of architecture and vectorization techniques; includes 
many restructuring examples. RM 


Computer Systems, S(15-16), P, L. Supercom- 
puters and their Use, Revised Edition. Christopher 
Lazou. Clarendon Pr, 1988, xi + 263 pp, $55; $27.50 
(P). (ISBN: 0-19-853815-4; 0-19-853759-X] Relatively 
accessible introduction to and overview of super- 
computers: architecture and organization, under- 
lying technology, performance, illustrative applica- 
tions, likely future developments (some now dated), 
discussions of novel architectures. (Revision of 1986 
edition.) RM 

Theory of Computation, P. Lecture Notes in 
Computer Science-379: Mathematical Foundations of 
Computer Science 1989. Eds: A. Kreczmar, G. 
Mirkowska. Springer-Verlag, 1989, viii + 605 pp, 
$45.50 (P). [ISBN: 0-387-51486-4] Forty research 
papers originally presented at the Fourteenth Sym- 
posium in this series held in Poland in 1989. The 
papers address advanced and theoretical research in 
such subareas of computer science as automata the- 
ory, logic programming, theory of concurrent pro- 
cesses, and formal connectness proofs. GMS 


Theory of Computation, P. Theory of Deduc- 
tive Systems and Its Applications. S. Yu. Maslov. 
Transl: Michael Gelfond, Vladimir Lifschitz. Ser. in 
Foundat. of Comput. MIT Pr, 1987, x + 151 pp, $20. 
[ISBN: 0-262-13223-0] A deductive system consists 
of initial objects and rules for generating new objects. 
This monograph presents mathematics of deductive 
systems based on Post’s canonical calculi and de- 
scribes applications in economics, biology, computer 
science, and operations research. Translation of 1986 
Russian publication. KES 


Theory of Computation, P, L. Computational 
Complezity Theory. Ed: Juris Hartmanis. Proc. 
of Symp. in Appl. Math., V. 38. AMS, 1989, ix 
+ 128 pp, $33. [ISBN: 0-8218-0131-7] Introduc- 
tory survey lectures from the Short Course at the 
January 1988 AMS meeting in Atlanta. Six papers 
for the nonspecialist, including an overview of the 
subject by the editor. The others concern the iso- 
morphism conjecture and sparse sets; restricted rel- 
ativizations of complexity classes; descriptive and 
computational complexity (concerns parallel process- 
ing); complexity issues in cryptography; interactive 
proof systems. DFA 

Theory of Computation, P. Resolution of Equa- 
tions in Algebraic Structures, Volume 2: Rewriting 
Techniques. Eds: Hassan Ait-Kaci, Maurice Nivat. 
Academic Pr, 1989, xx + 379 pp, $64.95. [ISBN: 0- 


1990] 


12-046371-7] The second of two volumes containing 
papers originally presented at a colloquium held in 
Austin, Texas, May 1987. The twelve papers in this 
monograph are all directed at presenting advanced 
research in using the resolution method for solving 
symbolic equations. GMS 


Artificial Intelligence, P. Automated Theorem- 
Proving in Non-Classical Logics. Paul B. Thistle- 
waite, Michael A. McRobbie, Robert K. Meyer. Wi- 
ley, 1988, 154 pp, $19.95 (P). [ISBN: 0-273-08781- 
9] Discusses history of and motivations for non- 
classical theorem-proving, and describes develop- 
ment of program for deciding theoremhood in rel- 
evant logic. KES 


Artificial Intelligence, P. Blackboard Architec- 
tures and Applications. Eds: V. Jagannathan, Ra- 
jendra Dodhiawala, Lawrence S. Baum. Perspect. 
in Artif. Intellig., V. 3. Academic Pr, 1989, xxix 
+ 524 pp, $39.95. [ISBN: 0-12-379940-6] Twenty- 
one papers from the AAAI-87 and -88 Blackboard 
workshops presenting recent developments. Topics: 
organization and control, advanced architectures— 
concurrency and parallelism, real-time systems, de- 
velopment environments, applications. An introduc- 
tion traces the history of the blackboard concept. 
Over 250 references. DFA 


Applications, S, P. Urban Rank-Size Hierarchy: 
A Mathematical Interpretation. James W. Fonseca. 
Mono. No. 8. Institute of Mathematical Geography 
(2790 Briarcliff, Ann Arbor, MI 48105), 1989, 86 pp, 
$15.95 (P). We know that the Fibonacci numbers, 
F(n), appear “in nature” in certain growth patterns 
(e.g., conical shells, sunflowers, pine cones, galaxies). 
Here the author shows that in the United States, the 
urban rank-size hierarchy can be modeled quite accu- 
rately in this same way. One manifestation of this is 
that the ratio of the F(n+1)** ranked urban area of 
the F(n)'* is approximately ¢, the golden ratio. LCL 


Applications (Biological Science), $(16-17), P, 
L. Dynamic Modeling in Behavioral Ecology. Marc 
Mangel, Colin W. Clark. Mono. in Behavior & Ecol- 
ogy. Princeton U Pr, 1988, xii + 308 pp, $15.95 (P); 
$45. (ISBN: 0-691-08506-4; 0-691-08505-6] Self- 
contained introduction to construction and use of 
dynamic stochastic optimization models in behav- 
ioral ecology. Based on evolutionary principles; ap- 
proach allows consideration of many types of behav- 
ior; decisions to be considered simultaneously; in- 
corporation of various constraints. Applications to 
hunting behavior, insect reproduction, migration of 
aquatic organisms, clutch size in birds, movement of 
spiders. RM 


Applications (Economics), T(18: 2). Advanced 
Econometric Methods. Thomas B. Fomby, R. Carter 
Hill, Stanley R. Johnson. Springer-Verlag, 1984, xix 
+ 624 pp, $35 (P). [ISBN: 0-387-96868-7] Paper- 
back version of a comprehensive graduate-level text 
for econometrics. Covers most major topics in econo- 
metrics. Assumes readers who have had an econo- 
metric methods course, and who are familiar with 
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matrix notation in regression. Econometric theory 
covered, but only proofs of basic theorems are given. 
Some data for analysis provided in exercises. (1984 
hardcover edition, TR, February 1985.) JJ 
Applications (Engineering), S(17-18), P. Ez- 
pert Systems for Engineering Design. Ed: Michael 
D. Rychener. Academic Pr, 1988, xii + 306 pp, 
$39.95. [ISBN: 0-12-605110-0] Selection of papers 
on the application of expert systems techniques to 
engineering design problems. Discussion of existing 
systems, trends, with theme of synthesis, nature of 
expertise, integration of software, and graphics inter- 
faces. RM 


Applications (Fluid Dynamics), P. The Lin- 
earization Method in Hydrodynamical Stability The- 
ory. V.I. Yudovich. ‘Transl. of Math. Mono., 
V. 74. AMS, 1989, iii + 170 pp, $74. [ISBN: 0- 
8218-4528-4] Translation of the 1984 Russian edi- 
tion. Applies Lyapunov’s method of linearization to 
Navier-Stokes equations and the stability of steady 
and time-periodic fluid motion. Bibliography in- 
cludes many references to the Soviet literature. JO 
Applications (Physics), P. Noise in Nonlinear 
Dynamical Systems, Volume 1: Theory of Contin- 
uous Fokker-Planck Systems. Eds: Frank Moss, 
P.V.E. McClintock. Cambridge University Pr, 1989, 
xvi + 352 pp, $85. [ISBN: 0-521-35228-2] First of 
three volumes comprising a collection by different au- 
thors which may be read individually. This volume 
deals with the basic theory of stochastic nonlinear 
systems. It also includes a detailed history of the sub- 
ject and the first English translation of the seminal 
1933 paper of Pontriagin, Andronov, and Vitt. KS 
Applications (Quality Control), T(17-18: 1), 
S. Statistical Methods for Quality Improvement. 
Thomas P. Ryan. Appl. Prob. & Appl. Stat. Wiley, 
1989, xviii + 446 pp, $49.95. [ISBN: 0-471-84337-7] 
Graduate-level text and excellent resource on quality 
control. Provides extensive coverage of control charts 
including CUSUM charts, attribute control charts, 
and multivariate control charts. Basic introduction 
to probability and statistics covers distributions used 
in book. Other topics beyond control charts include 
graphical methods, linear regression, design of ex- 
periments (including factorials, fractional factorials, 
Taguchi methods), and analysis of means. Contains 
some examples and exercises, but could contain more 
examples presenting successful uses of these methods 
in actual companies. JJ 


Reviewers 


DFA: David F. Appleyard, Carleton; SB: Steve Benson, 
St. Olaf; LC: Laura Chihara, St. Olaf; SG: Steven Galovich, 
Carleton; GG: George Gilbert, St. Olaf; RWJ: Roger W. John- 
son, Carleton; JJ: Jason Jones, St. Olaf; JDEK: Joseph D.E. 
Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; JO: 
Jeff Ondich, St. Olaf; MLR: Margaret L. Reese, St. Olaf; KS: 
Karen Saxe, St. Olaf; GMS: G. Michael Schneider, Macalester; 
JS: John Schue, Macalester; KES: Kay E. Smith, St. Olaf; 
LAS: Lynn Arthur Steen, St. Olaf; MSS: Myriam S. Stein- 
back, Macalester; MW: Martha Wallace, St. Olaf; PZ: Paul 
Zorn, St. Olaf. 


MATHEMATICS: 
Queen and Servant 


of Science 


by E. T. Bell 


“Bell is a lively, stimulating writer, inoffensively crotchety 
and opinionated, with a good sense of historical circum- 
stance, a fine impatience with humbug, a sound grasp 
of the entire mathematical scene, and a gift for clear and 
orderly explanation.” 

James R. Newman, The World of Mathematics 


“Bell’s writing style makes you feel as if you are a for- 
tunate student spending time with a charming and gifted 
teacher. Not a history of mathematics, but a wide rang- 
ing exposition of ideas in the subject, the book is a classic 
and should be required reading for every teacher of 
mathematics.” 

Harold Jacobs, Grant High School, Van Nuys, California 


“This book deserves a place in today’s market. It is a 
much more popular work than most histories of the sub- 
ject, and that is exactly what makes it accessible to stu- 
dents as well as to non-mathematicians. It is rewarding 
reading for ... teachers and students of mathematics 
at all levels.” 

Morris Kline, Courant Institute 
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ATHEMATICS 
Queen &Servant 


ERIC TEMPLE BELL 


xv + 437 pp. Paperbound 
Republished 1987. 
ISBN-0-88385-446-3 

List: $18.50 

MAA Member: $14.00 


This is an absorbing account of the development 
in pure and applied mathematics from the geom- 
etry of Euclid more than two-thousand years ago 
to some of the more recent developments in math- 
ematical physics. Published 30 years ago, MATHE- 
MATICS: QUEEN AND SERVANT OF SCIENCE 
remains a valuable tool for teachers, students, and 
all those enthusiastic about mathematics. The 
book includes 20 chapters on selected areas in 
pure and applied mathematics, including discus- 
sions on topics such as geometry, algebra, logic, 
probability, infinity, transformations, rings, groups, 
topology, and continuity. Also covered are sub- 
jects such as: Mersenne primes, quantum mechan- 
ics, and three-valued logic. The book is not a 
history of mathematics, but rather an enticing 


sampling of selected topics in mathematics and 
science. 


The book contains fascinating information on the 
personalities and philosophies of the great math- 
ematicians, plus perceptive resumes of their spe- 
cific contributions to the moving force of this great 
science. 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


AUEMANN SORE ECE 
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NANON | AL 
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WIS ALGEBRA 
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ALCULUS 


DENNIS CARRIE 


Factors in Success 


College Algebra and Trigonometry 
About 712 pages + hardcover 


College Algebra 
About 448 pages - hardcover 


College Trigonometry 

About 320 pages « hardcover 

By Richard N. Aufmann, Vernon C. 
Barker, and Richard Nation 

All of Palomar College 


Study Guide -« Instructor’s Manual 

Student Disk -« Computer Tutor™ « Test 
Booklet - Solutions Manual - Computerized 
Test Generator « Just published 


The new Aufmann, Barker, Nation series 
helps students understand the interaction 
of theory and practice in mathematics. 
After each example, students work selected 
problems for a thorough grasp of key con- 
cepts. Applications from disciplines such 
as biology, business, economics, physics, 
archaeology, and psychology give meaning 
to problems and exercises. 


Precalculus: An Introduction 
Dennis Carrie, Golden West College 


About 720 pages « hardcover - Instructor’s 
Manual «- Solutions Manual - Computerized 
Test Generator « Printed Test Bank « Just 
published 


No other text prepares students for calculus 
as well as Carrie does. Elementary func- 
tions and graphing are covered early and 
reinforced throughout, developing essential 
skills for subsequent courses. Abundant 
word problems are based on real-life 
situations and range from routine to more 
conceptual manipulations. 


For adoption consideration, request examination packages 
from your Houghton Mifflin regional office. 


<<, Houghton Mifflin 


13400 Midway Rd., Dallas, TX 75244-5165 
1900 S. Batavia Ave., Geneva, IL 60134 
925 E. Meadow Dr., Palo Alto, CA 94303 
101 Campus Dr., Princeton, NJ 08540 


GRADUATE FELLOWSHIPS AT $12K PER ANNUM 
Mathematical Sciences, University of Arizona 


Up to 15 fellowships for outstanding new graduate students in the mathematical sciences will be available 
in 1990-91. Fellowship applicants should be seeking the Ph.D. and planning careers in teaching and/or fundamental 
research. Anticipated stipends are $12,000 for 12 months with both in-state and non-resident tuition waived. 


While a few fellowships may be offered to foreign students, most are restricted to citizens, permanent 
residents, or individuals who have established intent to become citizens or permanent residents of the United States. 
Applications from U.S. women and students belonging to U.S. minority groups are particularly invited. Currently 
one-third of the U.S. graduate students in pure and applied mathematics at Arizona are women. 


The University of Arizona has excellent programs in traditional pure and applied mathematics, and is a 
leading institution in interdisciplinary applied mathematics. This presents a wealth of opportunities for graduate 
study encompassing such areas as dynamical systems, number theory, computational science, computational group 


theory, nonlinear partial differential equations, mathematical physics, probability and problems in theoretical 
engineering and interdisciplinary applied physics. In addition, outstanding computational facilities for graduate study 
and research are available to the over 165 graduate students in the mathematical sciences at the University of 
Arizona. 


Fellowship applicants of superior quality will be among the students invited to the Fourth Annual 
Workshop for Advanced Undergraduates on Current Ideas in the Mathematical Sciences, March 3-6, 1990. Limited 
support is available for attendees. The workshop is designed to communicate topics in current active research in 
three areas: (i) Geometry in Analysis and Number Theory, (ii) Numerical Methods for Multi-Scale Problems, and 
(iti) Mathematical Applications in the Life Sciences. 


For information and application materials write: W. M. Greenlee or T. W. Secomb, Department of 
Mathematics/Program in Applied Mathematics, University of Arizona, Tucson, AZ 85721 (602) 621-2068. 


The University of Arizona is an EOE/AA Employer 


Arizona Center for the Mathematical Sciences 
announces the 
Fourth Annual Workshop for Advanced Undergraduates 


CURRENT IDEAS IN THE MATHEMATICAL SCIENCES 
March 3-6, 1990 


The main goal of these workshops is to communicate to potential graduate students the 
level of excitement and activity in mathematical sciences today. The program will focus on 
topics in three areas: 


1. Geometry in Analysis and Number Theory 
2. Numerical Methods for Multi-Scale Problems 
3. Mathematical Applications in the Life Sciences 


Partial travel and subsistence support is available. The deadline for application material is 
February 2, 1990. Direct all correspondence to: 


W. M. Greenlee or T. W. Secomb 
Department of Mathematics, Program in Applied Mathematics 
University of Arizona, Tucson, Arizona 85721 
(602) 621-4664 


EQUATIONS SYMBOLICALLY OR NUMERICALLY ¢ HIDDEN-LINE SURFACE PLOTTING ¢ 2-DIMENSIONAL DISPLAY OF ALGEBRAIC 


cab 9.0 


discover it! 


Are you wondering how you can use a computer algebra and graphics system in your 
mathematics classroom or laboratory? 

We have some ideas that may surprise you. 

cal 2. O, the computer algebra and graphics system for learning, offers you 


¢ Scripting: Teachers create scripts - interactive lessons for your students 
A lecture, a laboratory, or a textbook on disk! 
Illustrate your points with high-resolution color pictures 
Include, as needed, mathematical objects or programs to support student exploration 
¢ Interactive Learning: Students explore your scripts, experimenting with the ideas that you have 
introduced... ca¢ 2.0 is menu-driven, easy to learn 
¢ Slide Shows: For group presentations, lectures, seminars 
Design your own graphics and text slides and save them as shows on disk 


Don't take our word for it - see for yourself! 
Order now, or ask for 30 day no obligation preview 


cat 2.0 Bluejay Lispware 
is field tested in calculus laboratory classrooms e P.O. Box 1904 
is available for IBM AT or PS/2 and compatibles lu a | ay Gambier, Ohio 43022 P| 
requires 640K RAM and EGA or VGA graphics (614) 427-3894 man 


specify EGA or VGA ___ specify 3.5" or 5.25" disks Lispware Price: $225.00 


EXACT RATIONAL ARITHMETIC AND ALGEBRA ¢ DECIMAL ARITHMETIC ¢ SOLVE SYSTEMS OF 
1X31 N33¥9S-11N4 ¢ JOV4YN3S.LNI N3AING-NNAW ‘ASV3 «© SMOGNIM GANId3G-¥aSN ¢ SNOISSAYdX3 
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Take the problem out of problem solving 


Beginning Algebra Intermediate Algebra 

Robert G. Marcucci Norman Siever 

San Francisco State University Los Angeles Valley College 

Harold L. Schoen, The University of Iowa About 672 pages « hardcover « Instructor’s 
About 512 pages « hardcover + Student Manual « Student Solutions Manual 
Solutions Manual « Instructor’s Manual Answer Key + Computerized Test 

Answer Key * Computerized Test Gener- Generator * Printed Test Bank + Just 

ator ¢ Printed Test Bank + Just published published 

Marcucci/ Schoen introduces and re- Intermediate Algebra reflects Siever’s 
inforces mathematical concepts with a understanding of what skills students need 
variety of real-world applications that to know and how to teach them. Using a 
effectively develop problem-solving capa- friendly tone and carefully sequenced 
bilities. Case studies, boxed inserts, and organization, Siever reinforces concepts 
numerous examples support text exposition, with introductory outlines, boxed inserts, 
helping students solve the vast assortment case studies, and clear examples and sum- 
of word problems. maries. Siever creates a real-life context 


for learning by emphasizing problem- 
solving skills. 


For adoption consideration, request examination packages from your Houghton Mifflin regional office. 


<3, Houghton Mifflin 


13400 Midway Rd., Dallas, TX 75244-5165 1900S. Batavia Ave., Geneva, IL 60134 
925 E. Meadow Dr., Palo Alto, CA 94303 101 Campus Dr., Princeton, NJ 08540 


GEOMETRY & TOPOLOGY 


FIRST CONCEPTS OF TOPOLOGY: 
The Geometry of Mappings of Seg- 
ments, Curves, Circles, and Disks 


W.G. Chinn and N.E. Steenrod 


This clear and winning little book, for 
readers willing to come to genuine 
grips with the idea of a mathematical 
proof, presents topology . . . as math- 
ematicians see it. The parlor tricks are 
gracefully alluded to here and there, 
but they are distinctly for after hours. 
The center of interest is the stuff itself: 
powerful notions of set theory are... 
exploited to define open sets and their 
coverings, and from them [to prove] 
the key theorems. .. One cannot any 
longer doubt that a single stroke of a 
knife exists that divides any irregular 
ham sandwich so that ham and both 
bread slices can be shared with perfect 
fairness by two consumers. 


Philip Morrison in Scientific American 
Contains over one hundred and fifty prob- 
lems and solutions. 

160 pp., 1966, ISBN 0-88385-61 8-2 

List: $10.00 MAA Member: $8.00 

Catalog Number NML-18 


FROM PYTHAGORAS TO EINSTEIN 
K.O. Friedrichs 


Starting with area and dissection proofs of 
the Pythagorean theorem, Friedrichs pro- 
ceeds gently, step by step, to cover vectors, 
coordinates, elastic and inelastic impacts, 
and relativistic space-time—ending with a 
derivation of a contemporary formula rival- 
ing the Pythagorean theorem in fame, & = 


mc’. 


88 pp., 1965, ISBN 0-88385-616-6 
List: $9.50 MAA Member: $7.60 
Catalog Number NML-16 


GEOMETRY REVISITED 
H.S.M. Coxeter and S.L. Greitzer 


After seeing a proof that a segment has 
a unique midpoint, students may wonder 
whether there are any interesting theorems 
in geometry. If either they or their teachers 
get a hold of this book, they will learn that 
there are many beautiful and nontrivial theo- 
rems in geometry. Among those found here 
are the theorems of Ceva, Menelaus, Pap- 
pus, Desargues, Pascal, and Brianchon. A 
nice proof is given of Morley’s remarkable 
theorem on angle trisectors. The transfor- 
mational point of view is emphasized: re- 
flections, rotations, translations, similarities, 
inversions, and affine and projective trans- 
formations. Many fascinating properties of 
circles, triangles, quadrilaterals, and conics 
are developed. 


193 pp., 1967, ISBN 0-88385-619-0 
List: $11.75 MAA Member: $9.50 
Catalog Number NML-19 . 


The very lucid presentation takes the 
reader from the elementary problems 
of plane Euclidean geometry to the fun- 
damental concepts of non-Euclidean 
geometry . The book is rich in 
remarkable facts and thereby is very 
effective in promoting the significance 
and the value of geometry in mathe- 
matical teaching, a promotion which is 
very necessary. . . The always origi- 
nal developments use very simple tools 

. and soon proceed to higher con- 
figurations. 


K. Strubecker in Mathematical Reviews 


Order from: 


The Mathematical 
Association of America 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 
(202) 387-5200 


ee 


Writing Mathematics Well, by Leonard Gillman 


64 pp., 1987, ISBN-0-88385-443-0 
Catalog Number - WMW List: $6.50 MAA Member: $5.00 


Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put into 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
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A Visit to the Newtonian N-body Problem via Elementary 
Complex Variables 


DONALD G. SAARI, Northwestern University, Evanston, Il 60208 


DON SAARI received his undergraduate degree from Michigan Technolog- 
ical University and his Ph.D. in mathematics from Purdue University. His 
Ph.D. dissertation, written under the guidance of Harry Pollard, concerned 
the qualitative behavior of dynamics in the general Newtonian N-body 
problem. After a postdoctoral position in the Yale Astronomy Department, 
Saari moved to Northwestern University where he currently is Professor of 
Mathematics. His main research interests revolve around applications of 
dynamical systems to celestial mechanics and physics and to issues coming 
from the social sciences. 


The study of how N celestial bodies move under gravitational forces is an old 
one. If one is willing to acknowledge the work of the ancient astrologers and 
shepherds—two groups that carefully plotted the positions of the stars and planets 
—then this subject area traces its origins to the earliest reaches of mankind. Indeed, 
had the title not been already preempted, one might suggest that the study of the 
N-body problem is “the world’s oldest profession.” If it isn’t the oldest, then, most 
surely, it is “the second oldest.” 

How do the heavenly bodies move? With the help of elementary complex 
variables, certain selective orbits will be described where, as it turns out, even 
“simple” motion can be surprisingly complicated. Also, some of the history of the 
Newtonian N-body problem will be related with an emphasis on the myth that only 
the two body problem has been solved. For practical purposes this assertion is 
correct, but at the turn of the century the Finnish mathematician K. Sundman 
“solved” the three body problem in an accepted sense of that time. While explaining 
why Sundman’s result isn’t widely known, it will be indicated how collisions, both 
the real type where two or more particles hit each other and the complex type where 
imaginary collisions occur at complex values of time, affect the behavior of the 
system. 

To start, consider the question that, at some time, probably bothered many of us. 
Namely, why did the early astronomers have so much trouble predicting the motion 
of the planets? Armed with a 20th-century education, we know that a reasonable 
approximation for the motion of a planet is uniform circular motion about the Sun. 
What is so difficult about describing motion as predictable as this? This question is 
closely related to the fable where Galileo silently disavowed his forced recanting of 
a Sun-centered solar system. Probably generations of irreverent school children 
silently wondered, “Who cares? What difference does it make if the Sun or if the 
Earth is the center?” It does matter. This simple change of variables introduces a 
significantly different perspective of the solar system, it explains the difficulties 
faced by the astronomers of antiquity, and it underscores the critical importance of 
the Copernican revolution. 

To understand why an Earth-centered prejudice creates problems, consider a 
simplified version of the Sun-Earth-Mars system. Mars is approximately 3/2 times 
as far from the Sun as the Earth (1.524 times) and it takes approximately 2 years 
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(687 Earth days) to complete one revolution of the Sun. To eliminate fractions, 
replace the traditional astronomical unit with half-astronomical units so that, in the 
new system, the Earth is distance 2 from the Sun. In this simplified model, the 
Earth’s position is z,(t) = 2e*7" where ¢ is measured in “Earth years,” while that 
of Mars is 2,,(t) = 3e7”. The position of Mars as observed from Earth is 


z(t) =2,,(t) — z(t) = 3e7" — 2e?7", (1.1) 
A convenient way to describe this orbit is with the translation 
z(t) — 2 = 3e7" — 2e?7" — 2 = e7™"(3 — 2e7 — 2e-™") = (3 — 4cos mt)e™" 


The graph of this equation—the polar coordinate representation of a limacon with 
a loop—depicts the orbit of Mars relative to Earth.! (See FIGURE 1.) 


Fic. 1. 


It is clear from FIGURE 1 why the pre-Copernican, Earth-centered prejudice 
made it so difficult to predict the motion of the planets and to develop a 
“Newtonian Theory.” While the orbit from a to b to c is not overly complex, it 
becomes quite complicated once Mars passes through point c. Mars continues in a 
counterclockwise direction until point d where it appears to stop and then change 
to a clockwise motion until point f. At f, Mars again reverses direction to return to 
a counterclockwise motion. Imagine what complications this motion presented to 
astronomers trying to predict positions. Next, imagine a theorist trying to determine 
the governing laws of motion. While Mars goes from a to b to c to d, a theorist 
might persuasively argue for a law of attraction. But, how does one explain the orbit 
from e to f to a? How does one justify the sudden law of repulsion? What is there 
about the Earth that drives Mars away? 

Actually, the motion of Mars in FIGURE 1 is simpler than that of the other 
planets. Similar elementary complex variable arguments prove that the orbits of 
Saturn, of Jupiter, and of the other planets that take much longer to circle the Sun, 
admit several loops similar to the one in FIGURE 2. This can be proved as above, but 


" The trigonometric version of this argument involves nothing more difficult than the double angle 
formulas, so it can be used to motivate several topics in calculus and precalculus classes. For instance, I 
find that this description of the orbit of Mars serves as a more persuasive illustration of the relevance of 
limacons and cardioids than many of the standard examples used in calculus. 
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now the limacon is relative to a rotating circle. (Instead of finding the orbit relative 
to a fixed translation, find it relative to a rotating one chosen so that the 
exponential terms collapse to a cosine term.) As an alternative way to demonstrate 
this, express the relative orbit, z(t) = ae*” — 2e77", a > 2, in the standard com- 
plex form of z(t) = r(t)e’®. The direction of the motion, whether it is clockwise 
or counterclockwise, is determined by the sign of 6’. This is the imaginary part of 
(In z(t))’ = 2’/z = r'/r + iO’. But 2'/z = wi(aa — 4e@7 9") (a — 2eP- 97), so 
after an elementary computation, it follows that the sign of 6’ changes if aa < 4. 
This inequality holds because Kepler’s third law asserts that a’a” = k where k is a 
constant; thus aa = (k/a)'”” is a decreasing function of a. 

Pictures of these looping orbits depicting the motion of the distant planets 
relative to the earth can be found in several books on the history of astronomy. 
Even more impressive are the photos of the planets taken over a several year period 
that are superimposed on one plate. When one compares this kind of dynamical 
behavior with the uniform motion obtained through a Sun centered system, it 
becomes clear why the Copernican change of variables has had such a profound, 
simplifying impact on science and astronomy. 

The Copernican representation significantly simplifies the description of the 
motion of the planets, but is it a correct theory? This is the essence of Cardinal 
Barberini’s query when he contested Galileo (at least in Brecht’s play Life of 
Galileo), “Are you sure... you astronomers aren’t just trying to make astronomy a 
little easier for yourselves? ... You like to think in circles or ellipses and in uniform 
velocities, in simple motions commensurate with your minds. But what if God had been 
pleased to make His stars move like this?’ where Barberini moves his finger through 
the air in a complicated course that presumably resembles FIGURE 2. Responding 
with his version of Occam’s razor, Galileo argued for Copernicus’s theory by 
asserting, “if God had created the world like this [He retraces Barberini’s course.] He 
would have constructed our minds like this too [He repeats the same course.] to enable 
them to recognize these courses as the simplest. I believe in reason.” 

Elementary complex variables disclose these somewhat surprising, relative orbits 
for the outer planets by capturing the effects of the rotating coordinate system 
defined by the motion of the Earth. There are other natural rotating systems in 
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astronomy, so one might correctly suspect that a related approach would exhibit 
other surprising astronomical behavior. To demonstrate this, consider the planet 
Mercury. It takes Mercury about 90 Earth days (87.967) to circle the Sun. Based on 
observations, it was believed until the 1960s that Mercury took about the same 
length of time to rotate on its axis. If this were true, then, as asserted in most of the 
older texts on astronomy, the same face of Mercury always would face the Sun—a 
solar day (or solar night) on Mercury would last forever. However, with radar 
observations, we now know that Mercury takes about 60 Earth days to rotate on its 
axis; this shortens a Mercury solar day to about 176 Earth days, or two Mercury 
years. Moreover, as shown next, when a more accurate representation for the orbit 
of Mercury is used, an observer on Mercury would find the apparent orbit of the 
Sun to be quite peculiar. 

To obtain a sharper approximation for the orbit of Mercury, or any other planet 
about the Sun, treat the orbit as an ellipse with eccentricity «; e.g., for the Earth, 
e = 0.0167, while for Mercury, e = 0.2056. The position of the planet on the ellipse 
is given by 


r(@) =a/(1 — ecos(@)) = a(1 + ecos(@)), (1.2) 


where a is a positive constant and 6@(t) is the angle of the planet determined 
by a reference line. Equation 1.2 has the complex representation 72(@) = 
a(1 + ecos(@))e!® = a(1 + (e/2)e" + (e/2)e)e”, or 


z(0) = ae/2 + ae” + (ae/2)e7?. (1.3) 


The rotation of Mercury is given by e!*7'. It follows that the apparent position of 
the Sun on Mercury is 


Z(t) = —2(O(t))e7 27", (1.4) 
Arg(Z(t)) = Arg(z(6(t)) — 12at + 7 = O(t) — 1274 2. (1.5) 


The apparent motion of the Sun changes directions whenever Arg( Z(t))’ changes 
sign. This happens, and the reason is based on Kepler’s second law, which asserts 
that 770’ is constant valued. Thus, if ¢ is sufficiently large, as it is for Mercury, (1) 
cannot be represented by uniform motion. In particular, when r is at perihelion (i.e., 
r(t) = |z(t)| is at a minimum), 6’ is at its maximum value. Indeed, using Kepler’s 
second law and eq. 1.2, we have that 6’ = n/(1 — ecos(@))” where n is the mean 
motion; e.g., for Mercury, n = 87. Consequently, at perihelion, 6’ = 82 /(0.7944)? 
> 127. This same argument shows that Arg(Z(t))’ > 0 if |@| < 26.8°; otherwise it 
is negative. From this, a schematic representation of the apparent motion of the 
Sun, given in FIGURE 3, is easy to determine. Each of the two loops corresponds to 
Mercury’s “yearly” passage through perihelion. (A similar argument shows that the 
apparent motion of the Sun would have no reversal of direction had the orbit of 
Mercury been circular enough so that e < 0.1835.) 

The change in the sign of Arg( Z(t)) creates an interesting phenomenon. There 
are locations on Mercury where after the Sun rises in the east on a Mercury 
morning, it reaches only a certain point in the sky before it stops to retrace its 
motion and to set in the east. Then, the Sun rises a second time this “morning,” but 
this time it progresses normally throughout the long day. That’s not all; after it sets 
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in the west, this changeable Sun reverses behavior once more to rise again in the 
west for a brief time before it finally sets for the long Mercury night. 


2. Epicycles. An important planetary theory was developed by Ptolemy. One can 
appreciate the genius of Ptolemy by examining the orbits depicted in FIGURES 1 and 
2. Recall the problem facing him: a scientific theory of planetary motion needed to 
satisfy the governing prejudices of the time. For Ptolemy, this meant a theory 
needed to accommodate the Earth as the center of the solar system, and we’ve just 
seen the complexities associated with this assumption. The next obstacle, left over 
from Aristotle, was that the circle is the most perfect figure in geometry. Obviously, 
heavenly bodies are “incorruptible,” so their motions should be described by 
uniform circular motion. But how does one build a predictive, planetary theory 
incorporating these assumptions? 

In his Almagest, written around A.D. 130, Ptolemy invented the ingenious 
approach of epicycles. This is where the position of the particles satisfies Aristotle’s 
constraint of uniform circular motion. The clever idea is that the point indicated by 
the motion of the first circle, the deferent, does not represent the location of the 
planet. It locates the center of a second circle spinning with uniform motion. The 
location of the planet is given by the moving point on the second circle—the 
epicycle. (See FIGURE 4.) Today this approach may seem to be hopelessly naive, but 
remember that variations of this theory dominated astronomy for more than a 
millennium—an incredibly long time for any scientific theory. Even Newton’s 


FIG. 4. 


110 DONALD G. SAARI [February 


equations didn’t enjoy such a long reign before being challenged by Einstein’s 
relativity. 

The long success of Ptolemy’s approach can be understood by using complex 
variables. Let a, be the radius of the jth circle where the period of the uniform 
motion is 1/2b(/), 7 = 1,2. This means that the motion of a planet, as described 
through epicycles, is given by 


z(t) = ae? + gie2@7it, (2.1) 


From this equation, the source of the success of the epicycles becomes obvious. 
Namely, with the appropriate choices of a; and b(/), j = 1,2, eq. 2.1 is the same as 
eq. 1.1. In other words, the epicycle approach can be viewed as recapturing the 
change of variables used to convert the Sun centered motion of a planet to an Earth 
centered one. With the appropriate choice of constants, this epicycle expression is 
the correct one for any planet. 

As time marched on, the accuracy of Ptolemy’s theory didn’t always satisfy the 
increasing demands of astrology and astronomy. In part, this is because the actual 
orbits are on perturbed ellipses rather than on circles about the Sun. (Some of 
Ptolemy’s orbits already incorporated elliptic behavior.) To achieve more accurate 
results, modifications of the same idea could be used. The approach is simple: 
instead of treating the location of the moving point on the second circle as the 
location of the planet, treat it as the center of a third (or fourth, or fifth, or ...) 
circle moving in uniform circular motion. The complex variable representation of 
these approximations is 


2(t) = Dae ae (2.2) 
F 


where j = 1,..., k, and k is the number of circles being used. As already illustrated 
by eq. 1.3, such an approach can achieve a higher degree of accuracy. Also, imagine 
how this never ending problem of determining the next values of a, and b(k) could 
serve as the source of an infinite number of Ph.D. dissertations. 

Epicycles were abandoned long ago. We now know, after extensive development 
of sophisticated mathematical theories applied to Newton’s equations, that there are 
situations where the orbits of planets are either quasi-periodic or almost periodic. 
What is quasi-periodic motion? It is motion represented by eq. 2.2, the epicycles, 
while almost periodic motion is the limit of this summation as k — oo. 


3. Collisions and spinors. Today, Newton’s equations are used to study the 
N-body problem. One approach to solving these equations might be to find the 
series solution. As we know from complex variables, the radius of convergence for 
this series is determined by the distance to the nearest singularity. Such a singularity 
appears to be a collision. (For N > 4, the situation is more complicated; see [13].) 
Thus, as a first step toward finding series solutions of Newton’s equations, we need 
to understand the properties of collisions. For instance, one might ask whether it is 
possible to mathematically continue a solution through a collision? If so, this could 
extend the radius of convergence of a series. 

To develop the behavior of collisions, start with the one body, or central force 
problem. If r(t) represents the vector position of the particle, then, with the 
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appropriate units of mass, time, and distance, the equations of motion are 


r(t)” = -r/r?, (3.1) 
where r = |r|. The solutions, given by eq. 1.2, are ellipses, parabolas, or hyperbolas. 
But, what happens near or at a collision? Can we define the motion through r = 0? 
This question was posed and answered by Sundman [16] in 1913. An improved, 
simplified theory was developed by the Italian geometer Levi-Civita [4] in 1920. 

To see the ideas, start with a family of orbits that have a collision in the limit, 1.e., 
a family represented by eq. 1.2 where e tends to unity. Such a family is depicted in 
FIGURE 5. The key point is that when a particle has a “close approach,” it rapidly 
spins around the central body. Therefore, if the dynamics can be mathematically 
extended beyond a collision, the colliding particle must make a 27 angular change 
—we should expect the particle to hit and then rebound from the central force. To 
remove this collision singularity from the equations of motion, this abrupt change 
needs to be removed by straightening out the orbit. 


=> 


Fic. 5. 


One way to see how to do this is to restrict the motion to the plane. Reexpress 
the position vector r = (x, y) as a complex variable z = x + iy, so the equations of 
motion are z” = —z/r°, where r =|z|. The collision occurs at r = 0, so this is 
where a change of variables is needed to convert the abrupt, 27 polar angle change 
into a form where the motion is on a straight line. To do this, the change of 
dependent variables must take half of the angular change at this point. With the 
complex variable representation of the coordinates, the appropriate change of 
variables is obvious; use the square root. The coordinate change is w = wu, + iu, = 
zi/? or 


w? =z, (3.2) 


The equations of motion for w, when accompanied with the change of indepen- 
dent variables ds = dt/r(t) introduced by Sundman, not only are well defined at 
w = 0, a collision, but they assume the particularly simple form 


uw’ + au=0 (3.3) 


where a is some positive constant and u = (uj, u,) is the vector representation for 
w. In other words, this transformation converts the nonlinear Newtonian equations into 
the linear equations for a harmonic oscillator. 

To see how eq. 3.3 arises, note that the change of the independent variable 
defines the operator d*/dt? = [rd*/ds? — r'd/ds|/r°, where (’) is differentiation 
with respect to s. An important contribution to the derivation of eq. 3.3 is the r° 
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term in the denominator of the operator; it cancels the r°* term in the Newtonian 
force. Thus, the equations of motion and the energy integral (v7 = 2(r~! + h)) are, 
respectively, 


rz" —r’z' +z=0, jz’\? = 2(r + hr’) (3.4) 


where h is the constant of energy. When the new dependent variable is included, a 
cancellation of terms occurs if r, r’ are replaced with their representation in terms 
of w and its conjugate. The new equations of motion and energy integral are 

2w” — — (2|w'? —-1)=0, 2|wP?=1+hr. (3.5) 
Making the obvious substitution with the energy integral, the equations of motion 
become w” — (h/2)w = 0. Thus, both this substitution and the coefficient a = 
—h/2 in eq. 3.3 reflect the fact that the transformed equations are on a fixed energy 
surface. Because the change of the dependent variables is not 1-1, two u values 
correspond to a single r value. The exception is u = 0 which corresponds to r = 0. 

In addition to being able to analyze the behavior of orbits near a binary collision, 
another advantage of eq. 3.3 is that they are linear equations with purely complex 
eigenvalues. From this it follows that all solutions are stable; a small perturbation 
has only a small effect on the solution. As such, numerical solutions of these 
equations retain the properties of the actual solution. This isn’t true for eq. 3.1. 
Here, a small numerical error can force the numerical solution onto a different 
energy surface, which, in turn, alters the frequency of the motion. Then, as is true 
for two pendulums with close, but different frequencies, the true and the computed 
solutions eventually will be at opposite ends of the orbits. 

Several important dynamical systems, such as the Earth satellite problem, are 
perturbed forms of eq. 3.1. (The perturbations for the equations of the Earth 
satellite problem reflect the Earth’s “middle age bulge’”’ around the equator.) Many 
of these systems cannot be solved analytically; instead they are numerically inte- 
grated. Consequently, it is natural to question whether perturbed forms of eq. 3.1 
also can be converted into a form that inherits some of the stability properties of the 
harmonic oscillator. However, “real’’ problems, such as the Earth satellite problem, 
are in a three-dimensional space; they can’t be restricted to the “Flatland” setting of 
a fixed two-dimensional plane. Therefore, to carry out the program of converting 
the satellite problem into a perturbed form of the three-dimensional harmonic 
oscillator, one first must be able to convert the three-dimensional eq. 3.1 into a 
three-dimensional harmonic oscillator. It appears from comments in the literature 
(e.g., see [15, p. 23]) that Levi-Civita unsuccessfully tried to find such a three-dimen- 
sional extension of his two-dimensional solution. 

As the world moved into the Space Age, this issue of finding a harmon- 
ic oscillator form for the three-dimensional, two-body problem added practical 
importance to its theoretical interest. This question was raised by E. Stiefel at 
the 1964 Oberwolfach conference he organized. Attending this conference was 
P. Kustaanheimo, a Finnish astronomer, who had been using spinors to analyze 
properties of his theory of relativity and other problems from physics. Spinors are a 
natural generalization of complex variables, and, sure enough, by mimicking Levi- 
Civita’s approach with spinors, Kustaanheimo solved the problem during this 
conference. More specifically, let w = u, + iu, + ju, + kuy, Zz = x, + ix, + jx, + 
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kx,, and u = (u,, U5, U3, U4). With the change of dependent variables, we = Z, 
along with Sundman’s change of the independent variable, Kustaanheimo [2] 
converted the system into the equations 


uw’ + au=0. (3.6) 


Of course, the true problem is in R°, not in R*. Fortunately, as indicated later, 
this change of variables maps each solution of eq. 3.6 to a fixed plane x, = c. Thus, 
x, is a dummy variable that can be set equal to zero, and solutions of eq. 3.6 are 
mapped to solutions of eq. 3.1 in the plane x, = 0. Moreover, because a point z in 
the plane x, = 0 determines the magnitude of the corresponding u, it follows from 
the dimensional difference that each point in the plane x, = 0 corresponds to a 
circle in the u variables rather than the two points in the Levi-Civita transformation. 
The only exception is when r = u = 0, where this circle degenerates into a point. 

Now that the spinor transformation solved this decades old problem, Kustaan- 
heimo and Steifel [3] joined forces to use this approach to analyze perturbed 
problems of equation 3.1. Among other conclusions, they showed that this transfor- 
mation does provide numerical advantage. (Extensions are in [15].) This transforma- 
tion, which is quite widely used today (more so in Europe than in the U‘S.), is 
known as the KS transformation. 

What was the source of the difficulties in extending Levi-Civita’s approach? Why 
can’t we find a similar relation for R°? One way to see this is to express 


Levi-Civita’s change of dependent variables, x = u? — us, y = 2u,u, in the differ- 
ential form 
dx\ (Uy —Uy\( du, 
ba = 1 uy Wan. (3.7) 


The first column in this orthogonal matrix can be viewed as locating a point in 
space, say, on the unit circle u? + u3 =1, while the second column defines a 
tangent vector. Thus, the Levi-Civita transformation can be identified with a vector 
field of unit length on S'—the unit circle in R?. Presumably, the appropriate R° 
change of dependent variables leading to the harmonic oscillator would be given by 
a 3 X 3 orthogonal matrix where the first column locates a point on S’, the unit 
ball in R°, while the remaining two columns correspond to tangent vectors to the 
sphere. But the existence of any such matrix is frustrated by the “hairy billiard ball” 
effect—one can’t comb a hairy billiard ball without getting a “cowlick’”’ where at 
least one hair stands upright. In other terms, as we know from Poincaré and 
Brouwer, there doesn’t exist a smooth, nonzero, tangent vector field on S?. In our 
setting, this means there doesn’t exist a 3 X 3 orthogonal matrix of the appropriate 
form, or, in turn, these topological reasons proscribe the existence of the desired 
change of variables. 

On the other hand, going back to his Ph.D. dissertation and his earlier work in 
algebraic topology, E. Stiefel [15, see Chp 11] was aware of the fact that while it is 
impossible to comb a hairy S*, one can comb a hairy S’, the unit ball in R4, in 
several ways. This unit ball admits a smooth frame where if (u,, u,, uz, u4) deter- 
mines a point on S°, then the remaining three mutually orthogonal, unit tangent 
vectors are (— U5, U,, U4, — U3), (—U3, — Ug, Uy, Uy), and (uy, — Uz, Uy, — u,). Once 
these four vectors are expressed as columns of a 4 X 4 matrix, much as in eq. 3.7, 
this defines the KS transformation of the dependent variables. Note that the last 
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row of this matrix is orthogonal to u; this ensures that x, = 0. Indeed, the resulting 
mapping is one of the Hopf maps from S? to S”. 

Subsequent to the KS transformation, other representations of the two-body 
motion have been found to eliminate the difficulties of collisions. A geometric 
approach is given by J. Moser [7], where he showed that two-body motion in a 
d-dimensional physical space can be related to geodesic flow on S“. Thus in the 
specific case d = 3, the problem in R? is related, in a different manner, to motion 
on S°. In Moser’s representation, the positions on the sphere correspond to the 
velocity of the particle through stereographic projections. Hence, the north pole of 
S¢ represents the collision where the velocity becomes infinite. J. Milnor [6] has a 
nice exposition of this and related ideas. A second important development is due to 
R. McGehee [8]. Remember, Sundman and Levi-Civita found transformed equa- 
tions that included the point r = 0 as a regular point. These approaches don’t 
extend for triple collisions. However, McGehee found a way to “paste on” the 
behavior of orbits at triple collisions when r = 0. He did this by expressing the 
equations of motion in a spherical coordinate framework, and by introducing 
an appropriate change of independent variable. The next step was to exploit 
the singularity in spherical coordinates that occurs when the radius is zero but all of 
the angles are admitted. In his representation, the “angles” correspond to the 
configurations formed by the particles while the radius of the sphere measured the 
distance between colliding particles. In this manner, the motion in the collinear 
three-body problem can be extended to a “collision manifold” corresponding to 
fictitious motion where the radius of the system is zero; i.e., this is the limiting 
motion when the distances between the particles approaches zero. R. Devaney [1] 
has a nice exposition of these ideas. 


4. Sundman theory. At the beginning of this century, K. Sundman made several 
important contributions to our understanding of the N-body problem—contribu- 
tions that have stood the test of time. Ironically, one of his major conclusions killed 
interest in a line of inquiry, so this particular result is not very well known. It should 
be; it is where Sundman “solved” the three-body problem according to accepted 
standards of the late 1800s and early 1900s. Indeed, in the late 1800s the King of 
Sweden and Norway established a prize for anyone who could find the solution of 
the N-body problem. The prize was awarded to Poincaré in 1889 even though he 
hadn’t solved the original problem. (On the other hand, Poincare’s prizewinning 
paper contains a wealth of ideas that remain influential.) The originally stated 
problem finally was solved in 1913 by Sundman [16] when he found a converging 
series solution for the three-body problem. Unfortunately, his series converges so 
slowly that, essentially, it is useless for any practical purpose. Consequently, this 
nice result is not as well known as it should be. Still, Sundman’s work remains a 
beautiful and important combination of several deep ideas that have a continuing 
influence on celestial mechanics. Sundman’s results can be described via complex 
variables, so these variables will be used to outline what Sundman did and why his 
series converges so slowly. Part of this description of Sundman’s contributions is 
based on Cauchy’s theorem showing that the radius of convergence of a power series 
is determined by the location of the nearest singularity. 

A serious complication hindering achievement of a convergent power series 
solution for the three-body problem are the singularities caused by binary collisions. 
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We now know that collisions of any kind are improbable [11]. But collisions exist, 
and when they do, they restrict the accompanying radius of convergence for any 
power series solution. One way to avoid these complications might be to restrict 
attention to those conditions with collision-free orbits. The fault with such an 
approach is that we don’t know how to characterize these conditions; in general, we 
don’t know whether an initial condition will, or will not lead to a collision-free 
trajectory. 

As already indicated, Sundman cleverly avoided the problem of binary collisions 
by converting the equations to a new system where a binary collision is not a 
singularity—it becomes a regular point. Because of the central role played by this 
result, another complex variable argument will be outlined to suggest why we should 
expect this conclusion and why Sundman’s change of independent variable is a 
“natural” one. 

To start, consider colliding particles in the collinear, central force problem. The 
motion is on the positive half of the x-axis so the defining equations of motion are 
x’ = —x7~?, Because the right-hand side of this equation always is negative, the 
solution, x(t), is concave down. This forces the system to have a collision either 
forwards or backwards in time, and if x’(t) < 0, then x’ remains negative in the 
future. To determine how the collision occurs, multiply both sides of the equations 
of motion by x’ and integrate to obtain the energy integral 


(x’)? = 2x71 4 2h, (4.1) 


where h is a constant of integration. 

If there is a collision at time ¢), then x — 0 as t — fp. This changes the energy 
integral to x(x’)? = 2 + 2hx ~ 2. In turn, this means that x!/*x’ ~ —2!/?, or that 
x3/2(t) ~ A(t) — t). Consequently, 


x(t)~A(tp-t)”? = ast> ty (4.2) 


where A is some positive constant. 

It turns out [10, 12] for the general N-body problem that if there is a collision of 
any kind at ¢ =f), then the colliding particles must approach each other like 
(t. — t)?/?; i.e., the rate of approach for collinear collisions extends to all possible 
kinds of collisions. This is important information about collisions, but it doesn’t 
explain the complex nature of this singularity. Is it an algebraic branch point? Is it a 
logarithmic singularity? As indicated next, binary collisions always correspond to 
algebraic branch points. This is true even if several different binary collisions occur 
at the same time [12]. 

One way to show this for the collinear problem is to “blow-up” the singularity by 
defining X(t) as X(t) (t. — t)?/? = x(t). To simplify the equations, assume that the 
time of collision is at ¢) = 0 (this just defines the origin on the time axis) and that 
we approach the collision through positive values. (The system is time reversible; the 
equations of motion are invariant with respect to the change of independent 
variable —1t.) By substituting [¢7/*X(7t)]” into the defining equations of motion, the 
equations for X(t) become 


t°X” + (4/3)tx’ — (2/9) X = — x7. (4.3) 
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To convert this Euler differential equation into a form that admits analytic 
solutions, use the change of the independent variable s = 7!” to obtain 


s*X" + 2sX’ -2X = -9x~?, (4.4) 


where the primes now denote differentiation with respect to s. By using series 
methods from elementary ordinary differential equations and a standard majoriant 
argument to justify convergence, it follows that this system has an analytic solution 
in s. Moreover, in a neighborhood about the time of collision the solution is 


x(t) = Ya, t?*”, (4.5) 


It follows from eq. 4.5 that a binary collision for the system is an algebraic branch 
point where s = t'”? serves as a local uniformizing variable. As ¢ passes through 
zero, the solution runs through the sheets of the Reimann surface to emerge as 
though the collision corresponds to an exact rebound or a perfectly elastic collision. 
Consequently, we should expect that this collision singularity can be removed, and 
that it doesn’t constitute a serious obstacle to the analysis of the system. (The same 
assertion does not hold for triple or more complicated collisions. Here, some of the 
exponents in the expansion from eq. 4.3 or 4.4 depend continuously on the value of 
the masses of the colliding particles. Thus, logarithmic singularities result. See [14] 
for triple collisions and [12] for a general discussion.) 

Incidentally, eq. 4.5 provides the basis for an “intuitive” argument to explain 
why Sundman’s change of independent variable, ds = dt/r(t), works. As in our 
derivation of eq. 4.5, if we know that the time of the collision is ty, then we could 
substitute s = (t — t,)'”7, or ds = dt/3(t — t,)?/7, to obtain an analytic solution. 
But, we don’t know, a priori, when or even whether a collision will occur. However, 
perhaps one way to avoid this complication about when a collision occurs is to base 
the time change on the appropriate power of r(t). After all, a collision occurs if and 
only if r approaches zero, so the growth properties of r determine when a collision 
happens. If such an approach is to be successful, then the growth properties of r 
need to be further exploited by using the appropriate choice of a so that r° 
effectively replaces the (t — t,)?/* term in the change of the independent variable. 
The asymptotic expression given in eq. 4.2 shows that a = 1, so a natural choice for 
the change of variables is ds = dt/r(t). This is the Sundman change of the 
independent variable. 

After Sundman eliminated binary collisions from the equations of motion, the 
only remaining, real singularity for the transformed three-body problem is a triple 
collision. Triple collisions can be avoided by appealing to a result proved by 
Sundman, and already known to Weierstrass (see [9, p. 66]). This result uses the 
integral of angular momentum )jm,r,xv, = ¢ where c is a vector constant of 
integration and where m,, r,, and v, are, respectively, the mass, the position vector, 
and the velocity vector of the ith particle. The Weierstrass-Sundman theorem 
asserts that if c # 0, then the system cannot have a complete collapse. (For an 
elegant proof, see [9, p. 66].) Namely, if N = 3, then triple collisions cannot occur 
off of the algebraic variety c = 0. 

By restricting attention to c # 0, we’ve removed all real singularities from the 
transformed equations for the three-body problem. One last constraint on the radius 
of convergence of a series solution is the possibility of complex-valued singularities; 
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are there imaginary collisions of the particles at imaginary values of time? There are. 
To show this and to develop some insight about how they are related to real 
behavior, consider elliptical solutions of the central force problem eq. 3.1. The 
solution r(t) = (r(t), r(t))'”? does not have a closed form representation. Instead, it 
is defined implicitly through the Kepler equations 


r(u) = a(1 — ecosu) t=u-—esinu, (4.6) 


where e« is the eccentricity of the ellipse, a is the length of the semimajor axis, and u 
is a variable. 


Elliptical solutions are well behaved without any possibility of a collision, so one 
might guess that the power series converges for all values of time. It doesn’t; this is 
because there are complex singularities. By letting u = u, + iu, and ¢t = ¢, + it,, a 
computation that just involves the expansion of sine and cosine for complex values 
proves that there is an “imaginary collision” where r = 0 at 


u, = 2ka +icosh ‘(1/e), k =1,2,..., t, =u,—esinu,. (4.7) 


Thus, not only do complex singularities occur, but Re(t,) is determined precisely 
where, on the real axis, r(t) assumes its minimum value. Moreover, the smaller the 
value of this minimum (ie., the larger the value of ¢), the closer this complex 
singularity is to the real axis. This suggests that, if in some manner, the three-body 
system can be forced to be “bounded” away from a triple collision, then perhaps 
the complex singularities are bounded away from the real axis. 

This is what Sundman proved. His first step, which is a significant improvement 
over the Weierstrass-Sundman Theorem, was to show that if c # 0, then for each 
value of ¢, max{|r,(¢) — r,(t)|} > D(c) > 0. In other words, if c # 0, then not only 
can there be no triple collision, but the system is strictly bounded away from a triple 
collision. (Subsequently, Sundman’s result has been extended to the N-body prob- 
lem for all values of N. See [5].) Using this fact along with the Cauchy existence 
theorem for differential equations, Sundman proved that the system has no complex 
singularities in a strip (depending on the value of c) in the complex plane centered 
around the real axis. With this, the remainder of his proof is immediate. All one 
needs to do is to conformally map this strip to the unit disk. For instance, if the 
strip is |Im s| < B, then the change of independent variables is o = (e7°/*8 — 1)/ 
(e”/*8 + 1). In this new system, the equations of motion have no singularities in 
the unit disk, so the resulting power series converges. Along the real axis in the unit 
disk, o corresponds, in a 1-1 fashion, to all real values of time. Thus, in this way, 
Sundman proved the existence of power series solutions for the three-body problem 
that converges for all real values of time. 

Sundman solved the three-body problem but, unfortunately, the series solution is 
of little practical value because it converges too slowly and it requires too many 
terms to achieve any interesting degree of accuracy. It is fairly easy to see why this is 
so. Remember, Sundman used two changes of the independent variable. The first 
was to eliminate the singularities due to binary collisions. As we see from eq. 4.2, 
this involves a change that has the form s = (t, — t)'/? near a binary collision. This 
change of variables “slows down” the dynamics with the accompanying effect that a 
numerical value of s tends to be larger than the corresponding numerical value of ¢. 
(This is similar to the situation in numerical integration where smaller step sizes are 
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used to achieve accuracy with rapid changes in the dynamics. Accompanying the 
smaller steps is an increase in the number of steps; this number of steps corresponds 
to the value of s.) Next, this larger value of s is exponentially mapped to the unit 
disk in the complex o-plane. Thus, it follows that fairly small values of t can be 
identified with values of o near the boundary of the unit disk. When this happens, 
we should expect the convergence to be slow. This occurs. In other words, while 
Sundman successfully solved the three-body problem, the theoretical properties of 
his analysis demonstrate that, in general, an universal power series solution of the 
three-body problem is impractical. His success in solving an age-old problem killed 
interest in this line of inquiry. Nevertheless, many of Sundman’s results used to 
achieve his series solution continue to play a significant role in the development of 
celestial mechanics. 


5. A footnote. The emphasis in this brief description of aspects of the Newtonian 
N-body problem has been on the role played by complex variables. It is appropriate 
to end with a historical aside about how certain problems from celestial mechanics 
played a role in the development of some of the classical results in complex 
variables. 

As mentioned in Section 3, the solution for the simple, central force problem, 
r(t), given by eq. 4.6 for the elliptic case, does not have a closed form representa- 
tion. This solution is only defined implicitly through Kepler’s equations. Conse- 
quently, several mathematicians sought to find a useful direct expression, even a 
power series solution, for r. A. Wintner [17], in his book Analytical Foundations of 
Celestial Mechanics, notes in a footnote on page 217, “The direct proof of |this aspect 
of Kepler’s equation] played an important historical role in the theory of analytic 
functions ...[A] principal impetus for Cauchy’s discoveries in complex function theory 
was his desire to find a satisfactory treatment for Lagrange’s series | for Kepler’s 
equation]. Cauchy was led to his fundamental theorem connecting the radius of 
convergence with the location of the nearest singularity, as well as to his maximum 
principle, precisely in his papers dealing [with this problem]. Also the facts usually 
referred to as the argument principle and Rouche’s theorem were first observed in 
connection with this problem concerning Kepler’s equation.” 
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In 1875 Edouard Lucas challenged the readers of the Nowvelles Annales de 
Mathématiques to prove the following: 


A square pyramid of cannon-balls contains a square number of cannon-balls 
only when it has 24 cannon-balls along its base [4]. 
In other words, the only nontrivial, natural number solution of 
174+ 274+ 37+ .--- 4x7 =y? 
is x = 24and y = 70. 

We begin this article by recalling the history of this Diophantine equation. We 
then present a new, elementary solution of it. 

In 1876 Moret-Blanc gave a “proof” of Lucas’s assertion which, although 
incomplete, at least gives a good framework for solving it [7]. Moret-Blanc divided 
the problem into two cases, one with x even and one with x odd. This is exactly the 
approach we take in this paper. In 1877, having pointed out that there was a gap in 
Moret-Blanc’s argument, Lucas published a proof which also contained a gap 
[5]. Lucas could handle the case with x even but not the case with x odd. In 1918 
G. N. Watson managed to fill Lucas’s gap with about 14 pages of a specially 
extended theory of Jacobian elliptic functions [9]. This was the first complete proof 
of Lucas’s assertion but it was very complicated. In 1952 Ljunggren used Skolem’s 
method to give a simpler, more arithmetic solution to the square pyramid problem 
[3]. In 1966 Baker and Davenport used a theorem from transcendental number 
theory to solve the simultaneous Diophantine equations 3x* — 2 = y* and 8x — 7 
= z? [1]. Their method, recently made more practical by Waldschmidt, can easily be 
adapted to demonstrate Lucas’s proposition. In 1975 Kanagasabapathy and 
Ponnudurai gave an elementary solution for the simultaneous Diophantine equa- 
tions 3x* — 2 = y” and 8x? — 7 = z? [2], and, in 1985, De Gang Ma used the ideas 
of Kanagasabapathy and Ponnudurai to arrive at an elementary solution to Lucas’s 
problem [6]. This was the first complete solution accessible at the undergraduate 
level. 

In this paper we draw on one of Ma’s ideas to give a new proof of Lucas’s 
assertion. This proof is so short and simple that it is surprising that it escaped 
mathematicians for over a hundred years. 
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I 


Our equation is equivalent to x(x + 1)(2x + 1) = 6y”. In this section we solve it 
assuming that x is even, and, in the next section, we solve it assuming that x is odd. 
Suppose, then, that x is even. Under this assumption we can solve the equation with 
the help of the following three lemmas which were probably all known to Fermat. 


LEMMA 1. The area of a Pythagorean triangle is never a square. 


Proof. Suppose, on the contrary, there are Pythagorean triangles with square 
areas. Let w’ be the smallest area for which such a triangle exists. Let x and y be 
the legs of a Pythagorean triangle with area w*. Then x* + y? = z” for some integer 
z, and xy/2 = w’. Since w is minimal, x and y are relatively prime and, without 
loss of generality, we may take x odd and y even. (It follows by a congruence 
consideration modulo 4 that x and y are not both odd.) By the well-known theorem 
for Pythagorean triangles, there are relatively prime positive integers r and s of 
different parity such that x = r? — s* and y = 2rs. Hence (r* — s”)rs = w? and 
s/4<s < w?. Since r, s, r—s and r+ are pairwise relatively prime, and since 
(r — s\(r + s)rs = w?”, it follows that there are positive integers a, b, c and d such 
that r= a’, s=b*, a*—b*=r—s=c* and a*+b*=r4+s = d’. Note that c 
and d are relatively prime since r — s and r+ s are relatively prime. Noting also 
that c and d are odd (because r and s have different parity), let XY = (c+ d)/2 
and Y = (d — c)/2. Then X and Y are relatively prime and X? + Y* = a’. Hence 
one of X and Y is even, and XY/2 = (d? — c’)/8 = b’/4=5s/4 is a square 
integer. Since the triangle with sides X, Y and a is a Pythagorean triangle with 
square area s/4, it follows from the minimality of w? that w? < s/4. Contradic- 
tion. 


LemMA 2. There are no positive integers x such that 2x* + 1 is a square. 


Proof. To obtain a contradiction, suppose that (x, y) is the least positive integer 
solution of 2x4+1=y. Then for some positive integer s, y = 2s+1 and 
x* = 2s(s + 1). If s is odd then s and 2(s + 1) are relatively prime and, for some 
integers u and v, s = u* and 2(s + 1) = uv". This gives 2(u* + 1) = v* with u odd 
and v even. Hence we have 2(1 + 1) = 0 (mod 8). Since this is impossible, s cannot 
be odd. Since s is even, 2s and s + 1 are relatively prime, and there are integers u 
and v, both greater than 1, such that 2s = u* and s + 1 = v*. Let w be the positive 
integer such that u = 2w. Let a be the positive integer such that v*? = 2a + 1. Then 
u*/2+1=s5+1=0" so that 2w4 = (v4 — 1)/4 = a(a + 1). Since v? = 2a + 1, 
it follows from congruence considerations modulo 4 that a is even. Since 2w* = 
a(a + 1), it follows that there are positive integers b and c such that a = 2b* and 
a+1=c*. However, this implies that 2b4 + 1 = (c”)* and hence y < c* (by the 
minimality of (x, y)). On the other hand, c? < a+ 1 <v*<s+1< y. Contradic- 
tion. 


LemMMA 3. There is exactly one positive integer x, namely, 1, such that 8x* + 1isa 
square. 


Proof. Suppose 8x* + 1 = (2s + 1)”. Then 2x* = s(s +1). If s is even then 
there are integers u and v such that s=2u* and s+1=v*. In that case, 
2u4*+1=s5+1=v* and, by Lemma 2, u =0 and, hence, x = 0. If s is odd 
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then there are integers u and v such that s=u* and s+1=2v*. In 
that case, u* + 1 = 2v*. Since u is odd, a congruence consideration modulo 4 
shows that v is odd. Squaring both sides of u* + 1 = 2v*, we obtain 4uv® — 4u4 = 
u® — 2u* +1 and hence (v4 — u*)(v4 + u”) = ((u* — 1)/2)*, an integer square. 
Since v* and u* are relatively prime, it follows that both (v+— u*)/2 and 
(v* + u*)/2 are integer squares. Now (v? — u)? + (v? + u)? = 4(v4 + u?)/2 = A? 
and (v* — u)(v? + u)/2 = (v* — u*)/2 = B*. By Lemma 1, this is impossible unless 
v* = +u. Since u* + 1 = 2v+, we obtain u* — 2u2 + 1 = O and wu? = 1. From this 
it follows that s = 1 and x = +1. 


With the above three lemmas, we are now in a position to solve x(x + 1) 
(2x + 1) = 6y? under the assumption that x is even. Suppose, then, that x is even. 
Then x + 1 is odd. Since x, x + 1 and 2x + 1 are relatively prime in pairs, it 
follows that x + 1 and 2x + 1 (both being odd) are either squares or triples of 
squares. Thus x + 1 ¥ 2 (mod 3) and 2x + 1 ¥ 2 (mod3). Hence x = 0 (mod 3), 
and for some nonnegative integers p, g and r, we have x = 6g’, x + 1 = p” and 
2x +1=r*. Thus 6g? = (r— p)(r + p). Since p and r are both odd, 4 is a factor 
of (r — p)(r + p) = 6q? and thus gq is even. Let gq’ be the integer such that g = 2q’. 
We now have 69” = ((r — p)/2)((r + p)/2) and, since (r — p)/2 and (r + p)/2 
are relatively prime (because r* and p” are relatively prime), we obtain one of the 
following two cases. 


Case (i). One of (r — p)/2 and (r + p)/2 has the form 6A? and the other has 
the form B* (where A and B are nonnegative integers). Then p = +(6A” — B?) 
and g = 2AB. Since 6g? + 1 = x + 1 = p’, we have 24A4’B? + 1 = (6A” — B*)* or 
(6A? — 3B*)* — 8B* = 1. By Lemma 3, B = 0 or 1 and, hence, x = 6q* = 0 or 24. 
The only nontrivial solution is thus with x = 24. 


Case (ii). One of (r — p)/2 and (r + p)/2 has the form 3A? and the other has 
the form 2B” (where A and B are nonnegative integers). Then p = +(3A* — 2B’) 
and g = 2 AB. This gives 24A*B? + 1 = (3A* — 2B’)? and hence (3A? — 6B’)? — 
2(2B)* = 1. By Lemma 2, B = 0 and hence x = 6q? = 0. 

Thus when x is even, the only solution to Lucas’s puzzle is x = 24 cannonballs 
along the base of the square pyramid. 


II 


We have solved Lucas’s problem under the assumption that x is even. In this 
section we solve it under the assumption that x is odd. To do this, we first 
investigate the solutions of the Diophantine equation X* — 3Y? = 1. 

Let a = 2 + y3 and b =2 — y3. Note that ab = 1. Where n is any nonnegative 
integer, let u, = (a" + b")/2 and v, = (a" — b")/(2V3). Then u, and v, are 
integers, and it is a well-known result (from the theory of the Pell equation) that 
when n > 1, (u,, v,) is the nth positive integer solution of X? — 3Y? = 1. (See [8, 
pp. 209-211].) Of course, when n = 0, we have the solution X = 1 and Y = 0. 

In order to show that x =1 is the only odd positive integer such that 
x(x + 1)(2x + 1) has the form 6y7, we use the following lemmas. 


LEMMA 4. Where m and n are nonnegative integers, U,,4, = UmU, + 30,~, and 
Unin = Umy + UjYm Also ifm—n>O, then u,,_, = U,U, — 30,b, and v,,_, = 


n n 
—U,v, + U,D,,- 
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Proof. This follows by straight calculation from the definitions of u, and v,. 
Using Lemma 4, it is not hard to obtain the following result. 


LEMMA 5. Where m is a nonnegative integer, U,,47 = 2UjU,.41 — Up», And v.45 = 


2UpV_ +1 _ Um: 


Using the fact that (u ) is a solution of X* — 3Y” = 1, we also have the 


following. 


m? Um 


Lemma 6. Where m is a nonnegative integer, uy,, = 2u2,— 1 = 6v2,+1 and 


mine 


LEMMA 7. Let m, n andr be nonnegative integers such that 2rm — n is nonnega- 
tive. Then uy, 4, = (—1)'u, (mod u,,). 


Proof. Using mathematical induction on r together with Lemma 4, we can show 
that U,41)m = 9 (mod u,,) and v3,,, = 0 (mod u,,). Since, by Lemma 6, u,,, = 
2u2,,-1= 6v2,,+ 1, it follows that u,,,=(—1)’" (modu,,). Thus uy,54, = 


UspmUn £ 30am, = (—1)"u, (mod u,,) (using Lemma 4). 


Let us consider the first few values of u,. Starting with n = 0, we have 1, 2, 7, 26, 
97, 362, and so on. If we consider these values modulo 5, we have 1, 2, 2,1, 2,2,.... 
By Lemma 5 it follows that this sequence is periodic. If we consider the values of u, 
modulo 8, we obtain 1, 2, 7, 2, 1, 2,.... By Lemma 5, this is a purely periodic 
sequence with period length 4. Note that when n is even, u, is odd. Using the laws 
of quadratic reciprocity, the above comments lead us to the following two lemmas. 


5 
LEMMA 8. If n is even then u, is an odd nonmultiple of 5 and | =liffnisa 
Uu,, 


multiple of 3. 


LEMMA 9. If n is even then u, is odd and (= = 1 iff n is a multiple of 4. 
u 


n 


The following and final lemma was first proved by Ma. 


LEMMA 10. Where n is a nonnegative integer, u,, has the form 4M* + 3 only when 
u,=i. 


Proof. Suppose u, = 4M? + 3. Then u, = 3 or 7 (mod8) and, from the se- 
quence of values of u,, modulo 8, it follows that n has the form 8k + 2. Suppose 
n # 2 (and hence u, # 7). Then we can write n in the form 2k’2° + 2 where k’ is 
odd and s > 2. By Lemma 7, uw, = ug,95 4. = (—1)* uy (mod us). Since k’ is odd 
and u, = 7, it follows that 4M’? = u, — 3 = —10 (mod u..) and, hence, 


—2\/{ 5 —10 4M? 1 
Us U4s 7 Us 7 Us 7 
By Lemma 9 it follows that the first factor on the left is 1. By Lemma 8 it follows 


that the second factor on the left is —1. Since this is impossible, we may conclude 
that n = 2 and hence u, = 7. 
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Suppose now that x is an odd positive integer and x(x + 1)(2x + 1) = 6y? for 
some integer y. Since x, x + 1 and 2x + 1 are relatively prime in pairs, it follows 
that x is either a square or a triple of a square, and hence x # 2 (mod 3). Moreover, 
x + 1, being even, is either double a square or six times a square, and, hence, 
x + 1 #1 (mod 3). Thus x = 1 (mod 3) and hence x + 1 = 2 (mod 3) and 2x + 1 
= 0 (mod 3). Thus for some nonnegative integers u, v and w, we have x = 
u*, x + 1 =2v? and 2x + 1 = 3w’. From this we obtain 6w? + 1 = 4x +3 = 
4u? + 3. Also (6w? + 1)? — 3(40w)* = 12w2(3w? + 1 — 407) + 1 = 12w2(2x 4+ 1 
+1 — 2(x + 1))+1=1. Hence, by Lemma 10, 6w?+1=7. Thus w =1 and 
x = 1. This gives us the trivial 1 cannonball solution to Lucas’s problem. 

We may conclude that if a square number of cannonballs are stacked in a square 
pyramid then there are exactly 4900 of them. 


I would like to thank Professor J. Lambek for his kind help and encouragement. 
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Editor’s Corner: The Euclidean Algorithm Strikes Again 


STAN WAGON 
Department of Mathematics, Smith College, Northampton, MA 01063 


On page 144 of this issue of the MONTHLY Don Zagier presents an extremely 
short, elegant, and elementary proof of the classic result that any prime p congruent 
to 1 (mod 4) is a sum of two squares. The problem of finding a representation of p 
as a’ + B often arises in computational number theory; for example, it is a key 
step in factoring Gaussian integers into prime Gaussian integers. Fortunately, this 
problem can be solved by a very fast and easy-to-program algorithm—it is not 
widely known, although it uses nothing beyond undergraduate mathematics. In this 
note we will present the algorithm and a proof of its correctness. 

We pause for a moment to point out a beautiful formula for a and f due to 


1 
Gauss: If p = 4k + 1 then a = 504) and B = ((2k)!a) are such that p = 


a’ + B* (here (n) denotes the residue of n mod p with |(n)| < =) For a proof see 


[3, chaps. 4-5]. Simple as it is, Gauss’s formula does not seem to be of any value for 
computation of a and B. 
Now, let p be a prime of the form 4k + 1. The algorithm has two steps: 


A. Find x such that x? = —1 (mod p). 
B. Apply the Euclidean algorithm to p and x; the first two remainders that are 
less than VP may be taken as a and B. 


Step A is easy in practice: First find c, a quadratic nonresidue of p; recalling Euler’s 
criterion (which states that for a not divisible by p, a is (resp., is not) a quadratic 
residue mod p iff a‘?~)/* = +1 (resp., —1) mod p), we have that c?*= 
—1 (mod p), whence x may be taken to be c* mod p. How to find a nonresidue? 
Start with c = 2 and let c take on increasingly larger prime values, checking each 
until a nonresidue is found. To do this efficiently, recall that 2 is a nonresidue 
mod p iff p = 5 (mod 8) and 3 is a nonresidue mod p iff p = 2 (mod 3). For primes 
c > 3, observe first that, because p = 1 (mod 4), quadratic reciprocity implies that c 
is a nonresidue mod p iff p is a nonresidue mod c iff d is a nonresidue mod c, 
where d is the mod-c reduction of p. Now, one can simply use Euler’s criterion via 
d‘°~)/2 mod c. Alternatively, continue to reduce via quadratic reciprocity and the 
Jacobi symbol (see, e.g. [9, §3.3)). 

The search for a nonresidue cannot go on too long, at least if, as expected, the 
Extended Riemann Hypothesis is true: Under ERH the first nonresidue is less than 
2 (log p)? [1] (and hence this part of the algorithm runs in polynomial time). In 
practice things are generally much better than the ERH bound. For example (see 
[1]), if p = 1 (mod 4) and p < 1,000,000, then the least nonresidue is at most 37, 
while 2(log p)? is about 381 for primes near 1,000,000. As an experiment, I 
computed the least nonresidue for the first 418 primes! greater than 10'> (for which 


‘Actually, the 418 subjects of the experiment are only probably prime. The PrimeQuery function in 
Mathematica is highly reliable but has not been proved to be failsafe, and it is easy to forget that some of 
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the ERH-bound is 2,385). The first nonresidue is never greater than 29, and only 19 
of the primes have their first nonresidue outside of {2, 3, 5,7}. 

The essence of this two-squares algorithm was first presented by Serret and 
Hermite in 1848 [8, 11] using the language of continued fractions. The streamlined 
form presented above is due to G. Cornacchia [3] (see also [2]). Here we present a 
self-contained proof using only elementary facts about the Euclidean algorithm. 

First, an example. Let p = 848654483879497562821. Because p = 5 (mod 8), c 
may be taken to be 2 and x is easily calculated by the power-mod algorithm. 
Applying the Euclidean algorithm yields the following remainders: 


P = 848654483879497562821 25504875423829707 
x = 354060813206257083018 3100076540181815 
848654483879497562821 704263102375187 
140532857466983396785 283024130681067 
72995098272290289448 138214841013053 
67537759194693107337 6594448654961 
5457339077597182111 6325867913833 
2049690263526922005 268580741128 
1357958550543338101 148510867889 
691731712983583904 120069873239 
666226837559754197 a = 28440994650 


B = 6305894639 


And, as predicted, 284409946507 + 63058946397 = 848654483879497562821. This 
computation took a quarter of a second using Mathematica on a Macintosh. 

Now to the proof that the algorithm works (which also provides a proof of 
existence of a and £). Some notation: if a and 5b are the starting values of the 
Euclidean algorithm, let the sequence of remainders be 7) (= a), 7, (= b), thy. +5 Mos 
with 7, = gcd(a, b). Let g; denote the quotients, that is, g, = |a/b], gq. = |b/r],... - 
Recall that along with the remainders and quotients, one often computes two 
sequences {s,} and {t;} so thatr; = s,a + t,b, i= 0,1,..., n. We will need only the 
ts, which are defined as follows: t,) = 0, ¢, = 1, and ¢t;,, = ¢;_, — q,t;. It is easy to 
see by induction that t,b =r, (moda) for each i (this forward computation of 
t—and of s, which is defined similarly—is much easier than the “back-substitution”’ 
method discussed in some texts). We need some additional facts about the t- 
sequence. It is immediate from the definition that these integers alternate in sign 


and that the sequence {|,|: i = 0,1,..., } is strictly increasing. We will need one 
more term than is usual, so let t,,, be ¢t,_, — q,¢,- Because |t;,,| = 9,|¢,| + |t,_1| 
and |t;_,| < |t,|, the sequence {|¢,|:7=n+1,n,...,1} is a Euclidean algorithm 


remainder sequence starting from the two values |t,,,| and |¢,|. 
Let’s look at an example more closely, where a is p, the prime in question, and b 
is x which, from now on, denotes the smaller of the two square roots of 


the “primes” may be composite. While the experiment was running I thought it was checking actual 
primes. I was probably right. ( Added in proof: Ilan Vardi has since confirmed, using an elliptic curve 
method, that the 418 “primes” are in fact prime.) 
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—1 mod p (the other root is p — x). Here’s an example with p = 73, for which 
x = 27 and n = 6: 


a 
eo 
| 2tiwan) | 2) | ot 
a 
rr ee 
re ee ee 
a ee ee 
ee 


The surprising thing is that the ¢-sequence, in absolute value, is simply the reverse of 
the remainder sequence generated by p and x. This is the key point in the proof 
that the algorithm works (in this case: 8* + 37 = 73). To carry out the proof, we list 
the facts we need in the case of r-, g-, and t-sequences defined from a prime p (a 
prime congruent to 1 mod4) and x (the smaller of the two square roots of 
—1 mod p). 


1. The f-sequence is, in absolute value, just the reverse of the r-sequence. 
2. n is even. 

3. The g-sequence is symmetric about its center. 

4. p divides each r? + 7. 

5. 7, /2 1s the first remainder under yP- 


ne validity of the algorithm tollows from these facts. By (4), p divides s = r? nya + 
t- a) which, by (1), equals r? ayy tr heyy ,,- But by (5) each of these two remainders is 
under VP; so s < 2p; because s is divisible by p, s must in fact equal p. 

Before proving (1)-(5) we need an additional result, one first discovered in the 
context of continued fractions. This result is valid for any starting values, so we 
revert momentarily to the general starting values a and b, with gcd(a,b) = 1. The 
proof is a straightforward induction on n, combining the inductive assumptions that 
b = f(q,---,9,) and r, = f(q3,.-., 4,) with the equation a = q,b + rn. 


EULER’S FORMULA. There is a function f, described below, such that a = 
f(4,,---54,). The function f is a sum of products: First take the product of all the 
qs; then add every product that can be obtained by omitting any pair of consecutive 
terms; then add every product that can be obtained by omitting any two separate 
pairs of consecutive terms; and so on. For example: if n = 5, then f(q,..., 95) = 


9192939495 + 939495 + 1949s + 919095 + 119293 + 45 + 43 + G. (The empty prod- 
uct, which occurs if n is even, yields the summand 1.) 


We now return to the starting values p and x and conclude the proof of validity 
of the two-squares algorithm by proving (1)-(5). 
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1. First note that ¢,,, = tp. This is because the ¢-sequence is, in absolute value, 
a Euclidean algorithm sequence with the same quotients, but in reverse order, as the 
r-sequence; the function in Eulers formula is invariant under reversal of its 
arguments, so |t,,,|, being the same function of the quotients as p is, must equal 
p. Now, the fundamental congruence t,b = r, (mod a) implies that t,x = 1 (mod p). 
Multiply this congruence by x to get ¢, = —x; because |t,| < p, t, must be either 
—x or p — x. But the latter is impossible, for then the ¢-sequence (reversed, and in 
absolute value) would be a remainder sequence beginning: p, p — x, x,..., which 
would mean it would be longer than the r-sequence, contradiction. Therefore 
t, = —x. Now, because the |f|-sequence is, working backwards, a remainder 
sequence beginning with p and x, it coincides with the r-sequence. 

2. This is immediate because the f-sequence alternates in sign and, as just 
proved, ¢, is negative. 

3. By (1), the quotients in the |t|-sequence (working back from t¢,,,) are 


identical to the elements of the g-sequence; but from the equation |t;,,| = |t;-,| + 
q,|t;|, these quotients are the reverse of those in the q-sequence. Therefore each 
Gi = Qnt+1-i- 


4. Simply square both sides of the congruence t,x = r, (mod p), using x* = 
—1 (mod p). 

5. Use the symmetry of the quotients and Euler’s formula applied to the second 
half of the sequence of quotients to show that all the terms occurring in the square 


n 
of las + Lees de] also occur in the sequence /f(q,,...,4q,). That 1s, re _ 


n 
L415 +++ Inyadf 15 + 1,...,q,} and each term in this expansion is also in the 


expansion of f(q,,...,94,) = p- Thus rer < p. Since p is prime, the inequality is 
strict. It remains to show that the preceding remainder, +, ,._,, squares to a value 
larger than p. This can be done in the same way by showing that every product in 
the f-representation of p occurs in the square of the f-representation of the 


n n 
preceding remainder, which is Fars 15 + 1s an> 15 + 1,..., an} The de- 


tails are straightforward. 


There does exist a provably polynomial-time algorithm for representing a prime 
as a sum of two squares, though in practice it is not as fast as the algorithm just 
given. It is not too hard to show that if @ is such that p + 2a is the number of 
points (x, y) in Z, X Z, such that y* = x’ — x, then a” is one of the two squares 
summing to p. (This can be done with the help of Jacobsthal’s formula (see [5, 
p. 123]), which asserts that one of the two squares summing to p is one half of the 
sum over Z, of the Legendre symbols ((x? — x)/p).) Now, there is a polynomial- 
time algorithm for finding the number of points on an elliptic curve (due to R. 
Schoof [10]) and it can be used to find p+ 2a, thus yielding a polynomial-time 
algorithm for finding a. 

We close by mentioning some recent work on the more general problem of 
representing an integer as fa? + gB* for given positive integers f and g (the case 
f = 1 was considered by Cornacchia [4]; [12] treats the case f= 1, g = 5). In [7] 
K. Hardy, J. B. Muskat, and K. S. Williams show that the Euclidean algorithm 
technique can be used, even when the number to be represented is composite. For 
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the two-squares problem (f= g = 1) their approach finds all primitive solutions 
(i.e., those with relatively prime a and £) as follows: First factor the given integer 
n; then for each prime dividing n find a square root of —1 modulo that prime (if 
one of these roots fails to exist—that is, the prime is congruent to 3 mod 4—then 
there is no primitive solution). Standard techniques (see, e.g. [6, p. 67]) can be used 
to transform these roots to a complete set of roots of — 1 modulo each of the prime 
powers dividing n. These, in turn, can be used (with the help of the Chinese 
Remainder Theorem) to get all values of x such that x? = 1 modulo n. Now, for 
each such x apply the Euclidean algorithm to n and the smaller of x, n — x. The 
first two remainders under yn form a primitive solution, and all primitive solutions 
arise from the various roots x in this way. The proofs of facts 1-5 do not require 
that p be prime (that was used only in Step A), and so provide a proof that the 
preceding algorithm generating primitive solutions for composite n is valid. If all 
solutions to n = a* + B* are sought, then find the primitive solutions for each 
integer of the form n/d? (where d* divides n), multiplying each solution by d to 
obtain a representation of n. 


Acknowledgment. The author is grateful to Don Zagier for telling him of the algorithm presented in 
this note and suggesting improvements in the exposition. 
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LETTERS TO THE EDITOR 
Editor: 


The following comments on two recent articles in the MONTHLY may be of 
interest. Regarding the paper [1] on the Radon-Nikodym theorem, an essentially 
identical argument appears as one of the three proofs under the heading “First 
Proof (Probabilistic) in the graduate text book [2], on pages 258-259. There it was 
called “Probabilistic” since the sequence, whose limit is the desired Radon-Nikodym 
derivative, forms a martingale with a directed index set. Similarly the following 
article [3] contains a short proof of the fact that, when » < yp are measures of which 
u is finite and vy arbitrary, v has a purely infinite set. This is also found (with even a 
shorter proof) on page 273 of [2]. 
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M. M. Rao 
Mathematics Department 
University of California 
Riverside, CA 92521 


Editor: 


I wish to make some observations concerning the paper “On the proof of the 
Radon-Nikodym Theorem” by Albert Wilansky that appeared in the MONTHLY on 
p. 441 of vol. 96(5), May 1989. The main result of this paper is an exercise in [1]; see 
Exercise 31(7) on p. 131. This exercise refers to Exercise 30.11 which in turn refers 
to Exercise 17.3, and the latter exercise contains all the techniques used by the 
author. Halmos calls the technique the method of exhaustion. 

The idea of the method of exhaustion is as follows: Suppose that u and vp are 
nonnegative measures on a sigma algebra > of subsets of a set S and that the 
measure p is finite. Define a to be the sup of the set of 4 measures of sets that are v 
sigma-finite. Choose an expanding sequence (A,,) of sets such that for each n, v is 
sigma-finite on A, and p(A,) > a — 1/n. Define 


oO 
A= UA,. 
n=1 
Then pu(A) = a, and for every measurable subset F of S\ A, either u(£) = 0 or 
vy(E) = oo. 

In my opinion, this approach to the method of exhaustion is more elegant than 
the author’s. The author also seems to suggest that his method is being offered as an 
alternative to a transfinite recursion; but I don’t think that transfinite methods 
would normally be selected in this situation. 

Finally, there is no need to assume »v < yp for this argument to work. As a matter 
of fact, the argument is even more interesting when the absolute continuity goes the 
other way, i.e., js < v. See, for example, Theorem 2.2 in [2]. 
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JONATHAN LEWIN, 
Kennesaw State College, 
P.O. Box 444, Marietta, GA 30061 


Editor: 


Let ABCDEF be a regular hexagon with sides of length n, a positive integer; it 
can be tiled with 6n’ equilateral triangles with sides of length 1. Now let the 
hexagon be tiled with 3n? “calissons,” i.e., rhombi formed by joining two of the 
triangles along a common side. David and Tomei [1] showed that such a tiling must 
contain exactly n* calissons with each of the three possible orientations. The 
purpose of this letter is to give a simpler proof of this fact. 

Let a be the number of calissons with sides parallel to FA and AB, b the 
number with sides parallel to AB and BC, and c the number with sides parallel to 
BC and CD. Let Ly,..., L,,, be equally spaced parallel lines such that AB lies on 
L, and DE on L,,. For 1 <i < 2n, let k; be the number of calissons with one side 
on L,_, and another on L,. 

It is clear that k, =n and k,=k,,, for 1 <i<2n. Hencek, =k, = -:-- = 
k,,=nanda+b=k,+- -+k,, = = 2n’. By symmetry we also have b +c= rn? 
and ¢-+a=2n2. whencea=b=c=n2 

The author was partly supported by NSF grant DMS 8802856. 
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FRED GALVIN 

Department of Mathematics, 
University of Kansas, 
Lawrence, KS 66045 


Editor: 


It was gratifying to read the article [1] in which George P. Richardson decried the 
omission of the midpoint rule in calculus textbooks. Indeed, an additional argument 
can be made for its use as a complement to the trapezoidal rule. 

In estimating {?f(x) dx, let T, and M,, be the trapezoidal and midpoint rule 
approximations to the integral respectively, where the interval [a, b] is divided into 
n subintervals of equal length. In order to implement Romberg’s method, it is 
necessary to compute a sequence of estimates T,, doubling the value of n each time. 
If both the trapezoidal and midpoint rules are used, then one may take advantage of 
the following formula, which may be verified easily by the reader, 


qT), — (T, + M,,)/2. 
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Consequently, once an initial value of the trapezoidal rule, say for n = 1, is 
established, function evaluations are done only when computing the midpoint rule. 
Since it is necessary to evaluate a function at a given point only once, the 
computation is extremely efficient. 
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ROBERT O. STANTON 
St. John’s University 
Jamaica, NY 11439 


Editor: 


Regarding Larry Cusick and Peter Tannenbaum’s article, “Fixed Points of the 
Twisted Cyclic Shift Operator,” Amer. Math. Monthly (Oct 89), the number of 
states of an (n + 1)-bit twisted cyclic shift counter is 2(m + 1) rather than 2 to the 
nth power as implied by the article. 

These values agree only when n = 3 which is exactly the case in the example 
given in Fig. 4. of the article. 

Unfortunately, the implied ability to reduce the complicated logic of an n-bit 
Gray code sequencer by replacing it with an (” + 1)-bit twisted cyclic shift operator 
is not valid. 


WILLIAM J. BLEWETT, 
Four Pi Systems, 

10905 Technology Place, 
San Diego, CA 92127 


Editor: 


I would like to point out that the results and the way to prove them, contained in 
the paper “A simple estimate of the error in linear approximation” by R.M. 
Gethner, this MONTHLY, Vol. 96, N° 6, pp. 522-523, are exactly the same as those 
presented on page 181 of the Classic “Introduction to Calculus and Analysis”, by 
Courant and John, Vol. 1, 1965, Interscience. 


Sincerely Yours, 


ROBERTO A. MACiAS 
INTEC-PEMA 
Guemes 3450 

3000 Santa Fe 
ARGENTINA 


UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N 1N4. 


Open Problems in Grid Labeling 


JOSEPH A. GALLIAN* 
Department of Mathematics and Statistics, University of Minnesota, Duluth, MN 55812 


Let P,, denote the path with m vertices and C,, the cycle with m vertices. From 
these we may form three kinds of grid graphs using the Cartesian product: 


P,, X P, a planar grid with m copies of P, (rows) and n copies of P,, (columns) 

P,, X C, a grid on a cylinder with m copies of C, (horizontally) and n copies of 
P_, (vertically) 

CX C, a grid on a torus with m copies of C, (latitudinally) and n copies of C,, 
(longitudinally) 


where, by the Cartesian product G, X G,, we mean the graph with vertex set 
{(x, y)|x © G,, y © G,} and two vertices (x, y,) and (x, y,) of G, X G, are 
adjacent if x, = x, and y, and y, are joined by an edge or if y, = y, and x, and x, 
are joined by an edge. 

In the past few years a number of authors, including four undergraduates, have 
investigated the problem of whether these grids can be labeled in various ways. In 
particular, a connected graph with v vertices and e edges is called graceful if it is 
possible to label the vertices x with distinct integers f(x) in {0,1,2,...,e} so that 
when each edge xy is labeled |f(x) — f(y)|, the resulting edge labels are distinct 
(and thus form the entire set {1,2,3,...,e}). One type of graceful labeling is 
interesting enough to warrant its own name. Thus, a graceful labeling f is called an 
a-valuation if there is an integer k such that for any edge xy, either f(x) < k < f(y) 
or f(y) < k < f(x). The notions of graceful graphs and a-valuations were intro- 
duced by Rosa in 1967 [17] (although the term “graceful” was coined by Golomb 
[10] and popularized by Martin Gardner [9]). Problems about graceful graphs have 
appeared in this section of the MONTHLY on several occasions [5], [11], [14]. 

If f is an a-valuation on a graph G with vertex set V and k is the smaller of the 
labels f(x) and f(y) for the edge xy with |f(x) — f(y)| = 1 then G is bipartite 
with parts {x € V|f(x) > k} and {x € Vif(x) < k}. A standard result in graph 
theory says that a bipartite graph cannot have an odd cycle. Thus, the graph 


*This paper is based on work supported by the National Science Foundation under Grant No. 
8709428. 
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C,, X P,, does not have an a-valuation when m is odd and the graph C,, x C, does 
not have an a-valuation when m or n is odd. 

In 1980, as an additive analog of graceful graphs, Graham and Sloane [13] 
defined a connected graph with e edges to be harmonious if it is possible to label the 
vertices x with distinct integers f(x) in {0,1,2,...,e — 1} so that when edge xy is 
labeled f(x) + f(y), the resulting edge labels are distinct modulo e. 

Grace [12] introduced what appears to be a stronger form of harmonious graph 
by calling a connected graph with e edges sequential if it is possible to label the 
vertices x with distinct integers f(x) in {0,1,2,...,e — 1} so that when edge xy is 
labeled f(x) + f(y) the set of edge labels is a block of e consecutive integers. 
Obviously, these edge labels are distinct modulo e. Surprisingly, there is no known 
example of a harmonious graph which doesn’t admit a sequential labeling. 

Although there have been upwards of 150 papers written on graceful and 
harmonious graphs, there are few general results. Indeed, even for problems as 
narrowly focused as the ones involving the aforementioned grids, the labelings have 
been hard-won and involve a large number of cases. The progress to date in grid 
labeling is summarized in the table below. We invite readers to fill the gaps. 

Surveys of other results and open problems on graph labeling are given in [2], [7], 
and [8]. Applications are discussed in [3] and [4]. 

In the table below we always exclude the trivial cases involving P, and C,. The 
entry NO means the labeling is impossible; an asterisk refers to the earlier 
observation that graphs with a-valuations can not have odd cycles. 


a-valuation Graceful Harmonious 


PX P, all [16, 1, 15] all (16, 1, 15] all except (m, n) = (2,2) [15] 
Com X Ph, all [15] all [15] all except (m, n) = (2,1) [15] 
Cay X Pay vt all [15] all [15] all [15] 

Cam+2 % Pons ? ? ? 


Coma X P, NO* n = 2 [6]; n > 2? all [13] 

Cam X Co, all [15] all [15] m=1,n> 1 [15]; otherwise? 
Cam X Con 41 NO* ? ? 

Cam42 X Cong NO* ? NO [13, Theorem 11] 
Come X Cong] NO* NO [16] NO [13, Theorem 11] 
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NOTES 


EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND RODICA SIMION 


-A Theorem on the Numerators of the Bernoulli Numbers 


KURT GIRSTMAIR 
Institut fir Mathematik, Universitét Innsbruck, TechnikerstraBe 25, A-6020 Innsbruck, Austria 


The Bernoulli numbers B,,, m = 0,1,2,... are useful in several fields of mathe- 
matics, such as number theory, combinatorics, and numerical analysis. For odd 
m > 3, B,, = 0. For even m > 2, write B,, = N,/D,,, where N_, D,, are relatively 
prime integers, D,, => 1. The denominators D,, are well known, due to a theorem of 
von Staudt and Clausen of 1840. In fact, D,, is the product of all primes p with 
p —1|m. The numerators N,, are less well known. However, N,, deserves special 
interest; for instance, many fundamental results concerning Fermat’s Conjecture 
rest on arithmetical properties of N,, (cf. [2]). One of the few explicit results on 
divisors of N,, is the following: 


THEOREM. Let m > 2 be even, and pa prime with p—11+m. If p" divides m 
(r => 1), then p’ divides N,,,, too. 


This theorem also goes back to von Staudt, who published a proof in a hardly 
accessible “Universitatsschrift’” (“De numeris Bernoullianis commentatio altera,” 
Erlangen 1845). It was rediscovered by several 19th century mathematicians such as 
Sylvester [7], Adams [1], or Lipschitz [5]. Misattributions were made even in recent 
times (cf. [6]). 

A number of proofs of this theorem are known. More “advanced” proofs are 
based on p-adic methods (e.g. [4]). “Elementary” proofs use subtle identities of 
power series, involving the generating function 


oO 
f(t) =t/(e'-1) = LB, t™/m! 
m=0 
of the Bernoulli numbers, and congruence considerations (cf. [2], [6], [3]). However, 
it is not easy to grasp the ideas behind these proofs. 

The aim of the present note is to convey a better understanding of the theorem 
within the framework of algebraic number theory, more precisely, of cyclotomy. Let 
n > 2 be an integer, and let { = e?7'/” (ie., a primitive nth root of unity). We 
consider the numbers B,, as special cases of more general “cyclotomic” Bernoulli 
numbers B,, ,, k =0,1,..., — 1, defined by means of the generating functions 

oO 
f(t) = t/(6* +e! — 1) = Bree /m!i. 
m=0 
Indeed, fy = f, hence B,, 9 = B,,. 

One source of our proof of the theorem is a divisibility property of B,, , for 

k # 0. It reads as follows. 


I. For each m>2, n>=2, the numbers (2n)"B,,,/m,k =1,...,n —1, are 
algebraic integers. 
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Let us briefly recall the relevant facts about algebraic integers (cf. [3, pp. 66—68)). 

(A) An algebraic integer is a complex number that is a root of a polynomial with 
coefficients in Z and leading coefficient = 1. 

(B) The set of all algebraic integers is a ring (with the usual operations 
+,-,X). 

(C) All algebraic integers contained in @ are in Z. 


The second source of our proof is a relation between the numbers B, , and B,,, 
namely, 
n—-1 


II. y Bn ~ = (n"- 1B,  (m>0). 
k=1 


Indeed, I, II, and property (B) show that (2n)"(n” — 1)B,/m is an algebraic 
integer. Due to (C), this number is in Z. Suppose now that m > 2 is even, p’|m 
(yr >1), and p—1+™m. In particular, p # 2. We can choose the number n as a 
primitive root modulo p (cf. [3, p. 41]). This means that n # 0 mod p, and that 
n? #1 mod p whenever p — 1 + g (q = 0). In particular, p divides neither 2n nor 
n™ — 1, so p” must divide N,,. 


Proof of I. As a first step, we write the numbers B,, , in a “trigonometric” shape. 
For this purpose observe that 


(er + 1)/(e — 1) = -i- cot(t) = 14+ 2/(e** — 1). 
Since f,(t) = t/(e7/("™*/"—"/2) — 1) we get 
f(t) = (-it/2) - cot(ak/n — it/2) — t/2, k=0,...,n—1. 
Now Taylor expansion of either side yields 


B,,., = m(—i/2)” cot” )(rk/n), 
for m >2 and k = 1,...,n —1. 

The occurrence of m on the right side is the decisive point in the proof of I. 
Actually we shall show that n”(—i)” cot“"~(ak/n) is an algebraic integer for 
each m > 1. In the case m = 1, write —i- cot(mk/n) = (&* + 1)/(f* — 1). Fur- 
thermore {* is a root of the polynomial 


(x"—1)/(x -1) =x" 14+--- 41 = [[(x- 8), 


and thus an algebraic integer. On evaluating this polynomial at x = 1, we get 
n—-l 


n= {[(1-%). 


i=1 
Hence —in - cot(wk/n) = —(§* + 1I1,,,(1 — $°) is an algebraic integer, by (B). 
When we differentiate the identity 
—cot(t) = 1 + cot(t)? = 1 — (-i- cot(r))’ 


repeatedly, we find that (—i)” cot’"~ (rt) is a polynomial in —i - cot(t) of degree 
m with coefficients in Z. For this reason (— in)” cot“"~(ak/n) is expressible as 
an integral polynomial in —in - cot(7k/n), and (B) proves I. 


138 RICHARD G. SWAN [February 


Proof of I. The relation II between cyclotomic and ordinary Bernoulli numbers 
is the result of a corresponding relation between the generating functions. This, in 
turn, is in substance nothing but the decomposition of 1/(x” — 1) into partial 
fractions, which has the shape 

n—-1 
1/(x"-1) = )i¢/(x - $'), withe, EC. 
/=0 
There are standard methods to determine the coefficients c,, but we shall do it in an 
ad hoc way here. First multiply the above equation by x — ¢*. Then put x = ¢* and 
obtain 


n—-1 
LTT (S* = 8") =, =o} TT - &). 
1#k i=1 

As we have seen in the proof of I, the final denominator equals n, so c, = §*/n. 
Hence 


n/(x"—1) -1/(x - 1) = Luss — 1). 


The substitution x ~ e’ shows that 


n—-1 
fo(nt) — fo(t) = » f,(t), 
k=1 
and comparison of coefficients yields II. 
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Another Proof of the Quadratic Reciprocity Theorem? 


RICHARD G. SWAN 
Department of Mathematics, University of Chicago, Chicago, IL 60637 


Here is a short proof of quadratic reciprocity based on an idea of Milgram [2]. 
Since this theorem probably holds the world’s record for the number of known 
proofs, it is hard to believe that this one is really new, but I have not been able to 
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find this particular version in the literature. I would like to thank Milgram for 
pointing out the paper [1] which contains the theorem proved below. The proof in 
[1] makes use of Gauss’ lemma rather than the norm calculation used here. 

Let o, be a primitive pth root of 1 and set K, = R M Q(o,). The element 1 — o, 
is prime in Q(o,) with norm p. Let a, be the norm of 1 —o, in K,. Then 
N(a,) =p and 7,=(1—o,)\(1—0,*)=2—0,-0,". If p and q are primes 
consider 7 = 7, — 7, in L = K,K,,. It is a unit since 9 = o, (1 — 6,0, )(1 — 6, G,). 
Since 9 = 7, (mod 7,), N(q) = N(a,) (mod q). Now N(1,) = Nx, /oNt/x(%) = 
Nx yo(m* °/*) = pq D/2 = (p/q) mod q by Euler’s criterion. But N(n) = +1, 
so we have proved the following result. 


THEOREM. N(7, — 7) = (p/q) for odd primes p and q. 
Interchanging p and qg now gives quadratic reciprocity. 


COROLLARY. ( p/q)(q/p) = (— 1)? PG" for odd primes p and q. 


When q = 2, we can consider instead + = 2 — 7, = 6, + 6," 


” in K,. The same 
argument gives N(r) = (2/p) but switching p and 2 is useless. Instead consider 
fp= xr bees +1 +--+ 4x7" = 82) where z=x+x7'. Then 2g,(z) = 
8n-1(Z) + 8n4i(Z) 80 8,41(0) = —g,-1(0) showing that g,(0)=1 for n=0,1 
mod4 and g,(0) = —1 otherwise. Since g, is the minimal polynomial of 7 for 
n=(p-—1)/2, we see that (2/p) = M(t) = (—1)"g,(0) = (-1)" 9 as _re- 
quired. 
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Adding up to Powers 
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Here is a short combinatorial proof that a somewhat mysterious way of generat- 
ing the Ath powers of the natural numbers actually works. It generalizes the 
well-known fact that the sum of the first n odd numbers is n?. The proof makes a 
nice presentation in an introductory combinatorics course. It uses recurrence 
relations, proof by 1-1 correspondence, and counting of lattice paths in a problem 
that students are likely to find appealing. 
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After discussing the kth powers, we will discuss a generalization of this proce- 
dure. We can still explain the resulting numbers as values of polynomials. We will 
see, however, that generating the Ath powers is a very special case. 


1. Generating the Perfect Powers. The procedure is as follows. Begin by writing 
down the natural numbers in sequence, calling this row 1. Next, strike out every kth 
term. Take partial sums of the surviving sequence to generate row 2. In general, 
generate row j + 1 from row j/ by striking out every k + 1 — jth term and taking 
partial sums of the surviving sequence. The claim is that row k consists of precisely 
the kth powers of the natural numbers, in sequence. The procedure is illustrated 
below for k = 4. Note that the successive deletions make the numbers appear in 
wedges; a number is in wedge n if the first deletion to its right is the nth deletion in 
that row. 

1234 5 6 78 9 101112 13 1415 16 17 18 19 20 
136 111724 33 43 54 67 81 96 113 131 150 
14 15 32 65 108 175 256 369 500 
1 16 81 256 625 
The numbers in the major diagonal of each wedge have a recognizable form. Let 


a, , be the number in row j of the nth major diagonal. Then in fact a, , = ni(* ). 


n 


We prove this by showing that the a, 
same recurrence. First we study the 5, ,. 


and the numbers 5, ,, = ni(« satisfy the 
, J 


LEMMA 1. The values 6; ,, = ni(*) satisfy the recurrence b; , = Dia * = Nb, n—1 


Proof. Consider the sequences formed from the integers {0,...,}. Let S, ; be 
the sequences that have k terms, of which 7 are nonzero. Both sides of the claimed 
equality count S, ;. To show |S, ;| = ni(4 ), simply choose the positions for the 
non-zero entries and fill them in. To show the sum also counts S, ,, partition S, , 
by the number of positions containing terms greater than 1. This number may be 
anywhere from 0 to /; if it is i, the positions of terms exceeding 1 can be chosen in (* 
ways, the positions can be filled in (n — 1)' ways, and the j — i 1’s can be placed in 


- ‘ ways. O 
To prove the desired theorem, we need only verify the initial conditions and show 
that the a, , satisfy the same recurrence. First, we need a row corresponding to 
j = 0. So, add a Oth row to the display, consisting entirely of 1’s. In particular, 
ay, = 1. The wedge n = 0 is entirely 0 except ayy = 1. 


THEOREM 1. a, ,, = ni(*), 
, J 


Proof. The proof is by induction; we have already verified the basis. We need 


only show a, , = Lio‘ - ‘Na, ,_, for n > 1. Note that every entry in the display is 
the sum of the entry to its left and the entry above it. If we imagine an a — 1th row 
consisting of 0’s, this means every a, ,, can be broken down into contributions from 
entries in the main diagonal of the n — 1st wedge. We need only determine how 
many times each a; ,_, contributes to each a, ,. This is precisely the number of 


paths from a, ,,_, to a,,, that always move rightward or downward at each step. 
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Due to the deletions in each row, there is no element below a, ,,_,, So every path 
from a, ,_; must take its first step to the right. From there, the paths to a, ,, are 
precisely the lattice paths counted by the appropriate binomial coefficients. These 
paths enter the nth wedge in the ith diagonal, and they end at the kth diagonal, so 
the total number of steps is k — i. The number of steps downward is j — i, so the 


correct coefficient is indeed (\ =‘). (In fact, more generally, the jth entry in the /th 


diagonal of the nth wedge is ane - ‘\n4(*).) Oo 
In particular, a, ,, = n*, as desired. 


2. A Generalization. The procedure generating the powers can be generalized to 
yield other polynomials. As before, begin with row 0 having a 0 in position 0 and 1’s 
thereafter. At row j, choose a modulus m, and a remainder r, with 1 <r; < m,. 
Create row j + 1 by 1) deleting from row j the entries r, + sm, for s > 0, and 2) 
computing partial sums of what remains. We call this the addition procedure. Row 1 
is always the natural numbers, so we don’t bother to specify mp, 7. To generate the 
powers of the natural numbers, we used the addition procedure with m, = k + 1 — j 
and r, = m,. We claim that, for any choice of {m,} and {7,}, the sequence arising 
at each row consists of the values of a finite number of polynomials, evaluated at 
consecutive integers. To prove this, we first review elementary facts about polynomi- 


als and binomial coefficients. 


LEMMA 2. Any polynomial p(n) of degree j has a unique expression as a linear 
combination of "), -..5( 7). 

Proof. If the leading coefficient of p(n) is c, let the coefficient of @ be j!c and 
apply induction. 0 


LEMMA 3. If a,b are constants and p is a polynomial of degree j with leading 
coefficient c, then &'_,.p(as + b) is a polynomial in t of degree j + 1 with leading 
coefficient ca//(j + 1). 


Proof. The expression p(as + b) is a polynomial of degree 7 in s. Using the 
preceding lemma (and its proof), we express it as Li ocl° ), with c; = ca/j\. By 
interchanging the order of summation and applying the fundamental combinatorial 
identity ri _o(’) = (+), the desired sum becomes Loe : ‘), This is a polyno- 
mial in ¢ of degree j + 1, with leading coefficient c,/(j + 1)! = ca’/(j + 1).0 

We say that a sequence D,,... is a polynomial sequence of degree k if b; = p(i) 
for 1 <i <n for some polynomial p of degree k. 


THEOREM 2. Let {(7;,m,)} be an instance of the addition procedure. For each j, 
there is a modulus M, such that the congruence classes of positions mod M, partition 
row j into polynomial subsequences of degree j. Furthermore, the leading coefficients of 


these polynomials are equal and depend only on (m,,..., M,). 


Proof. We prove this by induction on j; for 7 = 1 we may choose M, = 1 and 
the polynomial p(i) = i. 

Consider 7 > 1, and suppose the claim holds for row j with polynomials 
Po>---> Py —1 having leading coefficients c. Let f, denote the ith element of row 
j + 1. Let ‘a= Icm(M,, m,), and define M,,, = a(1 — 1/m,). Fix an integer r with 


142 EDWARD A. BENDER, FRED KOCHMAN, AND DOUGLAS B. WEST [February 


0 <r <M, and consider the subsequence g, = f, M +r The difference g,,, — g, is 
composed of contributions from the polynomials { p,}. The choice of M,,, ac- 
counts for the deletions from row / so that the pattern of contributions to g,,, — g, 
from elements explained by a given p, is the same for each value of s. Each 
polynomial p, contributes a consecutive segment of its values, except that gaps arise 
due to deletions from row j. In particular, the contribution from p, has the form 
L4_.p)(n + a,), where d < a/M,, the a;,’s are d of the first a/M, natural numbers, 
and the omissions depend only on r and /. For any value of n, this is the sum of d 
jth-degree polynomials in n. Hence it also has degree j, and it has leading 
coefficient dc. 

The value of n used in this polynomial to compute the contribution of p, to 
£541 — 8, is (a/M,)s + |r/M,|. The contribution to g,— gq from p, is thus a 
partial sum of equally-spaced terms from a polynomial sequence of degree 7. By the 
lemma, the sum is a polynomial in ¢ of degree 7 + 1, with leading coefficient 
dc(a/M Ay /(j + 1). Finally, we collect the contributions to g, from all p, and add 
the constant g, = f. to express g, as a polynomial in s of degree j + 1. The leading 
coefficient is c(a/M i)! /(j + 1) times the sum of the values d associated with the 
various p,’s. This sum is always M,,,, because it equals the number of elements of 
row j added in to increase g, to g,,,. This completes the induction. 0 


With M, = 1 and M,,, = lem(M,, m,) - (m, — 1)/m, as in the proof, we con- 
clude that the leading coefficient for the polynomials explaining row j; is 
(1/j IT f=7-M, , ,(iem(M,, m,)/M,)". 

3. Additions and Differences. In the special case considered in Section 1, the 
polynomial b,(n) = (‘|n’ is precisely the polynomial generated by the above 
procedure for row j when m,=7,=k +1—j for j =1,...,k. 

As another example, consider the addition procedure with m, = 3, r, = 2, 
m,= 5, r, = 3. The first three rows of the resulting display are 


O123 45 6 7 8 9 10 11 12 #13 14 #215 += «+16 17... 


0 1 4 8 14 21 30 =640 52. 65 80 696 
0 1 5 19 40 70 110 175 255 351 


Taking the polynomial n for row 1, the two polynomials obtained for row 2 are 
6(") + 4n = 3n* +n and 6(”) + In + 1 = 3n? + 4n + 1. In passing from row 2 
to row 3, the pattern of deletions versus usage of polynomials repeats after 8 
surviving terms of row 3, because 8 = Icm(2,5) - (4/5). Thus eight polynomials 
suffice to explain the elements of row 3; the first two are 1200(”) + 1300(”) + 255n 


and 1200(") + 1450(”) + 350n + 1. 

Note that we expressed these polynomials in terms of binomial coefficients. Once 
we know which values are covered by one of these polynomials, we can obtain the 
full polynomial quickly from 7 + 1 elements of row j by using the method of 
repeated differencing. Take the first 7 + 1 values of one congruence class mod M,, 
the first of which will be the value of the desired polynomial at n = 0. Take the 
differences of consecutive values to get a sequence of j values. Continue this 
differencing to produce successively shorter sequences, the j + 1th sequence having 
only a single value. Let cy,..., c; denote the initial values in these j + 1 successive 
difference sequences. It is easily proved by induction that the ¢th difference 
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sequence consists of the first 7 + 1—t values of the polynomial is cd, ” ). In 
particular, the polynomial giving the initial sequence is Ye"). 

At this point, the curious reader will wonder what polynomials can arise from 
this procedure. In general, this appears to be difficult. However, suitable restrictions 
of the question yield interesting results. 

First, note that winding up with a single polynomial as we did in Section 1 is a 
fairly special occurrence; it seems to require M, = 1. Nevertheless, we can always 
guarantee it by choosing m; = k + 1 — j. This still leaves a fair amount of variabil- 
ity in the choice of 7,, but since 0 < 7; < m,, this limits us to at most k! polynomials 
for each value of k. Furthermore, the computation in the proof of Theorem 2 shows 
that these are all monic polynomials, which makes this class even more interesting. 

Using the differencing procedure and the relationship between {(")} and {n'‘} 
via the Stirling numbers, it is a pleasant exercise to calculate some of these 
polynomials. For k = 2 the two polynomials are n? and n* + n. For k = 3, the six 
polynomials can be described concisely; n? has a different form from the other five, 
which are n? + j(n* + n)/2 —n, for j = 1,2,3,4,5. For n = 4, more formulas are 
necessary to explain all the polynomials, and the polynomials are not all distinct. 
We will close by listing those for which all coefficients are integers. 


The appearance of the pure powers no longer looks quite so strange. Certainly 
there are some patterns here that are worth exploring. 


Note added in proof: We have learned from C. T. Long and from this MONTHLY, 
95 (1988) 708 that the process of our Section 1 is called Moessner’s process and has 
been known since 1951. See Long’s paper in this MONTHLY, 73 (1966) 846-851, for 
references and a proof of a certain generalization, different from our generalization. 


THE TEACHING OF MATHEMATICS 


EDITED BY MELVIN HENRIKSEN AND STAN WAGON 


A One-Sentence Proof That Every Prime p = 1 (mod 4) 
Is a Sum of Two Squares 


D. ZAGIER 
Department of Mathematics, University of Maryland, College Park, MD 20742 


The involution on the finite set S = {(x, y,z) € N?:x* + 4yz = p} defined by 
(x+2z,z,y—-x-—z) ifx<y-z 
(x,y,z) (Qy-—x,y,x-ytz) ify-z<x<2y 
(x-2y,x-y+z,y) ifx>2y 


has exactly one fixed point, so |S| is odd and the involution defined by (x, y,z) 
(x,z, y) also has a fixed point. 0 


This proof is a simplification of one due to Heath-Brown [1] (inspired, in turn, by 
a proof given by Liouville). The verifications of the implicitly made assertions— that 
S is finite and that the map is well-defined and involutory (i.e., equal to its own 
inverse) and has exactly one fixed point—are immediate and have been left to the 
reader. Only the last requires that p be a prime of the form 4k + 1, the fixed point 
then being (1,1,). 

Note that the proof is not constructive: it does not give a method to actually find 
the representation of p as a sum of two squares. A similar phenomenon occurs with 
results in topology and analysis that are proved using fixed-point theorems. Indeed, 
the basic principle we used: “The cardinalities of a finite set and of its fixed-point 
set under any involution have the same parity,” is a combinatorial analogue and 
special case of the corresponding topological result: ““The Euler characteristics of a 
topological space and of its fixed-point set under any continuous involution have 
the same parity.” 

For a discussion of constructive proofs of the two-squares theorem, see the 
Editor’s Corner elsewhere in this issue. 
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Inverse Functions and their Derivatives 


ERNST SNAPPER 
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If the concept of inverse function is introduced correctly, the usual rule for its 
derivative is visually so obvious, it barely needs a proof. The reason why the 
standard, somewhat tedious proofs are given is that the inverse of a function f(x) is 
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usually graphed by flipping the graph of f(x) over the 45° line. And although this 
procedure is formally correct, it hides the visual obviousness of the formula for the 
derivative of the inverse. In short, graphing the inverse by flipping over the 45° line 
is an error in pedagogy. 

The concept of the inverse of a one-to-one and onto function should first be 
explained on the set-theoretic level. This means that domains (of definition) and 
ranges should not be restricted to subsets of the real numbers, but arbitrary sets, 
and also subsets of the plane and 3-space should be used. It should be stressed that 
the inverse is obtained by simply interchanging domain and range. If this is done 
properly, the students should feel irritated that their time is being wasted on such 
trivialities. 

What does this mean for the graph of the inverse of a real variable function f(x) 
in one variable? Suppose the graph of f(x) is given: 


Fic. 1. The graph of y = f(x). 


How does one find the graph of the inverse g(x) of f(x)? Since the students 
know that this is a matter of interchanging domain and range, they will quickly and 
correctly say: Interchange the x-axis and y-axis. Hence the graph is: 


Fic. 2. The graph of the inverse y = g(x). 
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The instructor should stress that the fact that the two axes are now in an 
unnatural position is entirely immaterial. This is the graph of the inverse which is 
helpful and effective. For example, the standard theorem that f(x) is strictly 
increasing (decreasing) iff its inverse has the same property is now visually obvious. 
The same holds for all other theorems. Let us work it out for the derivative. 

The students have learned that the derivative of f(x) at a point P on the curve 
y = f(x) is tana, where a is the angle which the positive x-axis makes with the 
upward direction of the tangent of y = f(x) at P: 


Fic. 3. f’(P) = tana. 


Interchanging the x-axis and y-axis gives, by the same rule, that the derivative of 
the inverse g(x) of f(x) at P is tan B: 


Fic. 4. g(P) = tan B. 
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Since a + B = 90°, tan B = 1/tana and the formula for the derivative of the 
inverse has been obtained. 

This method of interchanging the x-axis and y-axis is also very efficient for 
viewing the graph of the inverse function with the axes in their natural position. 
Simply graph the function f(x) on a piece of paper, putting a little pressure on the 
pencil, interchange the x and y and now look at the graph through the back of the 
paper with the axes in their natural position. What you see is the customary graph 
of the inverse of f(x). It is very effective to do this in class, choosing for f(x) some 
trigonometric function or a polynomial function such as y = x” or y = x*. You can 
easily look through the back of the paper by holding it up to a window on a sunny 
day or up to a light. No visual aids are necessary or even helpful. 

Of course, once the above has been done, there is nothing against making the 
students observe that the graph of the inverse of f(x), viewed through the back of 
the paper, is the same as the graph of f(x) flipped over the 45° line. This flipping 
over the 45° line can again be done easily with pencil and paper but, at least with 
some programs, computers can do it too. The fact remains, however, that for the 
purpose of understanding the laws which govern the inverse, nothing beats inter- 
changing the x-axis and y-axis in the graph of f(x) and doing nothing else. 


Extending the Principal Axis Theorem to Fields Other Than R 


STEPHEN H. FRIEDBERG 
Department of Mathematics, Illinois State University, Normal, IL 61761 


Perhaps one of the most important theorems in all of linear algebra is the 
principal axis theorem, which states that a real symmetric matrix is orthogonally 
equivalent to a diagonal matrix; see, for example, [1, p. 337]. This theorem finds 
many applications in geometry and numerical analysis as well as statistics and 
physics, particularly in cases where symmetric bilinear forms arise. It is seldom 
pointed out that this theorem is not true, however, if no constraints are made on the 
underlying field F. The first four examples illustrate the difficulty in extending the 
principal axis theorem to fields other than the field R of real numbers. 


Example 1. Let F = C, the field of complex numbers, and 


A=) 


The characteristic polynomial of A is f,(x) = x7; so the only eigenvalue of A is 0. 
If A were diagonalizable, then A would equal the zero matrix. 


Example 2. Let F = Q, the field of rational numbers, and 


(2 3) 


The characteristic polynomial of A is f,(x) = x” — 4x — 1, which has no rational 
zeros. So A is not diagonalizable. 
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Example 3. Let F = Z.,, the field of two elements, 0 and 1. If 


_{0 1 
4=( i) 
then f,(x) = x* +x +1. Because f(0) = 1 and f(1) = 1, we see that A has no 
eigenvalues, and hence A is not diagonalizable. 


Example 4. If F = F(C) is the field of algebraic numbers (the complex zeros of 
polynomials in Q[x]), the matrix of Example 1 shows that the principal axis 
theorem fails. 


Example 5. Let F = F(R) be the subfield of F(C) consisting of the real algebraic 
numbers. Suppose that A is a symmetric matrix over F(R) and f, is the characteris- 
tic polynomial of A. By the principal axis theorem, f, factors over R in such a way 
that A is orthogonally diagonalizable. But because F(C) is algebraically closed, the 
eigenvalues must be elements of F(R). So A is orthogonally diagonalizable over 
F(R). 


The preceding example shows that the principal axis theorem is valid when the 
underlying field is F(R). The main result of this article demonstrates that the 
principal axis theorem does not extend to any finite field. In fact, the situation there 
is hopeless in that there always exists a 2 X 2 symmetric matrix without any 
eigenvalues. Although purely algebraic proofs may be constructed, they assume 
results from field theory. The proof given here uses only a simple counting 
argument, and therefore can be given to students in an introductory linear algebra 
course. 


THEOREM. Let F be a finite field. Then there exists a 2 X 2 symmetric matrix 
(over F) that possesses no eigenvalues. 


Proof. Suppose that F has n elements, and that A is an arbitrary 2 Xx 2 
symmetric matrix, say, 


The characteristic polynomial of A is 
f(x) =x? — (at+c)x + (ac — bd’). (1) 
A quadratic monic polynomial f(x) is reducible if and only if it has the form 
f(x) = (x — d)(x - e). 
If we first count the number of distinct choices of d and e, and then add the 


number of choices where d = e, it follows immediately that the number of reducible 
quadratic monic polynomials is 


(2) + n= n( 22 


; ; (2) 


We shall show that the number s of polynomials of the form in (1) is greater than 
the number of reducible quadratic monic polynomials. It will then follow that there 
exists a symmetric matrix A whose characteristic polynomial is irreducible, and, 
hence, A has no eigenvalues. 
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For each k € F, let s(k) equal the number of ways of choosing the constant term 
ac — b? in (1), given that a + c =k. Notice that if a + c = k, then ae — b* = ak 
— a* — b’. So s(k) equals the number of elements in the set 

{ak — a* — b*: a, be FY}, 


and 


If we can show that s(k) > (n + 1)/2 for every k, with strict inequality for at least 
one k, then we will have that s > n(n + 1)/2, which by (2) equals the number of 
reducible quadratic monic polynomials. Note that for any k, if we let a = 0, then 
we obtain the constant term —b’. Because —b? = — bs if and only if b, = +by, 
there are at least 1 + (n — 1)/2 = (n + 1)/2 (distinct) negative squares. So s(k) > 
(n + 1)/2 for every k. 

If every element of F is a (negative) square (e.g., if F has characteristic 2), then 
there are n such constant terms, and we are done. So suppose that there exists e € F 
such that e is not a negative square. Let k, =e + 1, a= 1, and b = 0. Then 

ak, - a? — b? = (e+ 1)-1-0°% =e. 
Thus 

s(ky) > (n+ 1)/24+1>(n+1)/2, 
and the proof is complete. 


Problem. Classify exactly those fields for which the principal axis theorem is true. 
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ELEMENTARY PROBLEMS 


E 3367. Proposed by John Kieffer, University of Minnesota, Minneapolis. 


Let A, and A, be disjoint infinite sets of positive integers whose union is the set 
Z* of all positive integers. For each infinite subset S of Z* let f(S) be the set 
obtained from S by removing the least element s of S and then adjoining all the 
elements of A,_, less than s if s isin A,. Let f7(S) = f(f(S)), f°(S) = f(f7(S)), 
etc. Given any positive integer k, prove that for sufficiently large n either 


kef"(Ay) Nf"(A,) or k Ef"(Ag) U f"(A;). 


E 3368. Proposed by I. E. Leonard, University of Alberta, Edmonton, Alberta, 
Canada. 


Suppose a > 0. Let us define a sequence a,,a,,... by taking a, as an arbitrary 
positive number and putting a,,, = a,exp{—a%} for n = 1,2,... . Find the values 
of a for which the following series converge: 


oO oO oO 

(i) Lia,, (ii) Lean, (ii) Da 
n=1 n=l n=] 

E 3369. Proposed by Jiro Fukuta, Aichi University, Aichi, Japan. 


Suppose we are given a piece of paper in the shape of an equilateral triangle 
ABC. Suppose P is a point in the intersection of the three open circular discs with 
diameters AB, BC, CA. If we fold the three corners of the paper in such a way that 
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the vertices coincide with P, we get a hexagon three sides of which are the creases 
formed and the other three sides of which are portions of the sides of triangle ABC. 

Prove that the area and perimeter of this hexagon are both maximized when P is 
the centroid of ABC. 


E 3370. Proposed by Gavin Brown, University of New South Wales, Australia and 
Larry A. Shepp, A.T.& T. Bell Laboratories, Murray Hill, NJ. 


Suppose f and g are nonnegative continuous functions compactly supported on 
the real line. Hdlder’s inequality gives 


sup [f(x — y)g(y) dy <I lipllall, 


for positive numbers p,g with p~! + q~! = 1. Prove or disprove that 


[sup {f(« — y)a(y)} de = WN ipllalla- 


E 3371. Proposed by A. M. Garsia and B. A. Sethuraman, University of California 
at San Diego. 


Let S, denote the symmetric group on the n symbols 1, 2,..., 7. Given o in S,, 
let 
M(o) = {i: 1 <i<n,i2zo(i)}. 
Put 
P(o) =|M(o)l? DY {i+ o(i)}. 
te M(o) 
Prove that 


y P(c) =(n+1)! 


o&ES,, 


E 3372. Proposed by W. A. Bassali, Kuwait University, Safat. 
Suppose 6, = (20 ar for n = 0,1,2, ... . Prove that 


[7 sin-\(sin? 0) d9 = [ cos” vcos 6 d6 
0 0 


c sinh’ x _ of"? 6 d0 
0 0 


Vsec*6+1 


x 
= = [oleosec 8 + 1d6@= > (9) 6, 
2 0 n=0 (2n + 1) 
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For general a,,..., 4,, b,,..., b, we have X(a, — b,)? > 0, which may be rewrit- 


ten 


n? 


2-2¥ab,>1- Vat+1—- Ye. (5) 


i=1 i=] i=1 


We now assume (4). In view of (2) and (3), we see that (5) becomes 
2), *a,a, > dL *4,a, + » *b,b;. 


Dividing by 2 and applying the arithmetic-geometric mean inequality, we get the 
inequality of the problem. Furthermore, equality holds only when Jb, = Xa, 
(i = 1,..., 2) for some constant X. 


Editorial comment. Allen Pedersen and Walther Janous independently remarked 
that the result of the problem is a special case of Aczél’s inequality. It can be found 
in D. S. Mitrinovi¢é, Analytic Inequalities (Springer-Verlag, 1970), p. 57, where there 
is also the generalization (L*a,b,)? > [(a,)? — LaP|[(XUb;)? — Lb?], where p > 1 
and the numbers a,,...,a,, b,,..., b, are nonnegative. 


n? 


Also solved by J. Fukuta (Japan), W. Janous (Austria), F. Kubo and K. Kubo (Japan), L. Kuntz 
(student, West Germany), H. E. Lomeli (student, Mexico), O. P. Lossers (The Netherlands), W. Noll, 
A. Pedersen (Denmark), S. Philipp, G. S. Rogers, H.-J. Seiffert (West Germany), the Chico Problem 
Group, the University of South Alabama Problem Group, and the proposer. 


Derivatives of Chebyshev Polynomials 


E 3224 [1987, 681]. Proposed by Bruce Reznick, University of Illinois at Urbana- 
Champaign, and John Todd, California Institute of Technology. 


Let n > 3 be an integer. Determine all real polynomials p of degree n having n 
real zeros a, < a, < --: <a, such that 


[ip()ia (1<k<n-1) 


aK 
is independent of k. 


Solution by Jim Delany and Kempton Huehn, California Polytechnic State Univer- 
sity, San Luis Obispo. The first derivatives of the Chebyshev polynomials have this 
property. Except for scale changes and translations along the x-axis, these are the 
only such functions. 

We first demonstrate that after normalization there is at most one such polyno- 
mial. Let p(x) be such a function. The n — 1 arches created by p and the x-axis 
enclose the same area. Since p has no multiple roots, the arches are alternately 
above and below the x-axis. We normalize p by translating to obtain a, = 0, scaling 
the x-axis to obtain a, = 1, and scaling the y-axis so the area of each arch is 1 and 
p is positive between a, and a,. This replaces p(x) by 


(a, — a,) p(a, + (a, - a,)x) . 
[r() dt 


Suppose p and gq are two such normalized polynomials, with zeros a,,..., a, and 
b,,..., 5,, respectively. Let P(x) = {p(t) dt and Q(x) = f5‘q(t) dt. By construc- 


9 n? 
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tion, P(a,) = Q(b,) = 0 if i is odd, and P(a,) = Q(b,) = 1 if i is even. Now, P 
and Q are polynomials of degree n + 1. We will show that R = P — Q has more 
than n + 1 zeros, thus implying that P — Q is identically zero and p = gq (since 
p(9) = q(0) = 0). 

Since P,Q, P’,Q’ are all 0 at 0, R has a double zero at 0; ie. x* divides R. 
Similarly, P(1) = Q(1) and P’(1) = Q’(1) = O imply that (x — 1)” divides R. In 
addition to these four zeros of R, we need to guarantee n — 2 more. 

We may assume b, < a,. Since Q(b,) = 1 => P(b,) and P(a,) = 1 > Q(a,), we 
have R(a,) => 0 > R(b,). Thus R has a zero in [b,, a,]. In each of the intervals 
[a,,a3],...,[@,_5, 4,1], P ranges between the extremes 0 and 1, and Q is bounded 
by 0 and 1, so there is a point where they are equal. Counting [b,, a,], we have zeros 
in n — 2 interior intervals. These are distinct unless a, is the zero for the two 
intervals starting and ending at a,. However, if R has a zero at a,, then 
QO(a,) = P(a,) = {1+ (—1)*}/2; thus a, is one of the b, and so Q’(a,) = 
P’(a,) = 0. Hence, if R has a zero at a,, it is a double zero and we can count it 
once in each of the two intervals starting and ending at a,. Summing up, we see that 
R has at least n + 2 zeros and hence is identically zero. 

For existence, consider the Chebyshev polynomial of degree n, given by 


T(x) f(x + x2—-1) +(x- = 1)'| 


cos(n cos~' x) if |x| <1 
cosh(ncosh7'x) ifx>1 


The first few such polynomials are 1, x, 2x? — 1, 4x? — 3x, 8x*— 8x*+ 1, and 
16x°* — 20x? + 5x. The only relative extrema are T(x) = +1, when x = 
cos(ka/n), for k = 1,...,n — 1. When k = 0 or k = n, we also have T(x) = +1, 
but the derivative is not 0. The derivative of J, ,,(x) is a polynomial of degree n 
having the desired property. Its roots are a, = cos[7(n + 1—k)/(n + 1)] for 
k = 1,...,, where we have taken care to specify the zeros in increasing order. 


Editorial comment. Shorter solutions were offered by O. P. Lossers and W. W. 
Meyer, who applied the following known result from approximation theory to 
P = f{p: the polynomial of fixed degree which best approximates a constant on an 
interval (in the uniform norm) is, up to translation and scaling, a Chebyshev 
polynomial. See, for example, E. W. Cheney, Introduction to Approximation Theory, 
McGraw Hill, 1966, p. 61. 


Solved also by S. M. Gagola, Jr., O. P. Lossers (The Netherlands), W. W. Meyer, S. Philipp, 
L. Verde-Star (Mexico), and the proposers. One partial solution was received. 


Points on a “Polynomial Approximation” to a Circle 


E 3225 [1987, 786]. Proposed by Bjorn Poonen, Winchester, Massachusetts. 


Let r be a fixed real number greater than 1, and let s,(x) and c,(x) denote the 
nth Maclaurin polynomials for the sine and cosine functions, respectively. If x, is 
the smallest positive solution of 


s(x) + ¢,(x)° =r, 
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show that 


° xi 
lm — = -. 
noo Nn e 


Solution by Stan Philipp, Pennsylvania State University, Altoona. We shall show 
that if is sufficiently large (depending on r), then 
x,>e Nn (1) 
and 
int tt" = eInel8"/" < en + logn. (2) 


The method actually shows that x, = e~'n + (logn)/(2e) + O(]). 
For real x put 


x, <e 


(6) 


Ty(x) =e" — {eq(x) + isg(x)} =D (ix) "7k! (3) 
and 
P,(x) = ¢, (x) + s,(x)? — r= |e™ — T, (x)? =r. (4) 


To prove (1) note that for 0 < x < n/e we have 


BONS =| + ope? | 


lA 
=| * 
a 
oO | = 
% | 

+ 
—— 


n” 0 1 
<— < — =O|—~ |. 
~ ni en! yn 
Therefore, if n is sufficiently large, we have |T,(x)| < vr — 1 for0 < x <n/e and 
so P(x) < 0 for 0 < x < n/e by (4). This proves (1). 
To prove (2) note that if x = e 4n'*!/", then by (3) 


xr ix (ix)? 
IT, (x)| = ake a) + Ga Dn? 3) + | 
~ (n+1)! 1 ~ (n+ 2)(n + 3) 


CESS CE CES CE 


ntl x2 
> ——_ (1 - — 
I atROTS| 
the last inequality being true because the terms in the preceding alternating series 
decrease in absolute value. (We have x = e-1n't!/" < n/2 for n > 6.) Hence 
x" x | 1 | x" 13 n™n 3 


T,(x)|=— > —-—=——— > 
IT, (=) nint+]1 nie 4 e"n! 4e evn 


156 PROBLEMS AND SOLUTIONS [February 


for a positive constant c, by Stirling’s formula. Hence, if n is sufficiently large, 
\T,(e-'n*1/")| > vr + 1 and hence P,(e~1n!*1/") > 0 by (4). Thus (2) is proved. 


Editorial comment. Several solvers verified that positive roots exist for each 
positive integer n by noting that s, (0)? + c,(0)? = 1 while 


Jim {s,(x)° + c,(x)°} = 00. 


Solved also by P. Alsholm (Denmark), P. Bracken (Canada), O. P. Lossers (The Netherlands), T. S. 
Norfolk, the University of South Alabama Problem Group, and the proposer. 


Parallel Ratios in a Semicircle 


E 3259 [1988, 350]. Proposed by Jordi Dou, Barcelona, Spain. Let R be a 
semicircular region bounded by a line L and a semicircle S with center on L. 
Suppose P, and P, are given points in the interior of R. We wish to find parallel 
lines /,,/, through P,, P,, respectively, such that 

PC, PLC, 
P \D 1 7 P 2 D 2 


(*) 


where C,, D, are the intersections of /, with S and L and C,, D, are the 
intersections of /, with S and L. Give a necessary and sufficient condition on 
P,, P, for such parallel lines to exist. 


Solution by Emil Raychev, Portland, Maine. The necessary and sufficient condi- 
tion is that the line P,P, intersect L outside R or be parallel to L. 

For necessity, suppose /, and /, are lines with the desired property. By the 
relation (*), the lines C,C, and P,P, pass through a common point M external to R 
on the line L. Hence the existence of /,, /, implies that the line P,P, intersects L 
outside R or is parallel to L. 

Furthermore, if P,P, meets L at M outside R (with P, between P, and M), let ¢ 
be a line through M and tangent to S (at a point T). Then MC, - MC, = MT? and 
(MC,/MC,) = (MP,/MP,). Multiplying these yields 


(Mc)? = (MT 
| 2 MP,” 


Therefore, it is sufficient to construct a point A on segment MT such that 
MA = MT - (MP,/MP,) and then to construct a segment MC, with C, on S such 
that (MC,)* = MT - MA. 

To produce A, construct a line through P, parallel to P,T; let A be the 
intersection of this with MT. The parallel lines cut MP, and MT in the same ratio, 
so MA has the desired length. To obtain C,, construct a semicircle with diameter 
MT and erect the segment through A perpendicular to MT. This meets the 
semicircle at a point B with (MB)* = MT- MA. Thus MB has the desired length 
MC,, and the circle centered at M that has this radius meets S at the desired point 
C,. C, is the other intersection of MC, with S. 

If P,P, is parallel to L, then the solution is obtained by constructing a chord 
C,C, parallel to and having the same length as P,P,. 
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Solved also by C. W. Dodge, J. Fukuta (Japan), H. Guggenheimer, W. Janous (Austria), L. Kuipers 
(Switzerland), and the proposer. 


Asymptotic Estimates for Convergents of a Continued Fraction 


E 3264 [1988, 352]. Proposed by Stanley Rabinowitz, Alliant Computer Systems 
Corp., Littleton, MA. 


Let P/Q, be the nth convergent for the continued fraction 


“T+ 243444 0°” 


i.e., let P; = 1, 0, = 1, P, = 2, Q, = 3, and 
P,, _ nP,,-} + P25 QO, = nQ,-1 + Q,-2 (n a 3). 


(Cf. Hardy and Wright, An Introduction to the Theory of Numbers, Chapter 10.) 
Give asymptotic estimates for P, and Q,. 


Composite solution I by G. A. Edgar, Ohio State University, and Klaus Zacharias, 
Berlin. As n tends to o, P, and Q, are asymptotic to [,(2)n! and [,(2)n!, 
respectively, where J,, J, are the modified Bessel functions usually so designated. 
The numerical values are /,(2) = 1.59063685... and J)(2) = 2.27958530... . 

It should first be noted that the given continued fraction converges (cf. W. B. 
Jones and W. J. Thron, Continued Fractions: Analytic Theory and Applications, 1980, 
p. 88). Next, both P, and Q,, satisfy the recurrence 


A,, _ nA,,-1 + A,,_»- (1) 


If K, and J,, denote the usual modified Bessel functions, then (see G. N. Watson, A 
Treatise on the Theory of Bessel Functions, 2nd edition, 1966, pp. 79-80) 


Kyailx) = Ky(x) + Ky (2), (2) 


Iyal) = ——"Dy() + Ty al), (3) 
Kyl) IC) + Kylx)Igaa(et) =~. 


(4) 


From (2) and (3), we see that A, = K,,,(2) and A, =(—1)"*'J,,,(2) are 
solutions of (1). Using (2)-(4) to check the initial conditions, we find that, for 
n> 1, 


P, = 2(—1)"*"K,(2)1,,41(2) + 21,(2) K,,41(2) 
Q,, = 2(-1)"Ko(2)1,41(2) + 2Lo(2) Knsi(2). 


From the definition of J,(2) [ibid, p. 77], we see that [,(2) tends to 0 as n tends to 
oo. Furthermore [ibid, p. 80], K,.,(2) ~ jn! as n tends to oo. This yields the 
asymptotics claimed for P,, Q,. 
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Solution II by Robert B. Israel, University of British Columbia. For any fixed k 
we show that D and Q,, have the asymptotic expansions 


P, ~ p y (-1)/(n -j)!/j! = pni{1— n+ [2n(n- 1] *- «+f, 
j=0 
k . 
0,~4 (-1)’(n- si, 
j=0 

where p and g are constants. More precisely, we show that if n>k, then 
pruiio(-1)(n — j)!/j! is greater than P, if k is even and less than P, if k is odd. 
A similar result holds for Q,,. 

Let b, = P,/n! or Q,/n!. The recurrence yields 
b,—2 


+ — 
n n—-1 n(n — 1) 


Since b,, b, => 1, we obtain by induction 1 < b,_, < b, for all n = 3. Thus 


1 
n(n - Sway} < OKT 2 


<bT] 1 + 


j=3 


b < {1+ 


~~ ft 
‘Ga 


=) 
Ty | SO 
i(j- 1) 
since XL j(j — 1)]-* converges. Therefore b,, = lim 
all n 

Let b, = b, (24.9(-1)(n — i)! /(i!n!) + R(n, k)) for k < n, so that the desired 
inequality i is (—1)*R(n, k) < 0. We prove this by induction on k. Note that it is 
true for k = 0 because b, < b,. Assume that (—1)*R(j, k) < 0 for all j > k. For 
n>k-+1, we write 


by — Os. 
b 


oo j=n oa) 


b, exists, and b, < b, for 


n-oun 


| 
| 
418 
Ss 
ui 


ee ee rit Dt | RUGi- 1K) 
L5G+b, ~ PX G+ Db! * GGFD | 

— & (=I & G-i- VD! & RU-1,4) 

~ ha he Gap 7 Hap 


The interchange of summations is justified by absolute convergence. For fixed i, the 
sum L°_,(j —i- l!/(j + D! is telescoping, because 


(j-i- 1)! 1 {(j-i-1)! (j-1)! 
“GFP itil jG FD! 
Thus we obtain 
On pe (oD (msi at © R(j-1,k) 
b. %m (+t! —, J(jt+1) 
k+1(—1)'(n—-i)! @ R(j—-1,k) 
~ int GED) 
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If we replace b, by the formula defining R(n, k + 1), namely 
k+1 (—1)'(n ~ i)! 


b,=b, > 7 - + R(n,k +1), 
=0 i'n! 
we obtain | ' 
o R(j-1,k 
R(n,k+1)= - y ———_————— 
\ jan JCA + YD) 


Hence the sign of R(n, k + 1) is opposite to the common sign of R(n — 1,k), 
R(n, k),..., which completes the proof. 


Solved also by P. Bracken (Canada), D. Callan, L. L. Foster, E. Grosswald, S. Heller, K.-W. Lau 
(Hong Kong), A. M. Mercer (Canada), W. A. Newcomb, C. S. Sutton, the Howard University Problems 
Group, and an anonymous contributor. R. K. Guy and Len Bos remark that the sequences { P,} and 
{Q,,} are given as sequences 704 and 1151 in N. J. A. Sloane’s Handbook of Integer Sequences, Academic 
Press, 1973. 


Equilateral Triangles Inscribed in Closed Curves 


E 3273 [1988, 555]. Proposed by Orrin Frink, Pennsylvania State University. 


Give appropriate conditions under which a simple closed curve in the plane 
contains three points that form the vertices of an equilateral triangle. For the 
corresponding problem for the square, see Problem 4325 [1949, 39; 1950, 423] and 
Victor Klee, “Some unsolved problems in plane geometry”, Math. Magazine 
52(1979), 131-145. 


Solution by Albert Nijenhuis, Seattle. No extra conditions are necessary. Let C be 
a simple closed curve with interior S and exterior S’. Given any points P,Q € C, 
let ¢(P, Q) be the third vertex of the equilateral triangle with base PQ that lies on 
the left of the ray PQ. If P,Q are chosen at maximum distance, then either 
o(P,Q) € C, solving the problem, or (P,Q) € S’. Since @ is continuous, it 
suffices to show there exist P, Q such that ¢(P, Q) € S. Choose R € S, and obtain 
D by rotating C through 60 degrees about R. Since C and D contain the same area, 
neither can lie inside the other, so there is a point O& CN D. If P € C 1s its 
pre-image under the rotation, then R = $(P, Q), as desired. 


Editorial comment. Mark Meyerson points out that the problem was solved by 
A. N. Milgram in “Some metric topological invariants,” Reports of a Mathematical 
Colloquium (Univ. of Notre Dame), second series 5-6(1939-1948), 25-39, where he 
also generalized the result to closed curves in any metric space. In “Equilateral 
triangles and continuous curves,” Fundamenta Mathematicae 110(1980), 1-9, 
Meyerson shows that all but at most two points of a simple closed curve in the plane 
are vertices of such a triangle, and that for any fixed triangle A, every simple closed 
curve in the plane contains the vertices of a triangle similar to A. He also notes that 
the generalization of the square problem to three dimensions is false (MONTHLY 
Problem 6385 [1982, 279; 1983, 651]). Although Jerrard proved that a smooth simple 
closed curve in the plane has an inscribed square (Trans. Amer. Math. Soc., 
98(1961) 234-241), the existence of an inscribed square in a general simple closed 
curve in the plane is still open and is quite difficult. The solution printed for 
Problem 4235 [1950, 423] is not adequate. 


Solved also by O. P. Lossers (The Netherlands), M. D. Meyerson, and Victor Pambuccian. 
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Sufficient Conditions for Ring Commutativity 


E 3275 [1988, 555]. Proposed by Thomas J. Laffey, University College, Dublin, 
and Desmond MacHale, University College, Cork, Ireland. 


(a) Suppose R is an associative ring, not necessarily with unity, in which every 
element is a square. If squaring is a ring homomorphism, prove that R is commuta- 
tive. 

(b) Suppose R is an associative ring and suppose there is a function f from R 
onto R which satisfies f(x + y) = f(x) + f(y) and f(xy) = (xy)? forall x, y € R. 
Must R be commutative? 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. (a) Note that x* + y?=x?4+ xy + yx + y?=(x+4+ y)* if and only if xy = 
—yx. Thus squaring is an additive group homomorphism if and only if R is 
anti-commutative. If every element is a square, anticommutativity implies commuta- 
tivity. 

(b) R need not be commutative. An elementary counterexample is the set of all 
sequences of integers with the usual termwise addition, and with multiplication 
defined by 


(X1, X55 X3y--- (Nps Yoo Van--+) = (Xo y3,0,0,...). 


This ring is not commutative, but the function f defined by f(x, x5, x3,...) = 
(X5, X3, X4,...) has the specified properties. 


Editorial comment. Many readers noted that the hypothesis of (a) can be weak- 
ened by deleting the requirement that f be a multiplicative homomorphism, and 
that the conclusion can be strengthened by adding that the ring has characteristic 2. 
Readers also noted that in (b) commutativity would be implied by a variety of 
additional hypotheses on R. 

Complete solutions were submitted also by S. F. Barger, F. J. Flanigan, A. A. Jagers (The 


Netherlands), C. Lanski, and O. P. Lossers (The Netherlands). Part (a) was solved by 15 readers and the 
proposers. 


Closed Sets in a Special Topological Space 


E 3280 [1988, 655]. Proposed by Peter J. Ferraro, Roselle Park, NJ. 


Let (X, T) be a topological space such that { p }’ is closed for every point p € X. 
Prove that A’ is closed for every subset A of X. Here A’ is the set of points x € X 
such that every open set containing x contains a point of A — {x}. 


Solution by José Heber Nieto, Universidad del Zulia, Maracaibo, Venezuela. We 
show that A” C A’ for every A C X. Let z be a point in A” and U be an open set 
containing z. Then V = U — {z}’ is also open and contains z, since {z}’ is closed 
and z € {z}’. Therefore, there is a point x € A’ such that x © V — {z}. Since 
x € {z}’, there is an open set W such that x € W and z € W. Then WN V (and 
therefore U) contains a point of A — {z}. 


Editorial comment. Many readers noted that the problem appears as Problem D 
in Chapter 1 of J. L. Kelley's General Topology, Van Nostrand, 1955. Kelley 
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attributes the problem to C. T. Yang. The result of the problem also appears as 
Theorem 14.5 in W. J. Thron’s Topological Structures, Holt, Rinehart, and Winston, 
1966. Peter Landweber notes that it is customary to call A’ the derived set of A and 
its members the accumulation points of A, so that the statement becomes “the set of 
accumulation points of every subset is closed if and only if the set of accumulation 
points of each singleton set is closed.” Stephan Carlson proved the stronger result 
that these properties are equivalent to a third property: Every singleton subset of X 
is the intersection of a closed set and an open set. 


Solved also by the proposer and about 40 other readers. 


Valence of Analytic Functions 


E 3295 [1988, 953]. Proposed by A. W. Goodman, University of South Florida, 
Tampa. 


A function f on the open unit disc EF = {z: |z| < 1} is said to be univalent if it is 
analytic on FE and takes no value more than once. A function f on E is said to be 
infinite-valent if it is analytic on E and takes some value infinitely often. 

(a) Is there a univalent function f on E such that /f’ is nonconstant and 
infinite-valent on EF? 

(b) Is there an infinite-valent function g on E such that g’ is univalent on E? 


Solution by Kee-Wai Lau, Hong Kong. We answer both questions in the affirma- 
tive. An elementary consideration of the range of the principal branch of log(1 + z) 
shows that f(z) = (1 + z)'*! is univalent, f(z) = g(z) = (1 + i)(1 + z)! is infi- 
nite-valent, and f’(z) = g/(z) =(—1+ i) + z)‘“! is univalent. That f and 9’ 
are univalent is a special case of a lemma (A. W. Goodman, Univalent Functions, 
Vol. IT (Polygonal Publishing Company, Inc., 1983), Chapter 15, Lemma 1), stating 
that for 8 # 0, the function (1+ z)* is univalent on E if and only if either 
|8 — 1) <1 or |8 + 1| < 1. That f’ and g are infinite-valent follows from the fact 
that 


— e7 = g(- en = i)e(— 27”) = i 
f'( 1+ oan) 2g( 1+ oan) (1 4 ) i(—2an) 1+ 
for n = 1,2,3,.... 
Solved also by Wancang Ma and the proposer. 
ADVANCED PROBLEMS 


6622. Proposed by Robert C. Dalang, Ecole Polytechnique Fédérale, Lausanne, 
Switzerland. 


Does there exist a Lebesgue measurable function f:[0,1] — [0,1] which is positive 
on a set of positive measure and which satisfies 


ore) —]l h 4 h 1/2 
yy > [fe  resya f° /(x)ax| < +00? 
_ 2 ( 


j/2" 2j+1)/2" 
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6623. Proposed by Ernesto Bruno Cossi, Universidade Federal do Rio Grande do 
Sul, Porto Alegre, Brasil, and the editors. 


Let R(n) denote the number of ways of expressing the positive integer n as a 
sum of two squarefree positive integers relatively prime to n. Is it true that 
R(n) > c $(n) for some positive constant c, where @ denotes the Euler function? 


6624. Proposed by K. A. Ross, University of Oregon, Eugene. 
(i) Show that there is a subset A of the plane R’ that intersects every straight 
line in exactly two points. 
(11) Can the set A be selected to be Lebesgue measurable? 
(ii1)* Can the set A be selected to be a Borel set? 


SOLUTIONS OF ADVANCED PROBLEMS 
Vanishing “Triangular” Functions 


6526 [1986, 659]. Proposed by Grzegorz Rzadkowski, Agricultural University in 
Warsaw, Poland. 


Let f be a continuous real-valued function defined on the interval [0, L]. If there 
is an equilateral triangle ABC of sidelength ZL such that 
f(AP) + f( BP) + f(CP) = 0 
for all points P inside the triangle, prove that f is identically zero. 
Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Without loss of generality we take L = 1. It is easy to give a noncontinuous 
function on [0,1] that has the desired property for all points P in the closed 


triangle: define f(0) = 1, f(1) = — 4, and f(x) = 0 for 0 < x < 1. We shall prove 
the following generalization of the assertion of the problem. 


THEOREM. Let f be any function defined on the open interval (0,1) such that 
f(AP) + f(A2P) + f(A3P) = 0 (1) 


for all points P inside the equilateral triangle A,A,A, of side 1. Then f is identically 
zero (and hence continuous ). 


— 
Proof. Let M be the centre of mass of the triangle. Then all vectors MA, have 
length (1/3)V¥3 and subtend angles 27/3. So, by the cosine rule, we have 
2ka 


A,P* =4+ MP*- 31 MP cos| » + =| (k = 1,2, 3) 


for some q. Elimination of MP and yields 
)°A,P4— > A,P?-A,P? — )A,P? +1=0. (2) 
k k<l k 

It is useful to indicate an interior point P of the triangle by its triple of squares 


P: (m4, %, 7) = (A,P?, A,P?, A,P?). (3) 
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For example, M is indicated by M: (3, 3, 3). In general, we have by (2) for any P 
the relation 


2 2 2 
Ty + Ty + Ty — 1M, — 11; — 11, — 7, — 1 —~7,+1=0. (4) 


Consider any ordered 4-tuple of points, [P, Q, R, S] such that 4,0 = A,R, A,P = 
A,R, A,Q = A,S, A,;P = A,Q, and A,R = A,S. (See Fig. 1). Such a 4-tuple will 


A, 


Fic. 1 


be called “ pentacyclic.”’ Then, using (1), we find by elimination that 

f(A:P) — 2f(4,Q) + f(A,S) = 0. (5) 
A simple, important consequence of (5) is the fact that f(x) is constant (and hence 
0) when it is constant on some interval. 

Such an interval will be given in the sequel. 

We shall construct a continuously variable pentacyclic 4-tuple [P, Q, R, S](t) 
(0 <¢t <1) such that P(t) and Q(t) (and hence R(t)) move along circles with 
centre A,, but such that the distance A,S(t) is non-constant (and therefore runs 
through an interval of positive length, whereas f(A,S) remains constant). 

Let [P, QO, R, S] and [T, M, U,V] be two special pentacyclic 4-tuples: A,P = 
A,P, A,Q = A,M (= (1/3)y3), and A,P = A,T (see Figure 2). 

Assume that 4,S = A,V. 


By symmetry, the triple square coordinates of P, Q, R and S can be written in 
the form 


Q:(3,3+4,5—-8), R: (3,3 —-B,5 +4) (6) 
P:(4+y7,3-B,4-8), S:($-84+a,44a). 
Also by symmetry we have 
AV = A,U = AT = A,P. 
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A, 


Fic. 2 


This implies the following triple square coordinates for T, U, and V: 
T(S+ yes) U (edt Ve E-as sty). 


Here the missing coordinates * both have to be equal to 1/3 — 6, since two 
coordinates of a point uniquely determine the third coordinate. All points given by 
(6) and (7) must satisfy (4), so 


a? + a8 + B2-a+ B=0 (from Q and R) 
B7+2By+y7+2B-y=0 (from P) (8) 
y2 + 78 + 82—-y+5=0 (from T, U and V ) 


6° + 26a + 07+6-—2a=0 (from S$). 


We are interested in solutions (a, B, y, 5) # (0,0, 0,0). Elimination of B and 6 from 
(8) forces two resultants R,(a, y) and R,(a, y) to be zero: 


1 at+l1 a’—-a 0 
0 1 at+1l a-a 
Ri(a, 7) = 1 2y+2 y’-y 0 ) 
0 1 2y+2 y’-y¥ 
(9) 
1 ytl y-y 0 
1 +1 * 
R,(a, ) =|° y y y 
1 2a+1 a’*—2a 0 
0 1 2a+1 a’*—-2a 
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It is immediately verified that both R,(a, y) are quartic forms in a and y together, 
and that 
R,(1, y) = y°(y - 1)(y- 5), 
whereas 
R,(1,y) = y* — 5y? + lly? — Ty + 3. 


Thus the four roots of R,(1, y) = 0 are all distinct from the four roots of R,(1, y) 
= 0, 1e., the quartics R,(a, y) = 0 and R,(a, y) = 0 do not have a component in 
common. 

In other words (a, 8, y, 5) = (0,0,0,0) is an isolated solution of (8), so our 
assumption that A,S = A,V is false, provided that P is sufficiently close to M. 

Therefore, for P sufficiently close to M, the pentacyclic 4-tuples [P, Q, R, S] and 
[T, M,U,V] can be used as the endpoints of a continuously variable 4-tuple 
[P, QO, R, S\(t), where 0 < ¢ < 1. The result follows. 


Also solved by the San Bernardino Problem Solving Group, Aage Bondesen (Denmark), and David 
C. Kay. 


The Harmonic Conjugate of an Algebraic Function 


6567 [1988, 150]. Proposed by Gerald A. Edgar, Ohio State University, and Lee A. 
Rubel, University of Illinois at Urbana-Champaign. 


Let us say that a function u(x, y) is algebraic if there exists a nontrivial 
polynomial P in three variables such that P(x, y, u(x, y)) = 0. Suppose u(x, y) is 
a harmonic and algebraic function on a region G in R*. Must its harmonic 
conjugate v also be algebraic? 


Solution by the proposers. Yes, v must be algebraic. First we show that f(x + iy) 
= u(x, y) + iv(x, y) is algebraic as a function of z = x + iy. Write Z = x — iy and 


u(z,z)=u(x,y), 6(2,z) =v(x,y). 


If f(z) = L%_,a,z”" (suppose for convenience that (0,0) € G and that f(0) = 0) 
then 


f(z) = La,2" 


and 
1 —_ 
i(z,z) = 5 | Lane” + ya,2"]. 
Now for some nontrivial polynomial Q, we have 


Q(z, z,)0a,2" + b 4,2") = 0. (*) 


Suppose for simplicity that Q(z, Z, w) has no factor z“, k > 0. In (*), set Z = 0 to 
get O(z,0, f(z)) = 0. Hence f(z) is algebraic, as claimed. 

We need some explanation of the above directions “set z = 0.” It is well known 
that u(x, y) is represented by a convergent series of powers of x and y near (0, 0). 
Thus we may consider x and y as complex variables, and then consider u(x, y) on 
the set {x = iy}. 

We conclude, from the just-established fact that f(z) is an algebraic function of 
z, that it is also an algebraic function of (z, z). From this it follows easily that f(z) 
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is an algebraic function of (z, Z). But, since the difference of any two algebraic 
functions is algebraic, we see that 


2id(z, Zz) = f(z) — f(z) 
is an algebraic function of (z, Z), and consequently that 6(z, Zz) is an algebraic 
function of (z, z). Hence v(x, y) is an algebraic function of (x, y). 


No other solution was received. 
Infinite Cardinals 


6569 [1988, 264]. Proposed by Wilson Castrellon and Samuel Camargo, Universi- 
dad de Los Andes, Bogota, Colombia. 


Show that for every infinite cardinal 4 there exists a cardinal k > A, a set A of 
cardinality x, and a family F of subsets of A such that card( Ff’) = 2" and XN Y is 
finite for all X,Y € F, X # Y. 


Solution by Fred Galvin, University of Kansas, Lawrence. We first claim there is a 
cardinal « > A such that 


K®o = 2K, 
Choose 
ne a Le ae 
By routine cardinal arithmetic, 
K®o < (2")"° = 2" %o = 2" 
and 
en a a eae ee 
The claim follows. Now let S be a set of cardinality x, let A be the set of all finite 
sequences of elements of S, and B the set of all infinite such sequences. Each £ in 


B corresponds in a natural way to a subset of A, namely the set A() of all initial 
segments of 8. In other words, if 


B= (x1, X5, X3,...) EB 


then 
A(B) = { (x), (%, Xo), (4, X2, X3),-+-} CA. 
If 
F={A(B): BEB}, 
then 


card( F') = «®o 
and « has been chosen so that this is 2°. Finally, 8, # 8, implies 
A(B,) 1 ACB) 
is finite. 
All solutions received were essentially the same; that of Dennis Burke described a construction of the 
above type in the language of topology. 


Also solved by Andreas Blass, Charles Lanski, James C. Owings, Jr. & Philip W. Steitz, Istvan Szalkai 
(Hungary), and F. P. Weber (Canada). 
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Representations of *-Algebras, Locally Compact Groups, and Banach *-Algebraic 
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XVili + 1486 pp. 
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Groups provide the language for describing symmetries. In this role they act as 
groups of automorphisms of mathematical objects. (One thinks of Galois groups in 
algebra, of Lie groups in geometry, and of the many symmetry groups of physics). 
Accordingly, it is important to understand the structure of actions of groups. 
Foremost actions are those on vector spaces (that is, linear actions, or “representa- 
tions,” as they are more commonly called), because any action can be linearized, by 
considering the corresponding action by translation on the vector space of scalar- 
valued functions on the object. There is by now a vast literature concerned with 
representations. 

Frobenius, Schur, and others first studied finite groups acting on finite-dimen- 
sional vector spaces around the beginning of this century. This theory is by now 
highly developed. Soon Weyl and others saw that the theory of continuous represen- 
tations of compact groups, such as the group of rotations of Euclidean space, is 
quite similar to that for finite groups; and so this theory too is now well understood. 
But for noncompact groups (even discrete ones) a sharp dichotomy exists. On the 
one hand, there are the representations by unitary operators on Hilbert spaces. 
These have the crucial property that if a subspace of the Hilbert space is invariant 
(that is, carried into itself by the actions of all the elements of the group), then so is 
its orthogonal complement, and the original representation is then the direct sum of 
these two subrepresentations. Thus one can hope that unitary representations can be 
decomposed in some relatively transparent way into the “simple” pieces consisting 
of the irreducible representations (those with no invariant subspaces), so that the 
theory is “semisimple.” The main task is then to determine the irreducible represen- 
tations. 

On the other hand, if one considers nonunitary representations, then even when 
one can determine the irreducible representations, one is still left with the often 
bewildering problem of understanding all the ways in which irreducible representa- 
tions can be combined to obtain more general representations. For the group Z of 
integers and its representations on finite-dimensional complex vector spaces, this 
leads to the Jordan canonical form for matrices; while for the groups Z” for n > 3 I 
believe that the situation is still poorly understood. For this reason, while at present 
the theory of unitary representations is very well developed (even though for many 
groups it is essential that one work with infinite-dimensional Hilbert spaces), the 
theory of nonunitary representations (even just on finite-dimensional vector spaces) 
consists of relatively fragmentary pieces, with few unifying principles. 


167 


168 MARC A. RIEFFEL [February 


Even for unitary representations, the theory is well developed only for those 
groups whose topology is locally compact. The basic reason for this is that for 
groups which are not locally compact one is often hard pressed to exhibit any 
nontrivial, continuous unitary representations at all. But for locally compact groups 
Haar, in 1933, showed that they possess an analogue of Lebesgue measure, namely, 
a positive Borel measure which is invariant under left translation; then one sees that 
any locally compact group actually has a faithful unitary representation, consisting 
of translation on the Hilbert space of functions on the group which are square-inte- 
grable for this Haar measure. A few years later, Weil showed that locally compact 
groups are the only ones to possess such a Haar measure. Nevertheless, more 
recently the needs of various subjects have led to much discussion of the unitary 
representation theory of certain classes of groups which are not locally compact 
(loop groups, gauge groups, diffeomorphism groups—see [5] and the references it 
contains). But this theory is still in its infancy, with few general results and unifying 
principles. 

The literature concerning unitary representations of locally compact groups splits 
to some extent into two parts. The larger part consists of treatments of particular 
classes of groups (say, nilpotent Lie groups), with the aim of obtaining explicit 
classification and construction of their irreducible representations, or of describing 
explicitly how various naturally occurring representations decompose into irre- 
ducible representations. This subject can be extremely rich in detail (for example, in 
the case of semisimple Lie groups). 

The second, and smaller, part of the literature consists of discussions of general 
methods and principles. Perhaps the first striking result in this category was the 
demonstration by Gelfand and Raikov in 1943 that every locally compact group has 
a separating family of irreducible unitary representations (with a necessarily non- 
constructive proof, since so little is assumed about the structure of the group). This 
general theory is now highly developed. It receives its first thorough modern 
exposition in the two-volume work under review (though [1] comes somewhat close, 
and contains a broad collection of examples and topics). 

The two volumes differ somewhat in character, with the first volume, entitled 
Basic Representation Theory of Groups and Algebras, being devoted to the more 
foundational material. In the theory of representations of finite groups, one of the 
first steps is to introduce the group algebra of the group, consisting of the 
scalar-valued functions on the group, with convolution as product. This is useful 
because the representations of the group correspond exactly to the representations 
of the algebra, while the two operations in an algebra allow much more flexibility of 
manipulation than one has with the group alone. When one studies representations 
of locally compact groups, it is again useful to introduce an appropriate group 
algebra having the universal property that its representations correspond exactly to 
the continuous unitary representations of the group. This consists of the continuous 
functions of compact support with convolution as product (making another essential 
use of Haar measure), with an involution corresponding to the map x — x7! at the 
group level, and then completed with respect to a norm which comes from the 
unitary representations. The result is a C*-algebra, that is, a Banach algebra over 
the complex numbers, with involution, whose norm satisfies the innocent-looking 
identity ||a*a|| = ||a||’. (This identity has remarkable consequences; notably, ac- 
cording to a famous theorem of Gelfand and Naimark from 1943, it exactly 
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characterizes the Banach *-algebras which have faithful representations as norm- 
closed self-adjoint algebras of operators on a Hilbert space.) One is thus led to the 
study of C*-algebras and their representations. Such study is more generally useful, 
because C*-algebras arise in a number of related contexts, ranging from quantum 
Statistical mechanics, to foliations of manifolds, to dynamical systems. The first 
volume consists of a thorough and leisurely exposition both of the basic theory of 
locally compact groups, and of C*-algebras and their representations, with emphasis 
on those aspects most germaine to the unitary representation theory of groups. 

The second volume is more specialized in character. The most important general 
construction in the representation theory of groups is that of induced representa- 
tions. This construction takes a representation of a subgroup of a group, and 
constructs from it a representation of the group itself. For finite groups, if one views 
matters in terms of group algebras, this is nothing more than a familiar change-of- 
rings construction, by which one takes a module over a subalgebra and constructs 
from it a module over the algebra. Induced representations for finite groups were 
introduced by Frobenius at the turn of the century (see [3] for a nice description of 
the history); while for locally compact groups the general theory of induced 
representations was developed by Mackey in a series of influential papers in the 
1950s (though Wigner’s 1939 paper on the representation theory of the Poincaré 
group was a vital precursor). 

The theory of induced representations gives its most definitive results when one is 
analyzing what happens in the presence of a normal subgroup, N, of the group in 
question, say, H. Let G = H/N. Then the action of H by conjugation on N gives 
an action of G on the isomorphism classes of irreducible representations of N. If 
this action is sufficiently well-behaved, there is an algorithm, using induced repre- 
sentations, for constructing the irreducible representations of H in terms of those of 
N and of the representations of the stability subgroups in G for the isomorphism 
classes of irreducible representations of N. This algorithm is often fondly referred to 
as “the Mackey machine.” 

The second volume is primarily devoted to the theory of induced representations, 
culminating in a thorough treatment of the Mackey machine. Actually, the senior 
author, in a series of earlier research papers, had shown that it is very natural to 
develop the Mackey machine in a more general context, in which the quotient group 
G above is retained, but H and N are replaced by a more general structure called a 
Banach *-algebraic bundle over G. It is in this more general context that the authors 
develop the Mackey machine, hence the second volume’s title, Banach *-Algebraic 
Bundles, Induced Representations, and the Generalized Mackey Analysis. 

Actually, if one looks at the extensive literature concerned with classifying the 
irreducible representations of specific classes of groups, one finds that much of it 
does not mention C*-algebras at all (though various kinds of induced representa- 
tions are ubiquitous). The reason is that there is yet another dichotomy in the 
subject. Many groups have a quite tractable unitary representation theory. For 
historical reasons they are said to be “of type I.” (Nilpotent and semisimple Lie 
groups are of type I.) The theory of C*-algebras is not essential for studying the 
unitary representation theory of type I groups (though it still can provide a very 
convenient framework). But groups not of type I have a badly behaved representa- 
tion theory—there is no middle way. (Many solvable Lie groups, and most infinite 
discrete groups are not of type I.) There is ample evidence that the irreducible 


170 MARC A. RIEFFEL [February 


representations of nontype-I groups are essentially unclassifiable. Accordingly, there 
is little literature concerning their representation theory and, in recent years, as the 
needs of other subjects have made it increasingly important to deal with nontype-I 
groups, attention has necessarily shifted away from their representation theory and 
towards the group C*-algebra itself. By now, this has permitted a number of 
important advances, ranging from Pukansky’s treatment of nontype-I Lie groups 
over a decade ago [6], to the very recent successes with the Novikov conjecture [4, 2], 
where the effects of the fundamental group of a manifold are controlled by using its 
group C*-algebra. The two volumes do not stress the nontype-I aspects (although 
examples are given), but the modern treatment which they provide should prove 
useful in the further development of the nontype-I theory. 

Finally, let me comment that these volumes have been prepared with great care. 
The exposition is clear and thorough. At the end of each chapter there is an 
extensive collection of exercises and a brief historical summary, and each volume 
concludes with an extensive bibliography, name index, subject index, and index of 
notation. 
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General, S, P**. Speaking Mathematically: Com- 
munication in Mathematics Classrooms. David 
Pimm. Routledge, Chapman & Hall, 1987, xvii + 
217 pp, $14.95 (P). [ISBN: 0-415-03708-5] Very in- 
teresting study of how students perceive mathemat- 
ics, especially its language. The primary thesis is 
that students cannot think effectively (in mathemat- 
ics) until they have developed what the author calls a 
“mathematical register,” that is, a reservoir of terms 
with their meanings developed in some sort of con- 
text. The author claims, and documents, that many 
if not most students learn about the form but not 
the meaning of mathematics. This book should be 
of interest to anyone who is wondering what students 
are learning in their classroom. MW 


General, P, L*. Shiing-shen Chern: Selected Pa- 
pers, Volume IJ-IV. Shiing-shen Chern. Springer- 
Verlag, 1989. Volume II, xxx + 444 pp, $45 [ISBN: 0- 
387-96816-4]; Volume III, xiv + 504 pp, $49 (ISBN: 0- 
387-96817-2]; Volume IV, xiv + 462 pp, $45. (ISBN: 
0-387-96820-2] A complement to Volume I (TR, 
March 1979) which contained a selection of shorter 
and less-accessible papers, these three volumes con- 
tain major papers arranged chronologically: 1935- 
1965, 1965-1981, 1981-1987. Includes a complete 
Chern bibliography and (in Volume IJ) a recently 
revised scientific autobiography of Chern. LAS 


General, P*. Symbolism: Sculptures and Tapestries 
by John Robinson. Ronnie Brown, John Robin- 
son. Mathematics and Knots (School of Mathe- 
matics, U. of Wales, Bangor, Gwynedd LL57 1UT, 
Wales, UK), 1989, 36 pp, £6 + £1 surface, £250 
air (P). (ISBN: 0-9514947-08] An exhibit catalogue 
prepared by mathematician Ronnie Brown to accom- 
pany a display of mathematically-inspired sculptures 
by John Robinson that have been assembled for the 
Pop Maths Road Show that opened in September 
1989 in conjunction with the ICME workshop at 
the University of Leeds on popularization of math- 
ematics. Trefoil knots, Mobius strips, tori, and 
other mathematical forms come to life in four-foot 
bronzes. LAS 


General, S*, L***. The Penguin Dictionary of 


Mathematics. Eds: John Daintith, R.D. Nelson. 
Penguin Books, 1989, 350 pp, $7.95 (P). [ISBN: 0-14- 
051119-9] A student’s delight—virtually every term 
a college mathematics student would be expected to 
know (and lots more too) in a compact 8 oz. paper- 
back. 2800 definitions, including 200 one-paragraph 
biographies. Extensive cross references signalled by 
asterisks by keywords helps reduce repetition. No 
computer terms, but does cover basic statistics, al- 
gebra, analysis, logic, topology, number theory, and 
other subjects from school and college mathemat-" 
ics. LAS 


General, S. NYSML-ARML Contests, 1983-1988. 
Gilbert Kessler, Lawrence Zimmerman. NCTM, 
1989, x + 148 pp, $18 (P). [ISBN: 0-87353-278- 
3] Problems and solutions from annual high school 
mathematics competitions, perhaps the most pres- 
tigious of the high school contests. A reader cannot 
help but be caught up in the excitement and intellec- 
tual energy represented in these high-quality prob- 
lems. Readers at all levels can use these problems 
to pique their creative instincts. A valuable resource 
which has much to offer toward developing excellence 
in mathematics at the high school level. A useful In- 
dex covers all fifteen years of the contests. LCL 


General, L*. Izrail M. Gelfand: Collected Papers, 
Volume III. Ed: S.G. Gindikin, et al. Springer- 
Verlag, 1989, x + 1075 pp, $149.50. [ISBN: 0-387- 
19399-5] Final installment in the Gelfand Selecta 
(Volume I, TR, December 1988; Volume II, TR, 
March 1989). This volume features papers on in- 
tegral geometry, cohomology, and hypergeometric 
functions. It also includes samples of Gelfand’s many 
research papers in cell biology (not mathematical 
models, but biological research), and some work in 
information theory and cybernetics. LAS 

General, S*, L**. Mathematical Carnival. Martin 
Gardner. MAA, 1989, 297 pp, $15. [ISBN: 0-88385- 
448-1] Reprint of a 1975 Alfred A. Knopf anthology 
of Scientific American columns with a new introduc- 
tion by John Conway, a new preface by Martin Gard- 
ner, and recent updates. This sixth of Gardner's col- 
lected columns is the first of several to be reprinted 
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by the M.A.A. Topics include sprouts, superellipses, 
trisection, Escher, and many puzzles. A superb class- 
room supplement to awaken bored students. LAS 


Elementary, T. Mathematics: Its Power and Util- 
ity, Third Edition. Karl J. Smith. Brooks/Cole, 
1989, xiii + 578 pp. [ISBN: 0-534-11538-1] Gives 
student “who has previously not been successful 
with mathematics a fresh and innovative approach to 
arithmetic, beginning algebra, and geometry.” The 
Preface gives no indication of changes made in this 
edition. (First Edition, TR, June-July 1983; Second 
Edition, TR, November 1986.) JNC 


Precalculus, T*(13: 1). College Algebra and 
Trigonometry: A Graphing Approach. Franklin De- 
mana, Bert K. Waits. Addison-Wesley, 1990, xvi 
+ 907 pp, $39.75 [ISBN: 0-201-52810-X]; Graph- 
ing Calculator and Computer Graphing Laboratory 
Manual: Precalculus Series. Alan Osborne, Gre- 
gory D. Foley. Addison-Wesley, 1990, 116 pp, $6.50 
(P). [ISBN: 0-201-50861-3] New interactive instruc- 
tional approaches permit a focus on problem solv- 
ing themes such as business, economics, and sci- 
ence. Traditional algebraic topics arise from the con- 
text of mathematical models. Uses computer and 
calculator-based technology to involve students in 
conjecturing, hypothesis testing, and data analysis. 
The text and lab manual provide the resources for 
implementing needed changes in mathematics edu- 
cation. KT 


Education, S**, P**, L**. Historical Topics 
for the Mathematics Classroom. John K. Baum- 
gart. NCTM, 1989, xv + 542 pp, $28 (P). (ISBN: 
0-87353-281-3] A collection of classroom capsules 
on the history and development of mathematics. In- 
cludes chapters on computation, algebra, geometry, 
trigonometry, calculus and “modern” mathematics. 
Originally published in 1969 as the Thirty-first Year- 
book of the NCTM. Changes from the original edition 
included updated bibliographic references and a new 
chapter on contemporary mathematics. All teachers 
of mathematics at all levels should have this book 
(students too!). SB 


History, T*(15-16), S*, P, L*. Mathematics 
and Its History. John Stillwell. Undergrad. Texts in 
Math. Springer-Verlag, 1989, x + 371 pp, $49.80. 
[ISBN: 0-387-96981-0] Written from notes for a 
senior-level course, this book “aims to give a unified 
view of undergraduate mathematics by approaching 
the subject through its history.” Mathematical ex- 
ercises throughout, ranging from computational to 
theoretical. Topics: Greek geometry and number 
theory, infinity, polynomial equations, analytic and 
projective geometry, calculus, infinite series, elliptic 
functions, mechanics, complex numbers in algebra, 
complex functions, differential and non-Euclidean ge- 
ometry, group theory, topology, and sets, logic and 
computation. Excellent bibliography. Assumes ba- 
sic calculus, algebra, and geometry as well as some 
exposure to group theory, topology, and differential 
equations. SB 


History, S, L*. Makers of Mathematics. Stuart 
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Hollingdale. Penguin Books, 1989, xiv + 433 pp, 
$10.95 (P). (ISBN: 0-14-022732-6] A selective, id- 
iosyncratic history of mathematics told through a 
series of sketches of the lives and accomplishments 
of famous men (no women) of mathematics: Euclid, 
Apollonius, Archimedes, Descartes, Fermat, Pascal, 
Newton, Leibniz, Euler, D’Alembert, Gauss, Hamil- 
ton, Boole, Dedekind, Cantor, Einstein. Based in 
part on a series of biographical essays published by 
the author in the British IMA Bulletin, the volume 
has been carefully edited for a general reader with no 
more than a good school background in mathemat- 
ics. A good mix of people and ideas that will appeal 
to many students. LAS 


History, P, L**. A Century of Mathematics in 
America, Part III. Ed: Peter Duren. History of 
Math., V. 3. AMS, 1989, ix + 675 pp, $75. [ISBN: 
0-8218-0136-8] Third and final volume containing 
historical papers written in honor of the AMS cen- 
tennial. This volume contains 35 papers includ- 
ing departmental histories (Hopkins, Clark, IAS, 
Columbia, MIT, Michigan, Texas); personal reminis- 
cences; topical histories; historical perspectives on 
probability, statistics, and actuarial science; and the 
history of the history of mathematics in America. In- 
cludes an informative analysis of American calculus 
textbooks of the nineteenth century. (Part J and II, 
TR, October 1989.) LAS 


History, T(13-16: 1), L. An Introduction to the 
History of Mathematics with Cultural Connections, 
Sizth Edition. Howard Eves. Saunders College, 1990, 
xviii + 775 pp, $38. [ISBN: 0-03-029558-0] Changes 
from the previous edition (TR, October 1983) include 
updatings, some new sections, expansion of some old 
sections, more attention to women in mathematics, 
and an improved bibliography. Also added are Cul- 
tural Connections written by Jamie H. Eves, which 
discuss the cultural setting of various eras in the his- 
tory of mathematics. (Third Edition, TR, October 
1969; Fourth Edition, TR, August-September 1976.) 
RH 


History, L*. Introduction to Analysis of the In- 
finite, Book II. Leonard Euler. Transl: John D. 
Blanton. Springer-Verlag, 1990, xii + 504 pp, 
$59. (ISBN: 0-387-97132-7] The theory of curves 
(or “higher-geometry” ), including classification of al- 
gebraic curves, curvature, intersections, tangencies, 
and transcendental curves. Includes an Appendix on 
surfaces. The second half of Euler’s Introductio in 
analysin infinitorum, here translated into English for 
the first time (for review of Book I, see TR, February 
1989). After 240 years, retains an amazing clarity of 
language and notation. LAS 


Logic, P. Computability in Analysis and Physics. 
Marian B. Pour-El, J. Ian Richards. Perspect. in 
Math. Logic. Springer-Verlag, 1989, xi + 206 pp, 
$65. (ISBN: 0-387-50035-9] Quantum theorists take 
note. There’s a tendency to think that once you 
know the operator, you can automatically compute 
the eigenvalues and eigenvectors—but it ain’t nec- 
essarily so. Modern mathematical logic has its say 
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in the matter: “With certain mild side-conditions, 
bounded operators preserve computability and un- 
bounded operators do not.” Moreover, even though 
the eigenvalues of a self-adjoint operator are com- 
putable, the sequence of all eigenvalues need not be 
computable. BC 

Graph Theory, T(16-18), S, P, L. Hypergraphs: 
Combinatorics of Finite Sets. Claude Berge. Math. 
Lib., V. 45. North-Holland (US Distr: Elsevier 
Science), 1989, x + 255 pp, $79. (ISBN: 0-444- 
87489-5] The theory of hypergraphs is a general- 
ization of graph theory wherein edges (e.g., pairs 
of vertices) are replaced with arbitrary sets of ver- 
tices. This generalization often leads to simplifica- 
tion, economy, and unification in understanding the- 
orems on graphs. This book presents the most signif- 
icant work, including many open problems. Topics 
include general concepts, transversal sets and match- 
ings, fractional transversals, colorings, and general- 
izations of bipartite graphs. LCL 


Combinatorics, P. Surveys in Combinatorics, 
1989. Ed: Johannes Siemons. London Math. Soc. 
Lect. Note Ser., V. 141. Cambridge University Pr, 
1989, viii + 217 pp, $27.95 (P). [ISBN: 0-521-37823- 
0] Papers by the principle speakers at the Twelfth 
British Combinatorial Conference, Norwich, April 
1989. LC 


Discrete Mathematics, T(13-14: 1), L. Learning 
Discrete Mathematics with ISETL. Nancy Baxter, 
Ed Dubinsky, Gary Levin. Springer-Verlag, 1989, 
xvii + 416 pp, $45. (ISBN: 0-387-96898-9] Stu- 
dents learn discrete mathematics by programming 
in ISETL (Interactive SET Language) which has 
syntax resembling mathematical language. Intent 
is more student participation (reading text on own, 
programming, solving problems), and less lecturing 
(spoon-feeding) by teacher. LC 


Discrete Mathematics, L*, P. Discrete Math- 
ematics in the First Two Years. Ed: Anthony 
Ralston. MAA Notes No. 15. MAA, 1989, 101 pp, 
$10 (P). [ISBN: 0-88385-064-8] Reports from six 
campuses—Colby, Delaware, Denver, Florida State, 
Montclair State, St. Olaf—that were funded by the 
Sloan Foundation in 1984-86 to experiment with var- 
ious ways of incorporating discrete mathematics into 
the standard first two-year curriculum. Opens with 
an essay by Tony Ralston—the godfather of the dis- 
crete mathematics movement—and concludes with a 
list of discrete mathematics texts and other recom- 
mendations from the MAA Committee on Discrete 
Mathematics. An excellent resource for any depart- 


ment seeking a proper role for discrete mathemat- 
ics. LAS 


Number Theory, P. Lecture Notes in Mathemat- 
ics-1383: Number Theory. D.V. Chudnovsky, et al. 
Springer-Verlag, 1989, 256 pp, $24.30 (P). [ISBN: 0- 
387-51549-6] Wide-ranging papers from the weekly 
number theory seminar held at the City University 
of New York. This, the fourth such volume, comes 
from talks given in 1985-1988. GG 


Number Theory, T*(13-15: 2), S*, L*. In- 
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troduction to Number Theory with Computing. 
R.B.J.T. Allenby, E.J. Redfern. Edward Arnold, 
1989, x + 310 pp, $22.95 (P). [ISBN: 0-7131-3661- 
8] Well-written introduction to elementary number 
theory which includes many computing applications. 
Contains numerous Basic programs and assignments 
to write your own. Lots of exercises. Does not, how- 
ever, replace rigor by calculations (e.g., proves the 
fundamental theorem of arithmetic for Z[i]). Each 
chapter includes one or two short historical sketches 
(Fibonacci, Hilbert, Dirachlet, Mersenne, Fermat, 
Lagrange, Legendre, Germain, Mordell, Euler, Ja- 
cobi, Gauss, Mébius, Pell). Standard contents plus 
the Gaussian integers and the RSA-cypher system. 
A good choice for a textbook. SB 


Number Theory, P. Lectures on the Geometry 
of Numbers. Carl Ludwig Siegel. Springer-Verlag, 
1989, x + 160 pp, $45. [ISBN: 0-387-50629-2] 
A reprint of Siegel’s 1945-46 New York University 
lecture notes as revised by K. Chandrasekharan. 
The fifteen lectures focus on three major themes: 
Minkowski’s theorems, linear inequalities, and reduc- 
tion theory. SG 


Number Theory, P. Irregularities of Partitions. 
Ed: G. Haldsz, V.T. Sdés. Algorithms & Com- 
binatorics, V. 8. Springer-Verlag, 1989, 165 pp, 
$39.50 (P). (ISBN: 0-387-50582-2] The proceeding 
of a meeting held at Fert6d, Hungary from July 7-11, 
1986. This meeting brought together number theo- 
rists and combinatorists concerned with problems in- 
volving the distribution of sequences of elements in 
certain collections of subsets. CEC 


Number Theory, T(15-17: 1), S, P, L. Factor- 
ization and Primality Testing. David M. Bressoud. 
Undergrad. Texts in Math. Springer-Verlag, 1989, 
xiii + 237 pp, $45. (ISBN: 0-387-97040-1] The- 
oretical and computational considerations of mod- 
ern methods in factorization and primality testing: 
Pollard’s rho and p — 1 methods, the quadratic 
sieve, continued-fraction algorithms, and _ elliptic- 
curve techniques. The informal writing style and 
numerous examples and exercises make this book a 
good choice for anyone interested in factoring large 
numbers. BC 


Number Theory, $(16-17), P*, L*. The Book 
of Prime Number Records, Second Edition. Paulo 
Ribenboim. Springer-Verlag, 1989, xxiii + 479 pp, 
$49.80. (ISBN: 0-387-97042-8] More than just a 
“Guinness Book” for mathematicians, this valiant 
attempt to keep track of advances on the prime 
number front presents a cogent and entertaining ac- 
count of number theory and its recent (and some 
not-so-recent) developments (First Edition, TR, De- 
cember 1988; Extended Review, August-September 
1989). BC 


Number Theory, P. Lecture Notes in Mathemat- 
tcs-1395: Number Theory, Madras 1987. Ed: K. Al- 
ladi. Springer-Verlag, 1989, vii + 234 pp, $20 (P). 
(ISBN: 0-387-51595-X] Nine papers directly related 
to the work of Ramanujan, primarily from a sympo- 
sium commemorating his 100th birthday. GG 
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Number Theory, T(16-18: 1), S, P, L. Bi- 
nary Quadratic Forms: Classical Theory and Modern 
Computations. Duncan A. Buell. Springer-Verlag, 
1989, x + 247 pp, $35. (ISBN: 0-387-97037-1] The 
quadratic expression az? + bay + cy* is one of the 
most fruitful in all of mathematics. Gauss devoted 
most of his Disquisitiones Arithmeticae to unraveling 
its number-theoretic mysteries. This book is sort of 
an update of Gauss, using the modern language of 
abstract algebra (and also the “correct” definition of 
a binary quadratic form), but still emphasizing ex- 
plicit computational methods. Try to imagine what 
Gauss would have done with a PC. BC 


Number Theory, T(18: 1), S, P. Holomorphic 
Hilbert Modular Forms. Paul B. Garrett. Math. Ser. 
Wadsworth, 1990, xiii + 304 pp. [ISBN: 0-534-10344- 
8] An introduction to Hilbert modular forms, which 
generalize elliptic modular forms from the rational 
numbers to totally real fields, with implications for 
the analytic continuation and functional equations 
of Dirichlet series. Covers classical and adelic view- 
points, Eisenstein series, theta series, and the arith- 
metic of Hilbert modular forms. BC 


Group Theory, P. Lecture Notes in Mathematics- 
1379: Probability Measures on Groups IX. Ed: H. 
Heyer. Springer-Verlag, 1989, viii + 437 pp, $37.50 
(P). [ISBN: 0-387-51401-5] Thirty papers from a 
January 1988 conference held at Oberwolfach. GG 


Group Theory, T(18: 1), S, P. Topological 
Groups: Characters, Dualities, and Minimal Group 
Topologies. Dikran N. Dikranjan, Ivan R. Prodanov, 
Luchezar N. Stoyanov. Pure & Appl. Math., V. 130. 
Marcel Dekker, 1990, x + 287 pp, $99.75. (ISBN: 
0-8247-8047-7] An attempt to present a substantial 
theory for topological groups (at least for the Abelian 
case) without resorting to functional analysis. A 
good share of the presentation depends on work of 
Folner concerning existence of continuous characters. 
Applications include almost periodic functions, the 
Haar integral, duality, and minimal groups; both 
Abelian and non-Abelian. Chapter notes, exercises, 
bibliography, index. JS 

Algebra, T(17: 1), S, P. An Introduction to Non- 
commutative Noetherian Rings. Math. Soc. Stud. 
Texts, V. 16. Cambridge University Pr, 1989, xvii 
+ 303 pp, $49.50; $19.95 (P). (ISBN: 0-521-36086-2; 
0-521-36925-8] Intended as a largely self-contained 
introduction to non-commutative Noetherian rings 
at the level of a second-year graduate student; dis- 
cussion begins with the examples of polynomial iden- 
tity rings, group algebras, differential operators, and 
enveloping algebras for Lie algebras. Theory includes 
prime ideals, modules, Goldie theorems, Krull di- 
mension, division algebras. Exercises, notes, bibli- 
ography. JS 

Algebra, P. Commutative Algebra. Eds: M. 
Hochster, C. Huneke, J.D. Sally. Math. Sci. Res. 
Inst., V. 15. Springer-Verlag, 1989, x + 516 pp, 
$49.80. [ISBN: 0-387-96990-X] Proceedings of a 
three-week program in Commutative Algebra held 
at the Mathematical Sciences Research Institute at 
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Berkeley from June 15 to July 2, 1987. Contains 
27 articles, some of which are expository and some 
which include current research. Should be of inter- 


est to both experts and non-experts in commutative 
algebra. RH 


Algebra, T(18: 1), S, P. Infinite Crossed Prod- 
ucts. Donald S. Passman. Pure & Appl. Math., 
V. 135. Academic Pr, 1989, xii + 468 pp, $84.50. 
[ISBN: 0-12-546390-1] A fairly self-contained book 
mainly concerned with recent developments in the 
theory of crossed products. Topics include Cohen- 
Montgomery duality, the symmetric Martindale ring 
of quotients, classification of prime and semiprime 
crossed products, the Galois theory of prime and 
semiprime rings, and the Grothendieck group of a 
Noetherian crossed product. Includes over 200 exer- 
cises and an excellent list of references. CEC 


Calculus, S(13). Introduction to Calculus and 
Analysis. Richard Courant, Fritz John. Springer- 
Verlag, 1989. Volume I, xxiii + 661 pp, $45, [ISBN: 
0-387-97151-3]; Volume II, xxiii + 954 pp, $59. 
[ISBN: 0-387-97152-1] First published in 1965 as an 
attempt to provide to the English speaking world a 
calculus text in the spirit of Courant’s 1927 German 
text. Rigorous, yet moving quickly to the uses of cal- 
culus, the book is a source for some (not new) ideas 
one hears mentioned in calculus reform conferences: 
defining the integral as an area, relegating indeter- 
minate forms and L’Hospital’s rule to an appendix, 
minimal attention to the convergence of series of con- 
stants, more attention to complex analysis. Would 
not win a prize in the contest to write a book with 
the most exercises; neither is it a candidate for a 
“lean calculus” text. AWR 


Calculus, T(16: 2). Advanced Calculus, Revised 
Edition. Lynn H. Loomis, Shlomo Sternberg. Jones 
& Bartlett, 1990, 580 pp, $40. [ISBN: 0-86720-122- 
3] A second course in advanced calculus, which pre- 
supposes a rigorous calculus course and a course in 
linear algebra. Provides a concise treatment of differ- 
entiation and integration in several variables. Mixed 
in are chapters on compactness and completeness, 
differential equations, multilinear functionals, mani- 
folds, potential theory, and classical mechanics. Nu- 
merous exercises, including some of a computational 
nature. (First Edition, TR, January 1968; Extended 
Reviews, April 1970 and October 1970.) GG 

Real Analysis, T(18: 1, 2), S, P. On L'- 
approzimation. Allan M. Pinkus. Tracts in Math., 
V. 93. Cambridge University Pr, 1989, x + 239 
pp, $44.50. (ISBN: 0-521-36650-X] A clearly written, 
friendly, and enthusiastic introduction to the quali- 
tative theory of best approximation, in the ZL! sense, 
from finite dimensional subspaces. Problems are ini- 
tially posed quite generally, but much of the devel- 
opment concerns the special case approximation of 
continuous functions on compact sets endowed with 
tractable measures. Includes a significant number of 
concrete examples and exercises; some of the latter 
develop material used elsewhere in the text. PZ 


Partial Differential Equations, P. Lecture Notes 


1990] 


in Mathematics-1390: Stochastic Partial Differential 
Equations and Applications II, Eds: G. Da Prato, L. 
Tubaro. Springer-Verlag, 1989, vi + 258 pp, $24.30 
(P). (ISBN: 0-387-51510-0] Proceedings from the 
second conference on this topic held in Italy in 1988. 
(The proceedings of the first conference are in LNM- 
1236 of this series.) Twenty-two short papers. JO 


Partial Differential Equations, P. Multilevel 
Adaptive Methods for Partial Differential Equations. 
Stephen F. McCormick. Frontiers in Appl. Math., 
V. 6. SIAM, 1989, ix + 162 pp, $24.50 (P). (ISBN: 
0-89871-247-5] Methods designed for discretization 
and solution of partial differential equations. In- 
cludes the finite volume method and multigrid meth- 
ods (common discretization methods for computa- 
tional fluid dynamics applications). MLR 


Partial Differential Equations, P. Nonlinear 
Boundary Value Problems in Science and Engineer- 
ing. C. Rogers, W.F. Ames. Math. in Sci. & Eng., 
V. 183. Academic Pr, 1989, xiii + 417 pp, $69.96. 
[ISBN: 0-12-593110-7] “In this text ...an attempt 
has been made to survey those nonlinear boundary 
value problems amenable to exact solutions.” This 
book is stuffed full of exact solutions obtained by 
symmetry, series, and constitutive law methods. Of 
particular interest is the appendix with full symme- 
try groups for forty differential equations. JO 


Numerical Analysis, P. Augmented Lagrangian 
and Operator-Splitting Methods in Nonlinear Me- 
chanics. Roland Glowinski, Patrick Le Tallec. 
Stud. in Appl. Math., V. 9. SIAM, 1989, x + 295 
pp, $44.50. (ISBN: 0-89871-230-0] Augmented La- 
grangian methods generalize Lagrange multipliers 
and lead to optimization algorithms with improved 
convergence properties. This book applies numeri- 
cal algorithms based on augmented Lagrangians to 
problems in elasticity and fluid mechanics. JO 


Numerical Analysis, T(18: 2), P. Approzimation 
by Spline Functions. Gunther Nurnberger. Springer- 
Verlag, 1989, xi + 243 pp, $39.50. (ISBN: 0-387- 
51618-2] Studies in a unified approach to the ba- 
sic theoretical and numerical aspects of interpolation 
and best approximation by polynomial splines in one 
variable. Prerequisites are basic knowledge of analy- 
sis and linear algebra. LC 


Numerical Analysis, P. Lecture Notes in Mathe- 
matics-13849: Deterministic and Stochastic Error 
Bounds in Numerical Analysis. Erich Novak. 
Springer-Verlag, 1988, 113 pp, $13.10 (P). (ISBN: 
0-387-50368-4] Covers deterministic error bounds, 
error bounds for Monte Carlo methods, and average 
error bounds. LC 


Numerical Analysis, P. Lecture Notes in Mathe- 
matics-1397: Numerical Analysis and Parallel Pro- 
cessing. Ed: P.R. Turner. Springer-Verlag, 1989, vi 
+ 264 pp, $24.30 (P). [ISBN: 0-387-51645-X] Notes 
for five of the eight series of lectures presented at 
the Third Science and Engineering Research Council 
Numerical Analysis Summer School held July 12-31, 
1987 at the University of Lancaster. Authors are 
L.C.W. Dixon, J.A. George, C.W. Clenshaw, F.W.J. 
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Olver and P.R. Turner, A. Feldstein and R.H. Good- 
man, J.R. Whiteman and G. Goodsell. DFA 


Numerical Analysis, S(18), P. Lecture Notes 
in Physics-318: An Introduction to the Numerical 
Analysis of Spectral Methods. Bertrand Mercier. 
Springer-Verlag, 1989, v + 154 pp, $23.30. (ISBN: 0- 
387-51106-7] Begins with Fourier series, continues 
with polynomial expansions (in particular Cheby- 
shev expansions), interpolation operators, and Fast 
Fourier Transforms. Includes a rigorous treatment of 
one-dimensional advection and diffusion equations. 
Terse, with few examples and no exercises. MPR 


Numerical Analysis, P. Adaptive Methods for 
Partial Differential Equations. Eds: Joseph E. Fla- 
herty, et al. SIAM, 1989, 265 pp, $31.50 (P). (ISBN: 
0-89871-242-4] “The essential part of any adaptive 
approach is the a posteriori determination of the er- 
ror. If the error is not acceptable, then an adaptation 
of the computational process is made.” Sixteen pa- 
pers from the Workshop on Adaptive Computational 
Methods for Partial Differential Equations held at 
Rensselaer Polytechnic Institute in 1988. JO 


Functional Analysis, T**(17-18: 3), L**. Func- 
tional Equations in Several Variables with Applica- 
tions to Mathematics, Information Theory and to the 
Natural and Social Sciences. J. Aczél, J. Dhombres. 
Ency. of Math. & Its Applic., V. 31. Cambridge U 
Pr, 1989, xiv + 462 pp, $89.50. (ISBN: 0-521-35276- 
2] The emphasis is on applications of the theory. It 
is organized so that most of the chapters are rela- 
tively independent of each other. The prerequisites 
are calculus through basic Lebesgue integration and 
linear algebra. Over 400 exercises, some of which re- 
quire outside references to solve. Also includes 25 
pages of historical notes and an extensive bibliogra- 
phy (ordered chronologically). MPR 


Functional Analysis, P. Lectures on the Ekeland 
Variational Principle with Applications and Detours. 
D.G. de Figueiredo. Springer-Verlag, 1989, 96 pp, 
$11.90 (P). (ISBN: 0-387-51179-2] A consolidation 
of results on the subject, with applications to semi- 
linear elliptic differential equations and the geometry 
of Banach spaces. JO 


Functional Analysis, P. Nonlinear Functional 
Analysis. Ed: P.S. Milojevic. Lect. Notes in Pure 
& Appl. Math., V. 121. Marcel Dekker, 1990, vii 
+ 264 pp, $89.75 (P). [ISBN: 0-8247-8255-0] Seven 
papers presented at the Special Session on Nonlinear 
Functional Analysis held at the New Jersey Institute 
of Technology, April 1987. KS 


Functional Analysis, T(18), P. Weakly Differ- 
entiable Functions: Sobolev Spaces and Functions of 
Bounded Variation. Wiliam P. Ziemer. Grad. Texts 
in Math., V. 120. Springer-Verlag, 1989, xvi + 308 
pp, $49. [ISBN: 0-387-97017-7] A weakly differen- 
tiable function is integrable and has partial deriva- 
tives in the sense of distributions that are Z? func- 
tions (Sobolev functions) or (signed) measures with 
finite total variation. This book analyzes the point- 
wise behavior of these functions. Historical notes 
include references to important and relatively new 
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results. MLR 


Functional Analysis, T(16-17: 2), L. Introduc- 
tory Functional Analysis with Applications. Erwin 
Kreyszig. Robert E Krieger, 1989, xiv + 688 pp, 
$56. [ISBN: 0-89464-332-0] Unaltered reprint of a 
1978 Wiley publication (TR, March 1978). LAS 


Analysis, P, L*. Analysis I: Integral Representa- 
tions and Asymptotic Methods. Ed: R.V. Gamkre- 
lidze. Encyclop. of Math. Sci., V. 13. Springer- 
Verlag, 1989, 238 pp, $59.50. (ISBN: 0-387-17008-1] 
A smoothly moving account of developments starting 
with Newton and Euler, this can be recommended 
to someone wanting to know what ideas led up to, 
say, attention to divergent integrals. Could serve as 
either an introduction or a capstone to a course in 
analysis. AWR 


Analysis, P. Lecture Notes in Mathematics-1384: 
Harmonic Analysis and Partial Differential Equa- 
tions. Ed: J. Garcia-Cuerva. Springer-Verlag, 1989, 
vil + 213 pp, $20 (P). [ISBN: 0-387-51460-0] Pro- 
ceedings of a conference held in Spain in June 1987. 
Four survey lectures and ten more specialized papers, 
one in French. JO 


Analysis, P. Theory of Reproducing Kernels and its 
Applications. Saburou Saitoh. Pitman Res. Notes 
in Math. Ser., V. 189. Longman Scientific & Tech- 
nical (US Distr: Wiley), 1988, 157 pp, $57.95 (P). 
[ISBN: 0-582-03564-3] Aims to introduce the the- 
ory of reproducing kernels from two viewpoints: the 
general theory as established by N. Aronszajn, and 
the theories of complex analysis of one variable as de- 
veloped by S. Bergman, M. Schiffer, P. Garabedian, 
Z. Nehari, D. Hejhal, etc. LC 


Analysis, P. Lecture Notes in Mathematics-1393: 
Séminaire de Théorie du Potentiel Paris, No. 9. Eds: 
N. Bouleau, et al. Springer-Verlag, 1989, vi + 265 
pp, $24.30 (P). [ISBN: 0-387-51592-5] This volume 
emphasizes the connections with other branches of 
analysis. JAS 

Analysis, T(18: 2), P. Asymptotic Approzima- 
tions of Integrals. R. Wong. Comput. Sci. & 
Scientific Comput. Academic Pr, 1989, xiii + 543 
pp, $69.95. [ISBN: 0-12-762535-6] Studies methods 
used in asymptotic approximation of integrals, in- 
cluding Mellin transform technique for multiple inte- 
grals, summability method, distributional approach, 
logarithmic singularities, uniform asymptotic expan- 
sions via a rational transformation, and double inte- 
grals with a curve of stationary points. Prerequisites 
are advanced calculus and complex variables. LC 


Analysis, T**(13-14: 1, 2), L. A Course in Math- 
ematics For Students of Physics 1. Paul Bamberg, 
Shlomo Sternberg. Cambridge U Pr, 1988, xvii + 
405 pp, $49.50. [ISBN: 0-521-25017-X] Developed 
at Harvard over the last ten years for students with 
a solid background in one-variable calculus. If you 
do not have a course for which this book is suit- 
able, you might want to introduce one. Best for stu- 
dents concurrently studying physics. Covers the the- 
ory and physical applications of linear algebra and 
several variable calculus (in particular, the exterior 
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calculus). Written in “spiral method.” Chapter sum- 
maries and exercises. JDEK 


Analysis, T(17-18), S, P. An Introduction to Har- 
monic Analysis on Semisimple Lie Groups. V.S. 
Varadarajan. Stud. in Adv. Math., V. 16. Cam- 
bridge U Pr, 1989, x + 316 pp, $69.50. (ISBN: 0- 
521-34156-6] Presents the representation theory of 
semisimple groups via special examples. Through- 
out, expository writing traces the historical develop- 
ment of the subject and makes this a very readable 
account. As a self-contained introduction with prob- 
lems at the end of each chapter, this book can serve 
as a text or as a researcher’s guide to this central 
part of mathematics. KS 


Algebraic Geometry, P. The Curves Seminar at 
Queen’s, Volume VI. Anthony V. Geramita. Papers 
in Pure & Appl. Math., No. 83. Queen’s U, 1989, 
319 pp, (P). Contains three substantial expository 
articles on the non-singular cubic surface in P® (A. 
Geramita), fat points (A. Gimigliano), and liaison 
(J. Migliore), along with nine research articles. SG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1392: Real Algebraic Surfaces. Robert Silhol. 
Springer-Verlag, 1989, x + 215 pp, $20 (P). [ISBN: 0- 
387-51563-1] An investigation of the general prob- 
lem of classifying real algebraic surfaces and the spe- 
cific problem of determining the topology of the real 
part of a real algebraic surface. SG 


Algebraic Geometry, P. Group Actions and In- 
variant Theory. Eds: A. Bialynicki-Birula, e? al. 
Conf. Proc., V. 10. AMS, 1989, x + 228 pp, 
(P). (ISBN: 0-8218-6015-1] Twenty-two papers on 
groups acting on algebraic spaces. The papers were 
presented at a conference in Montreal in 1988. SG 


Algebraic Geometry, P. 2-Knots and their 
Groups. Jonathan Hillman. Australian Math. Soc. 
Lect. Ser., V. 5. Cambridge U Pr, 1989, x + 164 
pp, $19.95 (P). (ISBN: 0-521-37812-5] An n-knot is 
an embedding K : S" — S™+?, (For n = 1 we have 
the “classical knots.”) The major result contained 
herein is that the 4-manifold obtained by surgery on 
a 2-knot is often aspherical. This is in contrast to the 
fact that the exterior of a nontrivial n-knot (n > 1) 
is never aspherical. Also included is a discussion of 
how to recover 2-knots from their group invariants. 
It is shown that this is possible if the group is torsion 
free and polycyclic. MPR 

Differential Geometry, P. Cosmology in (2 + 1)- 
Dimensions, Cyclic Models, and Deformations of 
M21. Victor Guillemin. Annals of Math. Stud., 
No. 121. Princeton U Pr, 1989, v + 228 pp, $50; $15 
(P). (ISBN: 0-691-08513-7] An exposition of Zollfrei 
manifolds in dimension three with the aim of provid- 
ing a low-dimensional framework for various more 
general questions in cosmology. JAS 


Differential Geometry, T*, S(18), P. The 
Method of Equivalence and Its Applications. Robert 
B. Gardner. CBMS-NSF Reg. Conf. Ser. in Appl. 
Math., V. 58. SIAM, 1989, vii + 127 pp, $21.75 (P). 
[ISBN: 0-89871-240-8] The problem of equivalence 
is to characterize conditions under which sets of ge- 
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ometric objects are equivalent, i.e., there exist dif- 
feomorphisms mapping one set onto the other. The 
author restricts himself to cases where the diffeomor- 
phisms may be described as solutions to first order 
systems of differential equations. In particular, the 
focus is on the case where sets of geometric objects 
are control systems and the equivalences are feedback 
transformations. This appears to be nicely written 
(in the form of ten lectures) with many examples in- 
cluded to illustrate ideas. No exercises. MPR 


Differential Geometry, S(18), L. The Theory and 
Applications of Harmonic Integrals. W.V.D. Hodge. 
Math. Library. Cambridge University Pr, 1989, xiii 
+ 284 pp, $22.95 (P). (ISBN: 0-521-35881-7] Paper- 
back edition of this classic is enhanced by a Foreword 
by Sir Michael Atiyah in which he sets Hodge’s work 
in its historical context and connects it to subsequent 
developments. Geometry of manifolds and applica- 
tions to algebraic varieties and continuous groups. 
Takes readers back to the origins of the subject. Not 
to be read for currency. JDEK 


Differential Geometry, T(17-18: 2, 3). Differ- 
ential Geometry, Gauge Theories, and Gravity. M. 
Gockeler, T. Schiicker. Mono. on Math. Physics. 
Cambridge University Pr, 1989, xii + 230 pp, $19.95 
(P). (ISBN: 0-521-37821-4] Intended as a graduate 
text in theoretical physics, this is a self-contained but 
terse presentation of the mathematics required for 
the modern treatment of gravity and general relativ- 
ity. Although chapters on differential forms, man- 
ifolds, Lie groups, and fibre bundles limit the for- 
mal prerequisites to linear algebra and elementary 
real analysis, undergraduates should approach this 
book with extreme caution. Sections on Maxwell’s 
equations, Einstein-Cartan theory, Yang-Mills the- 
ory, and monopoles and instantons provide the phys- 
ical payoff for the mathematical effort. (1987 hard- 
cover edition, TR, December 1988.) JO 


Differential Geometry, S(17-18), P, L. Lecture 
Notes in Mathematics-1000: Differential Geometry 
in the Large. Heinz Hopf. Springer-Verlag, 1989, vii 
+ 184 pp, $16.20 (P). (ISBN: 0-387-51497-X] Cor- 
rected reprint of the LNM-1000 anniversary volume 
(TR, April 1984; Extended Review, January 1986), 
containing notes from 1946 and 1956 lectures by 
Hopf, prepared by Peter Lax and John Gray. New 
Preface contains pointers to recent work by Henry 
Wente and others on a new class of surfaces in E° of 
constant mean curvature, resolving questions posed 
by Hopf in these notes. LAS 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1370: Algebraic Topology. Ed: G. Carlsson, 
et al. Springer-Verlag, 1989, ix + 456 pp, $47 (P). 
[ISBN: 0-387-51118-0] Proceedings of an interna- 
tional conference held in Arcata, California, July 27- 
August 2, 1986. Special attention is given to the 
silver anniversary of Topology and the work of E.H. 
Brown. JAS 


Topology, P. Lecture Notes in Mathematics-1374: 
The Topology of 4-Manifolds. Robion C. Kirby. 
Springer-Verlag, 1989, vi + 108 pp, $13.50 (P). 
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[ISBN: 0-387-51148-2] Exposition of a number of 
improved proofs and results concerning cobordism 
groups of 4-manifolds with special attention given to 
Rohlin’s theorem: 22° = Z. JAS 


Topology, S*, L*. Mathematics and Knots. Ron- 
nie Brown, et al. The Mathematics and Knots Exhi- 
bition Group (U. of Wales, School of Mathematics, 
Dean Street, Bangor, Gwynedd LL57 1UT, Wales, 
UK), 1989, 27 pp, (P). (ISBN: 0-9514947-16] Text 
with illustrations of a museum exhibit on classifica- 
tion of knots prepared at the University of Wales. 
Illustrates crossing numbers, unknotting numbers, 
bridge numbers, coloring, links, mirror images, arith- 
metic of knots, prime knots, and diverse applications 
(fluid flow, string theorem, DNA). Excellent visual 
introduction. LAS 

Topology, T(17-18: 1, 2). Fibrewise Topology. 
I.M. James. Tracts in Math., V. 91. Cambridge U 
Pr, 1989, x + 198 pp, $49.50. (ISBN: 0-521-36090-0] 
An exposition of general topology (through homo- 
topy) taking the fundamental objects to be continu- 
ous functions from spaces X to a fixed base space B. 
This provides a framework to bring ideas from fiber 
spaces to general topology. Exercises, references, and 
index. JAS 


Operations Research, S(11-13). Queues: Will 
This Wait Never End! Clifford Sloyer, et al. Con- 
temp. Appl. Math. Janson, 1987, vi + 41 pp, 
$7.95 (P). (ISBN: 0-939765-08-X] A high-school level 
workbook approach to queues based on a planned 
sequence of 77 questions (with answers at the back) 
interspersed with minimal exposition. Begins with 
constant arrival rates, and ends with simulations of 
variable arrivals and service times. LAS 
Optimization, T(14-15: 1), L. Linear Program- 
ming and Its Applications. James K. Strayer. Un- 
dergrad. Texts in Math. Springer-Verlag, 1989, xi 
+ 265 pp, $47. [ISBN: 0-387-96930-6] Covers solv- 
ing linear programming problems geometrically and 
by the simplex algorithm using the Tucker tableau. 
Also includes duality theory and problems that are 
not immediately solvable by the simplex algorithm. 
Includes applications to game theory, transportation 
and assignment problems, and network-flow prob- 
lems. Has many examples and exercises. Prereq- 
uisites are a knowledge of linear equations, including 
the graphing of lines and planes, and the solution 
of systems of simultaneous linear equations without 
matrices. RH 


Optimization, P. Finite Element Approzimation 
for Optimal Shape Design: Theory and Applications. 
J. Haslinger, P. Neittaanmaki. Wiley, 1988, xii + 
335 pp, $79.95. [ISBN: 0-471-92079-7] Investigates 
“optimal design problems via optimal control theory 
where the states of systems are governed by varia- 
tional inequalities.” LC 


Dynamical Systems, S(18), P. Vacillation and 
Predictability Properties of Low-Order Atmospheric 
Spectral Models. H.E. de Swart. CWI Tract, V. 60. 
Mathematisch Centrum, 1989, 121 pp, Dfl. 19 (P). 
[ISBN: 90-6196-368-0] A low-order spectral model 
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of the atmosphere is obtained when the infinite num- 
ber of nonlinear coupled ordinary differential equa- 
tions arising from an eigenfunction expansion are 
truncated to a finite system. One criterion for the 
truncation is that the resulting model should yield 
trajectories tending towards strange attractors. This 
tract discusses models truncated to 3, 6, and 10 com- 
ponents. One conclusion: “The atmosphere pos- 
sesses an intrinsic finite range of predictability of 
about two weeks.” MPR 


Dynamical Systems, S(18), P. Dynamical Sys- 
tems II. Ed: Ya. G. Sinai. Encyclop. of Math. Sci., 
V. 2. Springer-Verlag, 1989, viii + 281 pp, $59.95. 
[ISBN: 0-387-17001-4] More on dynamical systems 
and differential equations. From the Russian school. 
This volume contains three articles: General Ergodic 
Theory of Groups of Measure Preserving Transfor- 
mations; Ergodic Theory of Smooth Dynamical Sys- 
tems; and Dynamical Systems of Statistical Mechan- 
ics and Kinetic Equations. MPR 


Dynamical Systems, T, S*(16-18), P, L*. Per- 
spectives of Nonlinear Dynamics, Volume 1. E. Atlee 
Jackson. Cambridge U Pr, 1989, xix + 495 pp, 
$54.50. (ISBN: 0-521-34504-9}] An eclectic collage 
of introductory examples, results, and applications 
in nonlinear systems organized roughly in chronolog- 
ical terms. “In the beginning ...there was Poincaré.” 
Touches on an enormous variety of stimulating ideas, 
including catastrophe theory, fractals, Lyapunov ex- 
ponents, deterministic chaos, forced oscillators, cel- 
lular automata. Illustrated with diverse applications 
to dozens of different fields. Several appendices pro- 
vide additional mathematical depth. Massive 1600 
item bibliography. Superbly crafted for its intended 
purpose—to stimulate imagination; useful as a sys- 
tematic text. LAS 


Dynamical Systems, S(18), P. Lecture Notes in 
Control and Information Sciences-127: Determinis- 
tic Identification of Dynamical Systems. C. Heij. 
Springer-Verlag, 1989, 292 pp, $41.60 (P). [ISBN: 0- 
387-51323-X] The aim of this monograph is to show 
how a model, especially a dynamical system, may 
be determined from a set of data, especially a time 
series. In this case, the models are deterministic be- 
cause there is no underlying probabilistic or stochas- 
tic assumptions. Uncertainties and errors which arise 
are due to the complexities of reality and the inherent 
shortcomings of the model. MPR 


Systems Theory, P, L. Lecture Notes in Control 
and Information Sciences-134: Auziliary Signal De- 
sign in Fault Detection and Diagnosis. X.J. Zhang. 
Springer-Verlag, 1989, xii + 213 pp, $30.30 (P). 
[ISBN: 0-387-51559-3] To avoid catastrophies such 
as the Three Mile Island Accident, systems to detect 
and diagnose faults must be efficient yet not overly 
complex. The author develops one such system and 
applies it to the anhydrous caustic soda plant. KS 


Systems Theory, P. The Volterra and Wiener 
Theories of Nonlinear Systems. Martin Schetzen. 
Robert E Krieger, 1989, xviii + 573 pp, $55.50. 
[ISBN: 0-89464-356-8] Reprint of a 1980 volume, 
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up-dated with two papers added in the appendices. 
In one, Volterra theory is used to develop a new the- 
ory for analysis of nonlinear networks. The other new 
paper introduces the concept of differential models 
and develops it using Wiener theory. KS 


Probability, T*(1-3), S, P, L. Introduction 
to Probability Models, Fourth Edition. Sheldon 
M. Ross. Academic Pr, 1989, xiv + 544 pp, 
$44.50. (ISBN: 0-12-598464-2] Differs from Third 
Edition (TR, December 1985) by addition of mate- 
rial on Markov chains, minimal theory, and simula- 
tion. Postcalculus introduction to probability the- 
ory, stochastic processes, queueing theory, and sim- 
ulation. (First Edition, TR, March 1973; Extended 
Review, November 1975; Second Edition, TR, Jan- 
uary 1982.) TH 


Probability, P. Central Limit Theorems for Gener- 
alized Multilinear Forms. P. de Jong. CWI Tract, V. 
61. Mathematisch Centrum, 1989, 84 pp, Dfl. 14.10 
(P). (ISBN: 90-6196-369-9] Theoretical monograph 
presenting “some central limit theorems for homo- 
geneous multilinear forms and for generalizations of 
multilinear forms.” RSK 


Probability, P. Lecture Notes in Mathematics- 
1891: Probability Theory on Vector Spaces IV. Eds: 
S. Cambanis, A. Weron. Springer-Verlag, 1989, viii 
+ 424 pp, $37.50 (P). (ISBN: 0-387-51548-8] 34 pa- 
pers from a June 1987 conference in Lancut, Poland 
on probability measures on algebraic and topological 
structures, limit theorems on vector spaces, infinitely 
divisible measures, and stochastic integrals. TH 


Stochastic Processes, P. Stochastic Differential 
Equations and Diffusion Processes, Second Edition. 
Nobuyuki Ikeda, Shinzo Watanabe. Math. Lib., 
V. 24. North-Holland (US Distr: Elsevier Science), 
1989, xvi + 555 pp, $147.25. (ISBN: 0-444-873783] 
Changes from the First Edition (TR, April 1982) in- 
clude Brownian motion excursion and Malliavin cal- 
culus sections, and added sections on complex mar- 
tingales and Kahler diffusions. Terse, rigorous pre- 
sentation. TH 


Stochastic Processes, P. Lecture Notes in 
Mathematics- 1372: Séminaire de Probabilités XXIII. 
Eds: J. Azéma, P.A. Meyer, M. Yor. Springer- 
Verlag, 1989, iv + 582 pp, $49.50 (P). [ISBN: 0-387- 
51191-1] 27 French and 18 English papers from a 
seminar in Strasbourg. Diffusion processes, martin- 
gales, Brownian motion. TH 


Elementary Statistics, T(13-14: 2). Statistical 
Techniques in Business and Economics, Seventh Edi- 
tion. Robert D. Mason, Douglas A. Lind. Richard 
D Irwin, 1990, xvii + 910 pp, $43.95. [ISBN: 0-256- 
07696-0] Added features in the new edition include 
stem-and-leaf displays, the finite population correc- 
tion factor, two-factor ANOVA, sign tests, accep- 
tance sampling, and MINITAB output. (Fourth Edé- 
tion, TR, November 1978; Fifth Edition, TR, Novem- 
ber 1982; Sizth Edition, TR, August-September 
1986.) RWJ 


Elementary Statistics, T(14: 1), S. Statistics for 
Biologists, Third Edition. R.C. Campbell. Cam- 
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bridge U Pr, 1989, xviii + 446 pp, $59.50. (ISBN: 
0-521-36095-1] Revision of the author’s 1974 Sec- 
ond Edition (TR, January 1975). Main change is 
the addition of computer analyses of a selection of 


the examples, using several different statistical pack- 
ages. RSK 


Computational Statistics, P? Spatial Regression 
Analysis on the PC: Spatial Statistics Using Minitab. 
Daniel A. Griffith. Discussion Paper No. 1. In- 
stitute of Mathematical Geography (2790 Briarcliff, 
Ann Arbor, MI 48105), 1989, vi + 84 pp, $12.95 (P). 
Gives Minitab code to implement spatial statistics 
estimates (e.g., Moran coefficient), but gives no indi- 
cation as to what is being estimated. TH 


Computational Statistics, P. A Specification Sys- 
tem for Stattstical Software. V.J. de Jong. CWI 
Tract, V. 62. Mathematisch Centrum, 1989, 250 
pp, Dfl. 37.90 (P). [ISBN: 90-6196-370-2] Describes 
CONDUCTOR, a very high-level (fourth-generation) 
language for statistics, designed to be more flexi- 
ble than statistical packages, more efficient than S 
or SAS/IML, and have lower life-cycle costs than 
low-level language programs. Concepts, examples 
(the bootstrap), and a formal description of the lan- 
guage. TH 


Statistics, T?(13-14), S*. Modern Elementary 
Probability and Statistics with Statistical Program- 
ming in SAS, MINITAB & BMDP. Edward J. 
Dudewicz, Pinyuen Chen, Baldeo K. Taneja. Amer- 
ican Sciences Pr, 1989, viii + 375 pp, $34.50 (P). 
[ISBN: 0-935950-19-2] Rich set of realistic prob- 
lems; book written for mathematically unsophisti- 
cated. Better editing and printing would significantly 
enhance the value. Omits fundamental concepts such 
as difference between discrete and continuous ran- 
dom variables; introduces material best left for a dif- 
ferent course, e.g., Bayes theorem. Not a good intro- 
ductory book for someone planning to take another 
course in statistics. WE 


Statistics, S(15-18), L. Lecture Notes in Statistics- 
56: The Analysis of Categorical Data Using GLIM. 
James K. Lindsey. Springer-Verlag, 1989, v + 168 
pp, $19.50 (P). [ISBN: 0-387-97029-0] GLIM 3.77 
update 2 is applied to a variety of examples in 
the social sciences for which log-linear/logistic mod- 
els are used. Assumes previous knowledge of log- 
linear/logistic models. Appendices include anno- 
tated GLIM programs and macros. RWJ 
Statistics, T(18: 1), P*, L. Asymptotic Tech- 
niques for Use in Statistics. O.E. Barndorff-Nielsen, 
D.R. Cox. Mono. on Stat. & Appl. Prob. Chap- 
man & Hall, 1989, x + 252 pp, $49.95. [ISBN: 0- 
412-31400-2] Survey of asymptotic approximation 
methods for the distribution of (functions of) means 
of random variables, e.g., delta method. Uni- and 
multi-variate; conversational tone; few proofs. TH 


Statistics, S(16-18). A Primer of LISREL: Ba- 
sic Applications and Programming for Confirma- 
tory Factor Analytic Models. Barbara M. Byrne. 
Springer-Verlag, 1989, xii + 184 pp, $49. (ISBN: 0- 
387-96972-1] Non-technical introduction to the use 
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of the LISREL (linear structural relations) VI pro- 
gram by Joreskog and Sorbom (1985). Sample pro- 
gram input and output displayed for “self-concept” 
investigations in psychology. RWJ 


Statistics, T(17-18: 1), L. The Statistical Analy- 
sis of Discrete Data. Thomas 4. Santner, Diane E. 
Duffy. Texts in Stat. Springer-Verlag, 1989, xii + 
367 pp, $45. [ISBN: 0-387-97018-5] Assumes some 
background in decision theory and linear model the- 
ory for continuous data. Covers point and interval 
estimates of binomial, multinomial, and Poisson pa- 
rameters, loglinear models, cross-classified data, lo- 
gistic regression, two by T and two by two by S ta- 
bles. Includes recent advances in Bayes and shrink- 
age estimation, graphical models for contingency ta- 
bles, and diagnostic tools for loglinear models and 
logistic regression. RWJ 


Statistics, P. Contributions to Probability and 
Statistics: Essays in Honor of Ingram Olkin. Eds: 
Leon Jay Gleser, et al. Springer-Verlag, 1989, x + 
505 pp, $69. (ISBN: 0-387-97076-2] Collection of 32 
original essays by former students, collaborators, and 
colleagues of Olkin, divided into four areas: proba- 
bility inequalities and characterizations; multivariate 
analysis and association; linear and nonlinear mod- 
els, ranking and selection, design; and approaches to 
inference. Also includes an interview with Olkin, a 
bibliography, and a short biography. RSK 


Statistics, P. Probability, Statistics, and Mathemat- 
ics: Papers in Honor of Samuel Karlin. Eds: T.W. 
Anderson, Krishna B. Athreya, Donald L. Iglehart. 
Academic Pr, 1989, xl + 371 pp, $39.95. (ISBN: 
0-12-058470-0] A collection of 24 papers written 
by mathematicians, probabilists, statisticians, and 
economists on the occasion of Samuel Karlin’s 65th 
birthday. Includes a list of Karlin’s publications and 
students. RWJ 


Statistics, T(17: 2). Fundamentals of Mathemat- 
ical Statistics. H.T. Nguyen, G.S. Rogers. Texts in 
Stat. Springer-Verlag, 1989, $39.80 each. Volume I: 
Probability for Statistics, x + 432 pp (ISBN: 0-387- 
97014-2]; Volume II: Statistical Inference, xi + 422 
pp. (ISBN: 0-387-97020-7] Classical text, with the 
material divided into lessons, each appropriate for 
a single class period. Volume I is divided into three 
sections: elementary probability and statistics; prob- 
ability and expectation; and limiting distributions. 
Volume II contains four sections: sampling and dis- 
tributions; statistical estimation; testing hypotheses; 
and special topics. Exercises are integrated into the 
discussions. RSK 


Statistics, T(15: 1, 2). Probability and Statis- 
tics: Theory and Applications. Gunnar Blom. Texts 
in Stat. Springer-Verlag, 1989, xi + 356 pp, $59. 
[ISBN: 0-387-96852-0] Modified translation of a 
Swedish text. Interesting alternative to the usual 
calculus-based texts. Emphasizes practical applica- 
tions while holding mathematical arguments to a 
minimum. Includes chapters on descriptive statis- 
tics and planning statistical investigations, but omits 
topics such as analysis of variance, moment generat- 
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ing functions, and the F-distribution. RSK 


Statistics, S(14-15). Concise Statistics. M.G. 
Godfrey, E.M. Roebuck, A.J. Sherlock. Edward 
Arnold, 1988, vi + 402 pp, $19.95 (P). [ISBN: 
0-7131-3591-3] Study guide for statistics for ad- 
vanced level mathematical syllabuses in England. 
Contains numerous exercises with fairly complete an- 
swers. RSK 


Statistics, S(18). Multivariate Statistics and Prob- 
ability: Essays in Memory of Paruchurit R. Krish- 
naiah. Eds: C.R. Rao, M.M. Rao. Academic Pr, 
1989, xiii + 567 pp, $59.95. (ISBN: 0-12-580205-6] 
A collection of 35 papers covering the main areas of 
multivariate statistical theory and its applications, 
along with some probability and stochastic analysis. 
The papers discuss finite sampling and asymptotic 
results, including decision theory, Bayesian analy- 
sis, maximum likelihood estimation, regression, and 
time series. Most results theoretical, although some 
practical applications are included. Of interest to re- 
searchers and graduate students in theoretical and 
applied statistics, multivariate analysis, and random 
processes. Papers from 61 authors from several coun- 
tries. JJ 


Statistics, P, L. Proceedings of the Thirty-Fourth 
Conference on the Design of Ezperiments. US Army 
Research Office (POB 12211, Research Triangle 
Park, NC), 1988, xi + 367 pp, (P). Twenty papers 
from an October 1988 conference held at New Mexico 
State University. LAS 


Computer Literacy, S(13), L. A Glossary of 
Computing Terms, An Introduction, Sizth Edition. 
British Computer Society. Cambridge U Pr, 1989, 
xvi + 92 pp, $5.95 (P). (ISBN: 0-521-37828-1] Over 
800 concise definitions, divided into 15 categories of 
origin (applications, data representation and struc- 
ture, machine architecture, systems software, etc.). 
Suitable as a reference for the lay computer-user. A 
thin volume, but note its price! DFA 


Programming, S. An Easy Course in Using the 
HP-42S. Dan Coffin, Chris Coffin. Grapevine Pub, 
1989, 379 pp, $22 (P). [ISBN: 0-931011-26-4] De- 
tailed handbook for using and programming the HP- 
42S, a hand-calculator with built-in capabilities for 
matrix and statistical calculations, solving equations 
and numerical integration. Includes alphabetic reg- 
isters for naming variables and programs, branching, 
conditional tests, loops, indirect addressing, and rea- 
sonable editing capabilities for stored programs. LAS 


Algorithms, P. Lecture Notes in Mathematics- 
1387: Iterative Methods for Simultaneous Inclusion 
of Polynomial Zeros. Miodrag Petkovi¢. Springer- 
Verlag, 1989, x + 263 pp, $24.30 (P). [ISBN: 0-387- 
51485-6] Galois’ famous theorem states that ex- 
plicit formulas determining zeros exist only for poly- 
nomials of degree less than five. Numerical methods 
must be used for higher degree polynomials. This 
book collects algorithms of the past fifteen years. 
Much of the work is in the spirit of the so-called “in- 
terval arithmetic” pioneered by Henrici, Gargantini, 
Alefeld, and Herzberger. MPR 
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Computer Systems, P. The Society of Tezt: Hy- 
pertezt, Hypermedia, and the Social Construction of 
Information. Ed: Edward Barrett. Inform. Systems. 
MIT Pr, 1989, xix + 459 pp, $37.50. [ISBN: 0-262- 
02291-5] A sequel to Tezt, ConTezt, and Hyper Tezt 
(TR, June-July 1989) containing 22 new essays that 
together “assert a syntheses of three related areas 
of activity within the computer industry: writing, 
management, and engineering.” These authors view 
computer-based information (including hypertext) as 
a form of social construction. Based in part on pa- 
pers presented at an annual MIT summer conference 
“Writing for the Computer Industry.” LAS 


Computer Graphics, P, L. Mathematical Methods 
in Computer Aided Geometric Design. Ed: Tom Ly- 
che, Larry L. Schumaker. Academic Pr, 1989, xv + 
611 pp, $49.95. (ISBN: 0-12-460515-X] Forty-five pa- 
pers from an international conference on mathemat- 
ical methods in CAGD. Topics include varieties of 
splines (discrete D™, multivariate, box, beta, para- 
metric, etc.), Bezier curves, interpolation schemes, 
etc. BC 


Artificial Intelligence, P. Ontic: A Knowledge 
Representation System for Mathematics. David A. 
McAllester. MIT Pr, 1989, xiv + 158 pp, $27.50. 
(ISBN: 0-262-13235-4] Ontic is a software system 
that automatically verifies formal mathematical the- 
orems. It can both verify existing proofs as a check of 
their validity as well as try to discover proofs of cur- 
rently unproven mathematical conjectures. It uses 
many of the new ideas of artificial intelligence to per- 
form its proofs, including object-oriented inference 
and forward-chaining logic. The goal of the program 
is to demonstrate the validity of these new ideas by 
placing them in a working system for automated the- 
orem proving. GMS 


Computer Science, P. Lecture Notes in Control 
and Information Sciences-130: Advances in Com- 
puting and Control. Eds: W.A. Porter, S.C. Kak, 
J.L. Aravena. Springer-Verlag, 1989, vi + 367 pp, 
$47 (P). (ISBN: 0-387-51425-2] A selection of pa- 
pers from an international conference on Advances 
in Communication and Control Systems held in Ba- 
ton Rouge, Louisiana, October 19-21, 1988. A sec- 
ond volume, Advances in Communications and Sig- 
nal Processing, contains other papers from this con- 
ference. JAS 


Computer Science, P. Resolutton of Equations 
in Algebraic Structures, Volume 1: Algebraic Tech- 
niques. Eds: Hassan Ait-Kaci, Maurice Nivat. Aca- 
demic Pr, 1989, xxi + 452 pp, $64.95. (ISBN: 0-12- 
046370-9] Papers describing results, methods, al- 
gorithms and the state-of-the-art in resolution of 
equations in algebraic structures relevant to sym- 
bolic computation and the foundations of program- 
ming. By participants in the May 1987 colloquium 
in Lakeway, Texas. Contains practical applications 
as well as theoretical methods. A second collection 
from the same conference will concern rewriting tech- 
niques. DFA 


Computer Science, P. Categories in Computer 
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Science and Logic. Ed: John W. Gray, Andre Sce- 
drov. Contemp. Math., V. 92. AMS, 1989, x + 382 
pp, $42 (P). (ISBN: 0-8218-5100-4] Papers from a 
conference held June 14-20, 1987 at the University of 
Colorado, Boulder. JAS 


Applications, P*. Theta Functions, Bowdoin 
1987. Eds: Leon Ehrenpreis, Robert C. Gunning. 
Proc. of Symposia in Pure Math., V. 49, Part 1 & 2. 
AMS, 1989 (ISBN: 0-8218-1485-0]. Part 1, x + 718 
pp, $99; Part 2, x + 366 pp, $58. Very interesting col- 
lection of papers illustrating the wide range of appli- 
cations of theta functions. Areas covered include in- 
finite analysis, statistical mechanics, Kac-Moody al- 
gebras, algebraic geometry, and number theory. The 
level of the papers range from the very technical to 
survey level. MPR 

Applications (Biological Science), S*(15), P, 
L*. Biometrical Interpretation: Making Sense of 
Statistics in Biology, Second Edition. Neil Gilbert. 
Oxford U Pr, 1989, viii + 146 pp, $21.95 (P). [ISBN: 
0-19-854250-X] This edition reflects availability of 
computers for analysis; quantitative genetics and 
population biology chapters are rewritten. Conversa- 
tional tone, emphasis on how to choose right analysis 
and make sense of the answer. Good supplemental 
reading for biologists who know some statistics. TH 
Applications (Biological Science), T(17-18: 1, 
2), S, P*, L*. Mathematical Biology. J.D. Murray. 
Springer-Verlag, 1989, xiv + 767 pp, $59. [ISBN: 0- 
387-19460-6] An excellent demonstration of how to 
use mathematics to model real biological phenomena. 
Treats population and epidemic models, biological 
oscillators and waves, diffusion mechanisms, and pat- 
tern formation (e.g., how the leopard got its spots— 
see page 436). Worth browsing through just for the 
non-mathematical discussions of butterfly wings, the 
Bubonic Plague, hallucinations, and the Lake Victo- 
ria Nile perch catastrophe. Well worth the price. BC 


Applications (Communication Theory), T(17) 
S, P. Digital Signal Processing: Applications to 
Communications and Algebraic Coding Theories. 
Salvatore D. Morgera, Hari Krishna. Academic Pr, 
1989, xx + 233 pp, $37.95. [ISBN: 0-12-506995-2] 
The intent of this book is to explore the design of 
computationally efficient digital signal processing al- 
gorithms over finite fields and the relation of these 
algorithms to algebraic error correcting codes. As- 
sumes familiarity with abstract algebra and alge- 
braic coding theory. Includes references but not ex- 
ercises. CEC 


Applications (Economics), S(18). Economet- 
rics of Structural Change. Ed: Walter Kramer. 
Stud. in Empirical Econ. Physica-Verlag (US Distr: 
Springer-Verlag), 1989, ix + 128 pp, $50.60. [ISBN: 
0-387-91357-2] A collection of nine papers on vari- 
ous aspects of structural change (coefficients chang- 
ing over time) in econometric time series models. 
The Chow test to look for a change in coefficients at a 
single point in time is analyzed. Random-walk mod- 
els, Bayesian analysis, sequential approaches, trans- 
formations, goodness-of-fit tests, and CUSUM tests 
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are looked at in various papers. A 20-page annotated 
bibliography containing about 400 entries concludes 
the book. JJ 


Applications (Engineering), P. Machine Vision 
for Inspection and Measurement. Ed: Herbert Free- 
man. Perspect. in Comput., V. 24. Academic Pr, 
1989, x + 320 pp, $39.95. (ISBN: 0-12-266719- 
0} Eleven papers on applications ranging from in- 
tegrated circuit inspection to identifying defects in 
glass to making precise measurements of the curva- 
ture of the human cornea. BC 


Applications (Engineering), P. Lecture Notes 
in Control and Information Sciences-132: Interac- 
tive Multi-Objective Programming as a Framework 
for Computer-Aided Control System Design. W.-Y. 
Ng. Springer-Verlag, 1989, xv + 182 pp, $24.90 (P). 
(ISBN: 0-38-51504-6] This research monograph pro- 
vides some updated results of research into the areas 
of Computer-Aided Design (CAD). It then reports 
on a new CAD technique called Interactive Multi- 
Objective Programming (IMOP) which enhances the 
cooperation and interaction between a human de- 
signer and the computer software. IMOP lets the 
designer have more control and more flexibility in 
providing the parameters to the design process. GMS 


Applications (Fluid Dynamics), P. Lecture 
Notes in Mathematics-1385: Relativistic Fluid Dy- 
namics. Eds: A. Anile, Y. Choquet-Bruhat. 
Springer-Verlag, 1989, v + 308 pp, $28.60 (P). 
(ISBN: 0-387-51466-X] Lectures from the first 1987 
session of the Centro Internazionale Matematico Es- 
tivo. Four main lectures on covariant theory of con- 
ductivity, Hamiltonian methods, covariant fluid me- 
chanics and thermodynamics, and relativistic plas- 
mas are followed by five shorter papers. JO 


Applications (Physical Science), P. Transport 
Theory, Invariant Imbedding, and Integral Equa- 
tions. Eds: Paul Nelson, et al. Lect. Notes in Pure 
& Appl. Math., V. 115. Marcel Dekker, 1989, xxvi 
+ 444 pp, $110 (P). (ISBN: 0-8247-8158-9] Twenty- 
seven papers from a conference honoring G. Milton 
Wing. The technical contributions are primarily for 
specialists, but Wing’s short paper, “Counting Back- 
ward from 10,” should be read by everyone. BC 

Applications (Physical Science), T(16-18: 1), 
S. Mathematical Problems from Combustion Theory. 
Jerrold Bebernes, David Eberly. Appl. Math. Sci., 
V. 83. Springer-Verlag, 1989, 177 pp, $34. (ISBN: 0- 
387-97104-1] The analysis of the underlying math- 
ematical models is not only purposeful and thor- 
ough, but reflects an esthetic appreciation not usu- 
ally found in texts on applied mathematics. MU 

Applications (Physics), P. Group Theory in 
Physics, Volume III: Supersymmetries and Infinite- 
Dimensional Algebras. J.F. Cornwell. Techniq. of 
Physics, V. 10. Academic Pr, 1989, xxii + 628 
pp, $55. [ISBN: 0-12-189805-9] Super-objects are, 
roughly, objects with a notion of grading added. This 
volume presents the theories of Lie and Poincaré su- 
peralgebras and supergroups, and of Virasoro and 
Kac-Moody superalgebras, as well as algebraic as- 
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pects of strings and superstrings. JO 

Applications (Physics), T(18), P. Aspects of 
Quantum Field Theory in Curved Space-Time. 
Stephen A. Fulling. Math. Soc. Stud. Texts, V. 17. 
Cambridge University Pr, 1989, ix + 315 pp, $49.50; 
$19.95 (P). (ISBN: 0-521-34400-X; 0-521-37768-4] 
Discusses wave equations and quantum fields in 
curved space-times. Begins with background in gen- 
eral quantum theory and in eigenfunction expansions 
for second-order elliptic differential operators. MLR 
Applications (Physics), T(18: 1), S, P. Finite 
Quantum Electrodynamics. G. Scharf. Texts & 
Mono. in Physics. Springer-Verlag, 1989, x + 224 
pp, $49.50. (ISBN: 0-387-51058-3] Written to avoid 
the pedagogical flaws in the standard approach to 
this topic, this high-level text will unfortunately be 
virtually inaccessible to all but the expert. MU 


Applications (Physics), P. Lecture Notes in Con- 
trol and Information Sciences-133: Stochastic Models 
for Laser Propagation in Atmospheric Turbulence. 
R.P. Leland. Springer-Verlag, 1989, vii + 145 pp, 
$24.90 (P). (ISBN: 0-387-51538-0] Mathematically 
rigorous analysis of function space models for laser 
propagation in turbulence. Simulation results sup- 
port theory. TH 


Applications (Physics), S(18), P. Lecture Notes 
in Mathematics- 1396: Quantum Probability and Ap- 
plications IV. Eds: L. Accardi, W. von Waldenfels. 
Springer-Verlag, 1989, vi + 355 pp, $33 (P). [ISBN: 
0-387-51613-1] Proceedings of the Year of Quan- 
tum Probability held at the University of Rome II, 
1987. MU 

Applications (Physics), T(16-17: 1, 2), L. An 
Introduction to Hilbert Space and Quantum Logic. 
David W. Cohen. Problem Books in Math. Springer- 
Verlag, 1989, xii + 149 pp, $39.80. [ISBN: 0-387- 
96870-9] This unusual text assumes little of the 
reader other than the maturity attained by upper- 
level undergraduate mathematics students. Begin- 
ning with a quick tour of measure theory and Hilbert 
space, the book culminates in leading the reader 
into the Einstein-Podolsky-Rosen Paradox. A very 
worthwhile intellectual excursion. MU 


Applications (Physics), P. Noise in Nonlinear 
Dynamical Systems, Volume 3: Ezperiments and 
Simulations. Eds: Frank Moss, P.V.E. McClintock. 
Cambridge University Pr, 1989, xvi + 278 pp, $75. 
[ISBN: 0-521-35265-7] Third of three volumes com- 
prising a collection of research papers by different 
authors which may be read individually. This last 
volume contains nine articles dealing with the exper- 
imental aspects of the field. KS 

Applications (Physics), T(17-18: 1, 2), S, P, 
L. Gauge Field Theories. Stefan Pokorski. Mono. 
on Math. Physics. Cambridge U Pr, 1989, xiii + 
394 pp, $34.50 (P). (ISBN: 0-521-36846-4] An ad- 
vanced textbook (with exercises) covering path in- 
tegrals, Feynman rules, renormalization, QED and 
QCD, chiral symmetry and anomalies, symmetry 
breaking, and supersymmetry (1987 hardcover edi- 
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tion, TR, January 1988). BC 


Applications (Physics), P. Conformal Invari- 
ance and String Theory. Ed: Petre Dita, Vladimir 
Georgescu. Perspect. in Physics. Academic Pr, 
1989, xiii + 557 pp, $59.95. (ISBN: 0-12-218100-X] 
Sixteen papers on conformal invariance in statistical 
mechanics and its relationships with string theory. 
Yu. I. Manin summarizes the situation well on page 
293: “The development of theoretical physics in the 
last quarter of the twentieth century is guided by a 
very romantic system of values.” BC 


Applications (Physics), T(17-18: 1), S, P. Prin- 
ciples of Statistical Radiophysics 3: Elements of Ran- 
dom Fields. S.M. Rytov, Yu. A. Kravtsov, V.I. 
Tatarskii. Springer-Verlag, 1989, x + 239 pp, $99. 
(ISBN: 0-387-17829-5] Basic definitions, properties, 
and forms of random fields. Intended as a text for 
radio physicists and engineers. Includes solved exer- 
cises. Note the price. BC 


Applications (Physics), T(18: 2),S, P. Anal- 
ysis, Manifolds and Physics, Part II: 92 Applica- 
tions. Yvonne Choquet-Bruhat, Cécile DeWitt- 
Morette. North-Holland (US Distr: Elsevier Sci- 
ence, 1989, xii + 449 pp, $40. [ISBN: 0-444-87071- 
7| Organized under the following topics: review of 
fundamental notions of analysis; differential calculus 
on Banach spaces; differentiable manifolds; integra- 
tion on manifolds; Riemannian manifolds; Kahlerian 
manifolds; connections on a principal fibre bundle; 
and distributions (1977 edition, TR, May 1979). MU 


Applications (Physics), T*(15), S, L. Space- 
time and Singularities, An Introduction. Gregory L. 
Naber. London Math. Soc. Stud. Texts, V. 11. 
Cambridge U Pr, 1988, ix + 178 pp, $39.50; $16.95 
(P). [ISBN: 0-521-33327-X; 0-521-33612-0] Hawk- 
ing’s Theorem (paraphrased): Any universe which 
is stably causal, deterministic, expanding, and has 
attracting gravity, must be singular. Here, the exis- 
tence of a singularity is indicated by being geodesicly 
incomplete. This book is single-minded: to lead the 
reader to understand and (maybe) prove Hawking’s 
Theorem, all else is sacrificed. The requirements 
are first courses in real analysis, linear algebra, and 
physics. This could be a fun book for some motivated 
mathematics majors. MPR 
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original and survey papers dealing with game- Alvin E. Roth 
theoretic modeling in the social, biological, and Two-Sided Matching with Incomplete 
mathematical sciences. Papers published are Information about Others’ Preferences 


mathematically rigorous as well as accessible to 
readers in related fields. The purpose of the journal 
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and application of game-theoretic reasoning. ISSN 0899-8256 All other countries: $422.00 
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Textbooks 


lf you wish to consider one of the following 


books for adoption in a course you are 
teaching, please write for a complimentary 
review copy on department letterhead to the 
address below. 


Computability: Computable 

Functions, Logic, and the Foundations 
of Mathematics 

by Richard L. Epstein and 

Walter A. Carnielli 

An undergraduate introduction to the mathe- 
matics of computable functions and logic 
using readings from Hilbert, Godel, Turing, 
Post, Church, and many others. 

1989. 297 pages. Casebound. 

ISBN 0-534-10356-1. 


APL with a Mathematical Accent 

by Clifford A. Reiter and William R. Jones 
Uses examples from the mathematical 
sciences to motivate the development of 
APL usage. Independent chapters allow 

the book to be used as the main text or a 
supplement. 

1990. 225 pages. Paperbound. 

ISBN 0-534-12864-5. 


Statistics: Theory and Methods 

by Donald A. Berry and 

Bernard W. Lindgren 

Undergraduate mathematical statistics text 
designed for students with relatively weak 
calculus backgrounds. Many of the applica- 
tions use real data. 1990. 690 pages. 
Casebound. ISBN 0-534-09942-4. 


Complex Variables, Second Edition 

by Stephen D. Fisher 

New edition of a popular undergraduate text 
that presents both the theory and applica- 
tions of complex variables. 

April 1990. 424 pages. Casebound. 

ISBN 0-534-13260-X. 


Real Analysis and Probability 

by R.M. Dudley 

Graduate text that offers an exposition of 
modern probability theory and of the inter- 
play between the properties of metric 
spaces and probability measures. 1989. 448 
pages. Casebound. ISBN 0-534-10050-3. 
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_| Advanced Books & Software 


Wadsworth & Brooks/Cole 


Software and Professional 
Reference Books 


Order our books and/or software using our 
toll-free number (800) 354-9706, or write to 
us at the address below. All our products 
are backed by a 30-day money-back 
guarantee. 


Graphical Aids for Stochastic Processes 
by Bob Fisch and David Griffeath 

Winner of the 1988 EDUCOM-NCRIPTAL Award 
for Best Mathematics Software 

Six-disk educational software package for 
the IBM PC that provides a colorful introduc- 
tion to probability and random processes. 
1988. For IBM PC/compatibles with color 
graphics card/320K RAM. 

ISBN 0-534-09042-7. $105. 


Statable: Electronic Tables for Statis- 
ticians and Engineers, Version 1.0 

by Cytel Software Corporation 

With just a few keystrokes, the tail area or 
percentage point you want for the twenty- 
five most commonly used statistical distribu- 
tions appears in a pop-up window. 
Eliminates hunting for books of tables, inter- 
polation, and the possibility of errors in cal- 
culation. Can run by itself or as a memory- 
resident program. 

1989. One IBM PC disk and 32 pages 
documentation. Paperbound. 

5 1/2" disks: ISBN 0-534-11964-6. 

3 1/2" disks: ISBN 0-534-11965-4. $39.95. 


A Dictionary of Real Numbers 

by Jonathan M. Borwein and 

Peter B. Borwein 

For researchers who have encountered 
various numbers computationally and want 
to know if these numbers have some simple 
form. This reference book lists over 
100,000 eight digit real numbers in the inter- 
val [0,1) that arise as the first eight digits of 
special values of familiar functions. January 
1990. 470 pp. Cloth. ISBN 0-534-12480-8. 
$69.95. 
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EXP: The Scientific Word Processor, Version 2.0 

Simon L. Smith 

The page you see here was written with EXP 2.0, printed at 
300 dpi, and reduced to size. This page shows the actual 
screen display you see while editing this document, not a 
special “page-preview” mode. The graph was drawn by a 
graphing program and then imported into EXP. 


A > IEXP remains the fastest and truest“what you see is what you 
ASS get” scientific word processor for the IBM PC, featuring 


automatic sizing and centering of many symbols; macros; 
windows; and automatic numbering. Version 2.0 includes the 
ability to import bit-mapped graphics that can be seen on the 
screen as they will print; spelling checker; increased macro 
capabilities; more automatically scaling symbols; automatic conversion of variables into italics; ability to 
export files to popular desktop publishing programs; and support for VGA/HP DeskJet Plus. 
February 1990. ISBN 0-534-11970-0. $295. Upgrade and site license prices available. 


Mainframe Mathematical Power for the 
Macintosh 

Maple, Version 4.2: Symbolic Computation for 
the Macintosh by The Symbolic Computation 
Group, University of Waterloo 

Maple puts symbolic computation at your 
fingertips at a price you can afford! Put Maple’s 
speed and reliability to work on Macintosh—SE, 
SE/30, Mac II series—even the Mac Plus. And 
Maple requires only one megabyte! 


Maple combines symbolic computation and 
numerical computation with two-dimensional 
graphics to give you more power per dollar than 
any other symbolic processing software. Maple’s 
unique library of more than 1500 built-in 
functions is fully extensible. The Maple package 
includes the Macintosh interface, complete Maple 
libraries with source code and help files, First 
Leaves for the Macintosh (a tutorial), and the 
Maple Reference Manual. 

1989. ISBN 0-534-10218-2. $395. 


Site license prices available. 


A Powerful Software Tool for the IBM PC that 
Solves Differential Equations Fast 

PhasePlane: The Dynamical Systems Tool, 
Version 3.0 by Bard Ermentrout 

This interactive program for solving up to 20 
ordinary differential equations, difference 
equations, and differential-delay equations allows 
users to e create self-running and interactive 
demos e choose from seven different numerical 
methods, including Gear’s method for “stiff” 
equations e draw nullclines, direction fields, and 
global flows for two-dimensional models e create 
3-D graphics, perspective transformations, and 
color coding of trajectories by velocity or 
amplitude. PhasePlane includes a built-in 
scientific calculator and plotter/LaserJet support. 
January 1990. ISBN 0-534-12894-7. $42.00. 


Site license prices available. 


For U.S. orders, call (800)354-9706. 

To request a copy of our Advanced Books & 
Software catalog, please write: 

Wadsworth & Brooks/Cole 

Advanced Books & Software 

511 Forest Lodge Road 

Pacific Grove, CA 93950 
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MULTIVARIATE STATISTICAL METHODS, 3/e 
Morrison 
A BOOK OF ABSTRACT ALGEBRA, 2/e 
inter 
CALCULUS: AN INTEGRATED APPROACH 
Small/Hosack 
For more information, contact your local sales representative or write: McGraw-Hill 
Publishing Company, Comp Processing and Control, P.O. Box 448, Hightstown, NJ 08520. 


Hl 


Fields & From our mail... 
Ve he Success 
O p erators "Another Success from Lascaux Graphics! " 
M.E. City College of N Y 
S urfa ces. "Excellent Program! ! ..An exceptional educa- 


tional aid," D.W.-Dow Chemical 
Vector 


"Just what I've been looking for!" 


o D.B. Olivet College 
Field S. . "Keep up the valuable programs at decent ices " 
Differential DW 


"Amazing Software - write more.” Or. RB. TNU 


Operators. 


In two or animation capabilities, we would not have attempted 
thr ee such 0 task.” M. T. Univ. of Wisconsin 


"Without your program's powerful layering and 


° ° "A super product * 
dimensions. Recreational & Educational Computing 


And it's animated! |For PC or Macintosh 
Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


ay o red copies of these tex 
WIN represenans or call of 9 323-45 
(In Ca 800 368-4178 in ontario 6-298 1588:) 
aie 60430 


SPOS 
Random Walks and 
Electric Networks, 


by J. Laurie Snell and Peter Doyle 


xiii + 159 pages. Hardbound 
List: $25.00 MAA Member: $19.00 


In this newest addition to the Carus Mathematical Monographs, the authors 
examine the relationship between elementary electric network theory and ran- 
dom walks, at a level which can be appreciated by the able college student. We 
are indebted to them for presenting this interplay between probability theory 
and physics in so readable and concise a fashion. 

Central to the book is Polya’s beautiful theorem that a random walker on an 
infinite street network in d-dimensional space is bound to return to the starting 
point when d=2, but has a positive probability of escaping to infinity without 
returning to the starting point when d = 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
networks, establishing the connection between the electrical concepts of cur- 
rent and voltage and corresponding descriptive quantities of random walks re- 
garded as finite state Markov chains. Part II deals with the idea of random walks 
on infinite networks. 


Table of Contents 


Part I: Random Walks on Finite Networks 
Random Walks in One Dimension 
Random Walks in Two Dimensions 
Random Walks on More General Networks 
Rayleigh’s Monotonicity Law 


Part II: Random Walks on Infinite Networks 
Pélya’s Recurrence Problem 
Rayleigh’s Short-Cut Method 
The Classical Proofs of Polya’s Theorem 
Random Walks on More General Infinite Networks 
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Functions and Graphs 
Second Edition 
Bernard Kolman 

Arnold Shapiro 
Hardcover, 1990. 
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INTRODUCTORY 
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Second Edition 
Kenneth P. Bogart 
Hardcover, 1990. 
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CALCULUS 


Robert Seeley 
Hardcover, 1990. 
Just Published! 


LINEAR ALGEBRA 
Third Edition 
Michael O’Nan 
Herbert B. Enderton 
Hardcover, 1990. 

Just Published! 


EXPLORING 

SMALL GROUPS 

A Tool for Learning 
Abstract Algebra 
(software) 

Ladnor D. Geissinger 
Disk and Manual, 1989. 
Just Published! 


AA STUDIES IN 
MATHEMATICS 


Studies in Combinatorics 


Gian-Carlo Rota, editor 
Volume #17, MAA Studies in Mathematics 
272 pp., 1978, Hardbound, ISBN-0-88385-177-2 


List: $27.50 MAA Member: $21.50 


“This is a gem of a book. In layout and size it brings back memories 
of the classic Combinatorial Mathematics by H./. Ryser. The index is 
very good and the editor is to be warmly congratulated on bringing out 
a book of enormous value and uniformly high standard. It is the sort 
of book which could be used with great profit as the basis of a weekly 
working seminar on combinatorics. | recommend it unreservedly.” 


D.J.A. Welsh in Bulletin of the Institute of Mathematics and its Applications 


This excellent book in the MAA Studies in Mathematics series contains seven papers 
by prominent researchers in combinatorics. Although written by different authors, the 
papers have the unity of chapters in a monograph. Together they present a fairly com- 
prehensive exposition of some recent developments in combinatorics. Much of the material 
has application outside mathematics to computer science, operations research and net- 
work and circuit theory. Begun at the time of Euler, combinatorics has become one of 
the most active fields in mathematics. A significant feature of Studies in Combinatorics 
is its accessibility to any mathematician qualified to teach at the college level, as well 
as to a great many students. This volume is also accessible to a wide scientific public 
outside mathematics. 
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Prindle, Weber and Schmidt 
Celebrating our Silver Anniversary in 1990! 


The Prindle, Weber and Schmidt Series in 
Mathematics is both pleased and honored to 
announce the celebration of its Silver Anniver- 
sary. Over the past 25 years we've dedicated 
ourselves to leading the industry in innovative 
texts and responsive service. Our commitment 
to the discipline and its changing needs is as 
strong as ever and even today we are looking 
toward the future with our eyes and ears wide 
open to new ideas and trends. We look forward 
to servicing you, the mathematics educators of 
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An Alternative Proof of the Lindemann-Weierstrass Theorem 


F. BEUKERS, J. P. BEZIVIN, P. ROBBA 


Dedicated to the Memory of Philippe Robba 


FRITS BEUKERS: born in 1953. University education in Leiden, Netherlands. 
Currently teaching at the State University of Utrecht. Research interests are 
number theory and arithmetic properties of linear differential equations. 


JEAN-PAUL BEzIVIN: educated at the Ecole Normale Supérieure and the 
Universities of Paris VI and VII. He currently teaches at the University of 
Paris VI. Research interests are p-adic analysis, linear recurrent sequences 
and arithmetic properties of formal power series. 


PHILIPPE RoOBBA: born in 1941, died on October 12, 1988. Doctorate degree 
at the University of Paris in 1972, and professor at the University Paris XI 
from 1975 until his untimely death. Domain of research: p-adic analysis, 
mainly p-adic differential equations with emphasis on p-adic cohomologies. 


Introduction. In December 1987 J. P. Bézivin and Ph. Robba found a new proof 
of the Lindemann-Weierstrass theorem as a by-product of their criterion of rational- 
ity for solutions of differential equations. Let us recall the Lindemann-Weierstrass 
theorem, to which we shall refer as LW from now on. 


Let a,,..., @,, b,,..., b, be algebraic numbers such that the b; are all nonzero and the 
a, are mutually distinct. Then 
be" + bye? +--+ +he™ #0. 

It is well known that the transcendence of a7 follows from LW in the following 
way. Suppose, on the contrary, that 7 is algebraic. Then so is 7/— 1 and LW now 
implies that e™vV-li.¢ 0, which is certainly not true. Thus we conclude that 7 is 
transcendental. 

The usual proof of LW is essentially due to Hilbert and has been polished by a 


number of authors. One such version can be found in [2, Ch. XI] or in [4, Ch. J], [3]. 
The new proof of Bézivin and Robba looks totally different. It can be considered as 
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a direct consequence of their criterion of rationality for solutions of linear differen- 
tial equations. 

The Bézivin-Robba criterion is based on a theorem of Polya-Bertrandias, which is 
far from easy and relies heavily on p- adic analysis. We refer the interested reader to 
[1]. In March 1988 F. Beukers found that the use of the Polya-Bertrandias criterion 
is much too heavy and can be avoided in a very elementary way. The result is a new 
proof of LW which is elementary and can compete with the usual one in shortness 
and simplicity. There is always a possibility that the similarity between the proofs is 
stronger than one would expect at first sight. In fact, very soon after a first draft of 
this paper was written (May 1988) Yu. Nesterenko pointed out to us that the 
numbers v,(k) which we use are equal to the integrals 


é CO 
> ber f e7*" k(x — a )* «++ (x — ,)* dx 
j=1 


a) 


which are used in the Hilbert proof. In spite of such similarities we feel that the 
arguments of our proof are nice enough to present in front of a wider audience. We 
would like to thank the referee for several improvements upon our presentation. 


THEOREM. Let b,,...,b,, ,...,4,€@ such that b, #0 Wi and the a; are 
mutually distinct. Then 


bet +--+ the #0. 


Proof. Consider the Taylor series expansion 
ie. @) 
n 
bet +--+) +hewx*= VP x"; 
n=0 [+ 
where, clearly, 


t 
u,= >, b,a”. (1) 
i=1 


Put (X — a,)---(X — a,) = X'- a,X'' — --- —a,. Clearly, for any i = 1,..., 2 
and anyn€& Z, 


ttn 


_ ttn—-1 a n 


By taking suitable linear combinations and using (1) it follows that 


u +t QyUy+t-1 tose +a,u,. (2) 


n 


Without loss of generality we may assume that u, € Q, Wn. If not, then consider 
the product 


T](o(d,)e%* + +--+ +0(b,)e%*) 


oO 


taken over all o © Gal(Q(b,,..., 5,, a,..., a,)/Q), which, after evaluation, again 
acquires the form 


1%; 
> ble*, 
i 


where now the sets {b/}, {a{} are Galois-stable. This implies that the correspond- 
ing numbers a’ and uw’ are rational. So from now on we assume u, © Q, Wn and 
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a, Q (i=1,...,¢). Let D be a common denominator of the a;. Put A = 
max(1, |a,|). After multiplication with a suitable integer, if necessary, we may 
assume Uy,..., U,_, © Z. Hence, using (2) recursively, 
D"u, € Z, (3) 
and by (1), 
jul ~ c, A” 
for some c, > 0 and all n> 0. Now suppose that b,e% + --- +be% =0, or, 
equivalently, 
ou 
y ~=0 
r=0 
Put 
nu 
v,=n! ), — 
r=0 
and notice 
" Uu, co ul, C1 00 A’ qnt} 
alan] Sl—m) & Ble Ye. 
" of! wert] at+1i,27,(r=n+1)! n+1 


If we had A = D = 1, like in the high school proof of e € Q, inequality (4) gives us 
a contradiction since we have both v, € Z and |v,| < c,/(n + 1), 1.e. v, = 0 for all 
sufficiently large n, in other words Lfu, X” is a polynomial. In our general case a 
similar principle works. 


Claim: 


> v,X" € Q(X). 


n=0 
Assuming the claim we proceed with the proof. Define 


Lo @) 
v(X) = Yi vx". 
n=0 


Notice that 


Pn Pama Un 
nl (n _ 1)! al OT UD, — Av, = u,- 
So, 
ore) ore) 
» (u, 7 nv, 1) X" _ » u,X". (5) 
n=0 n=0 


Using (1), the right-hand side of (5) is seen to be 


t 


one) b, 

Yu, x"= Ys 

n=0 a 
whereas the left-hand side equals 


d ; d 
v(X) - X= (X0(X)) =(1-— X)v(x)-X ay). 
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So (5) becomes 


f b; , 4 

Lu(X) py Doak L=-X xt (1 — X). (6) 
By the claim we know that v( X) € Q(X) and so the non-zero poles of “#u(X) 
have order at least two. However, the right-hand side of (6) has only simple poles. 
This contradiction proves our theorem, since the assumption b,e"1 + --- +b,e% = 0 
has turned out to be untenable. O 
It now remains to prove our claim. We first observe that, as v is a solution of the 
differential equation (6), if v is a rational function then its poles must be at the 
points 1/a,. Therefore we expect that there exists an integer k such that (1 — a,X 
—--- — a,X*)*v(X) is a polynomial. 


DEFINITION. For any k, n& Z, we define v,(k) as coefficient in the formal 
power series 


y. v,(k) X" = (1—4a,X— --- —a,X')" ¥ 0, X". 
n=0 n=0 


For later use we also note that 
v(k +1) =0,(k) -— apv,_,(k) —--- — a,v,_,(k) foralln>t,k>0. (7) 


LEMMA 1. Let C =1 + |a,| + --- +la,|. For alln > kt we have 
i) |v,(k)| < c,A"CE 

u) Dv (kK)EZ 

ii) k! divides D"v,(k). 


Proof. The first two assertions follow easily by induction on k from (3), (4) and 
(7). The third assertion can be shown as follows. Write 
v, =u,t+nu,_ +--+: +n(n—-—1)---(n-—k+2)u,_,4,+,, 


where w, = n!L° u,/r!. Notice that D” “w, € Z and k! divides D”~*w,. Con- 
sider the power series 


vo(X)= Viv Xx" w(X)= Y w Xx". 
n=0 n=0 
Then 
v(X)-w(X)= Di fu, + nu,_) +--+ tn(n-1)---(n-—k + 2)u, 444} X”. 
n=0 


For any 0 < r < k — 1 we observe that 


oO t oO 
Yin(n—1)-+-(n-rtl)u,_,xX"= % Y r'(") bjanX” 
n=0 i=1 n=0 
t 
= r! ba) xX’ ———___ 
X / (1- a,x)" 
P(X) 


(l—a,X—----—a,X)" 
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where P(X) is a polynomial of degree < ¢t(r + 1). Hence 


P(X) 
v(X) — wW(X) = ———_—_——___, 
(1 —a,X-—--- — tx’) 
where P(X) is a polynomial of degree < tk. Hence v,(k) = w,(k) Wn > kt, where 
yw, (k)X" = (1 — aX — --- — a,X)“w(X). 
n=0 


Since k! divides D”~“w,, we see that k! divides D"w,(k) = D"v,(k) as asserted. 


Proof of the Claim. It is sufficient to prove that L°_ v,(k)X” € Q[X] for some 
k &N. From the Lemma it follows that if v,(k) # 0 "and n > kt, then 


k! <|D"v,(k)| < c,(AD)"C*. 
Hence, if k! > c,(AD)"C* and n> kt then v,(k) = 0. Choose ky so large that 
k! > c,(AD)'"'C* Wk > ky. Then 
v,(k)=0 forall k > ky, kt <n < 10kt. (8) 


This situation can be pictured as follows: If a point (n, k) falls in the shaded region, 
we have automatically v,(k) = 0 according to (8). So, finally, by (7) and induction 
on n — 10kt, it follows that v,(k) = 0 for all (n, k) in the infinite triangular region 
ky <k < n/10t. Thus we conclude that u,(ky) =0 Wn >kot and hence 
ve o¥,(k)X" € Q[X], which proves our claim. 


kf 


\ 


Es 
n= kt A SS 


\\ 


if 


n= 10kt 
ko cs 


*f 
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1. Introduction. The result known as Lucas’s Theorem first appeared in 1878 in 
The American Journal of Mathematics [1] (more accessible sources are [2], p. 271, 
items 76 and 77, and a complete translation in [3]), and reads as follows. Let p be a 
prime, and for integers n and k set n = Ln,p', k = Lk;p', with 0 < n,, k; < p; 
then 


(i) = r(; (mod p), (1.1) 


where we set ( = Oif k, > n,. Proofs other than Lucas’s have been given (e.g., [4]), 


A, 
the congruence has been generalized in various ways (e.g., [5], [6]), and many related 
results have been elaborated (see [7] and its many references for an extensive 
discussion of this and related topics). 

There seem to be, however, no corresponding developments for extended Pascal 
triangles T,, (as defined below), and in Section 2 we consider the extension of 
Lucas’s congruence to the entries C,(n,k) of T,; both statement and proof are 
straightforward, although the form is more awkward to use than (1.1). Section 3 
then uses some results related to the extension to investigate, as in [4], [7], some 
divisibility properties of C,,(n, k); in particular, when is C,,(n, k) # 0 (mod p) for 
k in the range 0 < k < (m—1)n? 

The extended Pascal triangles arise, by analogy with the ordinary Pascal triangle, 
as the (left-justified) arrays of the coefficients in the expansion of (1 + x + x? 
+--+ +x™~1)", That is, the array 7,, has in row n, column k, the number 
C,(n, k) defined for m,n, k = 0 by 

(m—l1)n 
(Ltxtx2tes-tx™ "= Yo C(n,k)x*. (1.2) 
k=0 
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Omitting calculational details, we list below some properties of these arrays for 
m,n,k > 0. 
a) The coefficients in T,, may be obtained from the relations 


C,, (0, k) — Box 
m—-1 
CA(nt+1,k)= ¥ C,(n,k—-J). 
j=0 


b) It follows then that 7 is the all-zero array except for C,(0,0) = 1, and all the 
rows of 7, consist of a one followed by zeros. For m > 2, T,, is the array whose 
n = 0 row is a one followed by zeros, whose n = 1 row is m ones followed by zeros, 
and any of whose entries in subsequent rows is the sum of the m entries just above 
and to the left in the preceding row. 

c) For m > 0 there are (n(m — 1) + 1) nonzero entries in row n of T,,, and the 
symmetry relation among these is 


C,(n,k)=C,(n,(m-1)n-—k), Osks(m-—1)n. 


d) T, is, of course, the Pascal triangle, and C,(n, k) = ("). The C,(n, k) can be 
expressed in terms of ordinary binomial coefficients, 


cated = BEM A") 


but for present purposes it is preferable to think of 7,, as having an independent 
existence and a triangle of its “own,” which could be tabulated as easily and 
familiarly as T,. 

e) As an example, here are the first few rows of T;: 


k 


In the following sections, letters d,e,k,m,n,r,s, and so on, will denote 
nonnegative or positive integers as specified by the context, and p will denote a 
prime. We also adopt a modification of Singmaster’s notation M(n, p) [7] by 
defining N,(n, p) as the number of values of k for which C_(n, k) # 0 (mod p). 


2. The Extension of Lucas’s Theorem. The proof given below is direct, and only 
depends on a simple congruence and the fact that the entries in the nth row of T,, 
are the coefficients in the expansion (1.2). 


THEOREM 1. Let p be a prime, and let 
n=nyt+tnyprt-:::+np’, O<n;, <p 
k=k jt+kpt---t+k,p'", O<k,;<p, Osk<(m-1)n. (2.1) 
Then 


Ca, k) = LT] Ga(aps,)(mod p) (2.2) 


(Sossess S,) i=0 
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where the sum is taken over all (r+ 1)-tuples (So, S,,...,5,) Such that i) 5S) + 
s;p +++: +s,p" =k, and ii) O<s;,<(m-—1)n,; if k is not representable in this 
form then certainly C,,(n, k) = 0 (mod p). 
Proof. 
(m—1)n 


Y C(n, k)xk = (Lt xt 0) $x™ ly" 
k=0 


; 
T]Q@+xt4+--- 4x" 3)" 
i=0 


r n, 
[] (1 + x? + x2? +--+ +x("-D2") “(mod p) 
i=0 


s =0 


t 


r (m—1)n, 
TH » Calon sx 


(m—1)n r 
» ‘ » [1 cans) x4 


k=0 5,) i=0 


Pe | 


and the result follows by equating coefficients of x“. All the steps here involve 
straightforward replacements, except the one involving the congruence in the third 
line, which may be proved by induction. Note that for m = 2, conditions (i) and (ii) 
imply that the expression of k in terms of 5o,..., 5, 1S just the unique base-p 
representation of k(t =r, s; = k;,0 < k; < p) and so there is only one term in the 
sum of products, which in this case reduces to the original form (1.1). 


Example. C,(8, 4) = 2 (mod 3), the calculation being as follows. Using (2.2), with 
n = 8 = 22, base 3 (so that ny = 2, n, = 2), 


C;(8,4)= [1 ,(%,, 5;)(mod 3), 


(sy, 5,) = 9 


where the sum is taken over all (59, 5,) such that sy + 35, = 4, and 0 < 5p, 5, < 4, 
which means here that (so, s,) takes on the values (1,1) and (4, 0). Thus, 


C;(8, 4) =C,(2,1) - C,(2,1) + C,(2,4) - C,(2,0) 
=2-24+1-1 
=14] (mod 3). 


=2 


Even for m,n small there may be many terms in a sum of products such as that 
above, because the conditions (i), (ii) may allow a given value of k to be represented 
in many ways. 

It is also possible that the conditions do not allow of any representation of k of 
the specified form. 


Example. C;(5,7) = 0 (mod2) because k = 7 here is not representable in the 
required form. That is, with n = 5 = 101, base 2, we have ny = 1, n, = 0, n, = 1, 
so that O< 5s) <2, 5, =0, O< 5, < 2, but with these restrictions k =7 is not 
representable in the form sy: 1+ 0-2 +s, -° 27, which can only assume the values 
0,1, 2,4, 5, 6, 8, 9, 10. 
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It may be worthwhile to point out that even though the conditions (1), (ii) may 
not allow some values of k to be represented, as in the previous example, the full 
range of k is allowed for, since with n =n )+n,p+--: +n,p", we always have 


O<sk=S8),+5pt+-:: +58,p’ 
<(m—-1)(no +n pt ---: +n,p") = (m—1)n. 


The coefficients k, in the strict p-ary representation of k really play no role in all 
this for m > 2, being superseded by the s;,. 


3. Related Results. In this section, we take up the question of the number of 
values of k for which C,(n, k) # 0 (mod p); the main results are exact determina- 
tions of N_(n, p) for the cases m = p and m= p®, p > 2. Among the things one 
might want to know: what the minimal and maximal values of N_(n, p) are and 
where they occur; -and a formula, if possible, for N(n, p). These questions are 
already settled in the case of m = 2 ({7], results and references, sections 6,7) and the 
main results are as follows. 

1) With n=ny +nypt+--: +n,p’, 


N,(n, p) = IT (x, + 1). (3.1.1) 

2) Nj(n, p) = 2, and N,(n, p) = 2 if and only if n = p*®. (3.1.2) 
3) N,(n, p) <n+1, and N,(n, p) =n +1 if and only if n = ap’ — 1, with 
l<ac<p. (3.1.3) 


These results might be derived by ad hoc arguments, but are usually obtained as 
consequences of Lucas’s Theorem. 

We briefly note that these results could also be argued from the point of view of 
the extension of Lucas’s Theorem, since, for example, the number of valid (r + 1)- 
tuples (so, 5,,..., 5,) occurring in (2.2) and satisfying conditions (1), (it) is 

[]((m-1)a, + 1), 

i=0 
which for m = 2 gives the formula (3.1), and is in this case just the number of 
C,(n, k) not congruent to zero (mod p); the other results can then be derived from 
this. 

To evaluate N,(n, p) we first note that when m is a prime, p, the polynomials 
1+x+x«7+---+x?7! in (1.2), whose powers give rise to the corresponding 
Pascal-T triangles T,, are just the cyclotomic polynomials of prime index, ®,(x) [8, 
p. 76 ff.] (we will also need the initial cyclotomic polynomial ®,(x) = x — 1). 
Further, the fact that we want to consider congruences, mod p, puts us in the 
situation of working with these cyclotomic polynomials over a finite field of 
characteristic p, so that we have available the following results, usually used in 
factoring polynomials over finite fields. 


THEOREM [8, p. 79]. If p is a prime and gcd(n, p) = 1, then fore > 1 
®,, «(x) = ®,,(x?" ’) (3.2) 
®,, (x) = (®(x))? "ina field of characteristic p. (3.3) 


Using (3.3) with n=1, e=1, we have that ®,(x) = ®(x)?"' = (x — 1)?7! 
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(mod p), 1.e., 
(L+x+ x24 --- +x)" = (-1)?°- "(1 — x)?7”" (mod p). 


Now, neither the sign in front, nor the fact that we have powers of (1 — x) rather 
than powers of (1 + x), has any effect in this context on whether a given coefficient 
in the expansion is congruent to zero (mod p). What this means is that, since the 
expression on the right is binomial, we can use the formula (3.1.1), appropriately 
adjusted, to evaluate N,(n, p). That is, N,(”, p) is given by Nj((p — 1)n, p). We 
summarize this discussion in the following theorem. 


THEOREM 2. Let the p-ary expansion of (p—1)n be aygt+a,p+°:-: +a,p’. 
Then in T, the number of entries C,(n, k) in line n not congruent to zero (mod p) is 
given by (cf. 3.1.1) 

N,(n, p) = TQ + a;). (3.4) 

Some other aspects of N,(n, p), which parallel known results for N,(n, p), are 
given in the following corollaries. 


COROLLARY 2.1. For n > 1, the number of entries not congruent to zero (mod p) 
in any line n of T,, is at least p, and equals p exactly when n is a power of p. That is 
(cf. 3.1.2), 


N,(n,p)=p, and Nn, p)=p_ iffn=p’. 


Proof. We know that N,(n, p) is given by (3.4), with the a; being the digits in 
the representation of (p —1)n. But these digits must add up to a multiple of 
(p — 1), and so in the expanded form of (3.4) we have 


[] (+ 4;)=1+ (a) + --- +a,) + (sums of products of the a, ) 
i=0 


= 1+ s(p—1) + (nonnegative terms), 


from which we can see that we always have N,(n, p) = p, and that N,(n, p) = p if 
and only if one of the a, is (p — 1) and the rest are zero. But then the correspond- 
ing digits in the p-ary expansion of n itself are, respectively, 1 and the rest zeros, 
which means n is a power of p. Thus, N,(p, 1) is minimal iff n is a power of p. 


COROLLARY 2.2. For n => 1, the number of entries not congruent to zero (mod p) 
in any line n of T, is at most (p — 1)n + 1, and equals ( p — 1)n + 1 exactly when n 
is of the form ( p’ — 1)/(p — 1). That is (cf. 3.1.3), 


N,(n, p) $(p-1)n+1, and 
N,(p,n) =(p-1)n+1 iffn=(p'-1)/(p - 1). 


Proof. The inequality is trivial, since there are only ( p — 1)n + 1 entries in line 
n. For equality, we must have N,(n, p) = N((p — 1)n, p) =(p — In + 1. But 
then we know from [7, Th. 14] that the argument of N, must be of the form ap’ — 1, 
with 1 < a < p. That is, (p — 1)n = ap’ — 1, or n= (ap’—1)/(p — 1). But n 
here is an integer, so that a= 1. Thus, N,(n, p) is maximal iff n = (p’— 1)/ 
(p — 1). 
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COROLLARY 2.3. For Pascal-T triangles T,,, where m is a power of a prime p, we 
have 


N,«(n, p) = N,(n(p* — 1)/(p — 1), p) 
= N,(n(p* — 1), p). 


Proof. Direct calculation using (3.2) shows that 


(Fa) 

po" (mod p), 
from which the result follows. Maximal and minimal values of Ny can then be 
obtained as in corollaries 2.1 and 2.2. 


(Ltxtxrtte- +x? Tt) = (Ltxt x74 --- +x?7}) 


Example. For the rows of 7; (mod 5), Theorem 2 and Corollaries 2.1, 2.2 give the 
following. 

a) In row n = 10, the number of entries C,(10, k) not congruent to zero (mod 5) 
is N;(10,5) = 8 since the base-5 representation of 4 - 10 = 40 is 130, so that n, = 0, 
n, = 3, n, = 1, and (3.4) gives (1 + 0) + 3)(1 + 1). 

b) For n > 1, N,(n,5) = 5, and N,(n,5) = 5 iff n = 1,5,25,125,.... 

c) Forn > 1, N(n,5) < 4n + 1, and N.(n, 5) = 4n + 1 iff n = 1,6, 31, 156,..., 
since these are the values of (5” — 1)/4, r= 1,2,3,..., the values whose base-5 
representations are all 1’s. 


Lastly, we also have the following density result, where the density here is 
“triangular,” as defined below; cf. also [9] in this connection. 


COROLLARY 2.4. In T,, “almost all” entries C,(n, k) are divisible by p. 


Proof. Let m be an integer such that p° < (p — 1)m < p*t!. Then no, (Nn, P) 
is the number of C,(n, k) not divisible by p, and lying in the “triangle” consisting 
of rows n = 0 through n = m of T.; the total number of elements in this triangle 
will be s,, = (1/2)(m + 1I[(p — 1)m + 2]. Now, 


m m (p—l)m 
UN (n,p)= VN((p-1)n,p)s Y Nn, p) 
n=0 n=0 n=0 


sr(o+ 9 | ((p — 1)m)((p - 1)m +1), 


where the last inequality follows from Th. 1(b) of [9]. But then (L7_9N,(1, P))/S,, 
goes to zero as m (and thus e) goes to infinity. 

In conclusion, we remark that results corresponding to those of Theorem 2 and 
its corollaries are likely to be less straightforward for T,, (mod p), m composite, or 
even T, (mod p), p,q distinct primes. It is true, for example in 7; (mod 2), that 
N,(n,2) > 3 for any n > 1, since we always have C,(n,0) = C,(n,2n) = 1, and it 
may be shown by induction, using the recurrence relation for the coefficients, that 
C;(n, n) = 1 (mod 2). It is also true that N,(n,2) = 3 exactly when n is a power of 
2. On the other hand, N,(n, 2) does not take on the value 2n + 1 for any n > 1, as 
is shown by the following brief argument. The patterns of the first four elements in 
rows n = 2 ton = 5 of T, (mod 2) are 1010, 1101, 1000, and 1110. After that the 
cycle of patterns is repeated because the first four entries in a row are determined by 
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the first four entries of the preceding row; hence, every row contains at least one 0. 


Even so, the present discussion extends much of what is known for T, to a whole 


range of “triangles”, obtaining some fairly obvious results but also some which 
might not have been predicted from 7,, and leaving evident several interesting open 
questions. 


The authors are indebted to the referee for a number of helpful suggestions. 


REFERENCES 

1. Edouard Lucas, Théorie des Functions Numériques Simplement Periodiques, American J. Math., 1 
(1878), 184-240, 289-321. 

2. L.E. Dickson, History of the Theory of Numbers, vol. I, New York, Chelsea, 1952. 

3. E. Lucas, The Theory of Simply Periodic Numerical Functions (translated by S. Kravitz, D. Lind), 
The Fibonacci Association, Santa Clara, 1969. 

4. N. J. Fine, Binomial coefficients modulo a prime, Amer. Mathem. Monthly, 54 (1947) 589-592. 

5. G.S8. Kazandzidis, Congruences on the binomial coefficients, Bull. Soc. Math. Grece (NS), 9 (1968) 
1-12. MR 42, #182. 

6. David Singmaster, Notes on binomial coefficients I, J. London Math. Soc., (2) 8 (1974) 545-548. 

7. , Divisibility of Binomial and Multinomial Coefficients by Primes and Prime Powers, 18th 
Anniversary Volume of the Fibonacci Association, 1980, 98-113. 

8. R. J. McEliece, Finite Fields for Computer Scientists and Engineers, Kluwer Academic Publishers, 
Boston, 1987. 

9. David Singmaster, Notes on binomial coefficients III, J. London Math. Soc., (2) 8 (1974) 555-560. 


There are three kinds of mathematicians: those who can count, and those who can’t. 


(this is due to Marcia Sward, Solomon A. Garfunkel, 
A. G. Howson,..., and possibly others) 


The Remainder in Taylor’s Formula* 
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We present here a mean-value theorem that generalizes the Taylor-Lagrange 
formula. The result arises in a natural way when one studies the asymptotic 
behavior of the remainder term of the formula. As an application of our result, we 
derive several numerical schemes to approximate the solution to initial-valued first 
order differential equations. 

To begin, let us recall the Taylor-Lagrange formula. For convenience, we work in 
intervals of the form [0, x]. 


THEOREM. (Taylor-Lagrange formula). If f is continuous in [0, x], f‘"~?(O) exists 
and f‘"(t) exists in (0, x), then there exists a & in (0, x) such that 


(3) = Pyalx) + FCB). (1 


where 


n-1 


P,-1(x) = f(0) + f’(0)x + £0) to +f O Dy 


is the Taylor polynomial of order n — 1 for f about 0. 


Our first result concerns the asymptotic behavior as x — 0* of the number é in 
the theorem above: 


THEOREM 1. With notation as in the previous theorem, if f"*(t) exists in [0, x], 
is continuous from the right at t = 0 and if f"*)(0) # 0, then 


1 
lim — = . 
x->0+ X n+1 


Remark. The assumptions in this theorem imply that the number &€ is uniquely 
determined for x small enough. 


*This work was supported in part by the Byron K. Trippet Research Stipend at Wabash College. 
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Proof of the theorem. In (1), we apply the Mean-Value theorem to f‘”(£): there 
is a number 7 in (0, €) such that 


f(x) = Par) +f (E) 
= py, a(x) + [f(0) + fOP (7) E] 5 


= pylx) + SOME. 


On the other hand, by the Taylor-Lagrange formula 


n+1 


f(x) = p,(x) POY ewer for some o in (0, x). 


Therefore, 


Since f‘"*t) is right-continuous and nonzero at t = 0, the conclusion follows. 


“ 


xX e 
With notation as above, f ”(€) ~ can be viewed as the error made in approxi- 
n 


mating f(x) by p,_,(x). In view of Theorem 1, one would expect that replacing 
by x/(n + 1) and approximating f(x) by 
x” 
n+1 hn 


would result in an approximation to f(x) of order at least that of p,(x). In fact, it 
turns out that this approximation is of the same order as p,,,(x). We no longer 
require that f("*)(0) # 0. 


Ao(x) = Pal) + £( — 


THEOREM 2. If f(*%(t) exists and is continuous in [0, x], then there exists a & in 
(O, x) such that 


xnt2 


M(x) = 42) + lO Gyr 


= PrlX) +15) an ae CESMMCICES IE 


Proof. Using Taylor’s formula with integral remainder, we can write 


f(- Xx | = f(0) + fO"P(O)- ~ 7 4 [perro] ~ ; _ | dt. 


Therefore, 


Aol) = Preslx) + [S(O] =] 


n+1 
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but 


Fx) = Praal) + Tay [POM = Oat 


It follows that 


1 * +1 
A(x) = Aolx) = Gay LOM = at 


_ = Pripemn| 4 _ (| dt 
= frye Se — ~|— — ] dt 


oats ee iy de, 


Now, let 


Then g(0) = 0 and g’(t) > 0 in (0, x], so that g(t) > O in that interval. Therefore, 
we can apply the Mean-Value theorem for integrals to deduce that there are 
numbers €, and &, in (0, x) such that 


1 aa 
f(x) — Ao(x) = arene fort sls) a 


fo" (£,) (x - y"*? dt. 


n+1 


"th + 5 


Finally, an application of the Intermediate Value Theorem concludes the proof. 

In view of the way in which the statement of Theorem 2 follows naturally from 
Theorem 1, it should be clear at this point that we should try to determine the 
asymptotic behavior as x — 0* of the number € in Theorem 2 and then establish a 
result similar to the one just presented. We are thus led to consider approximations 
to f(x) of the form 


x” xnt2 
A — + Mf _ (n+2) ne 
(x) P, -1(x) of (cox) = + M,f (c)x) (n + 2)! 
x tt 2k 
rote Ma fC) TE 


where for a given n, the coefficients M,, and c,, depend only on i. Clearly, we have 
1 
M, = 1 and cy = ——-. 
° He 60 n+1 


In the spirit of Theorem 2, we want A,(x) to be an approximation to f(x) of 
order n + 2k + 2. By expanding f(**/)(c, x) in Taylor polynomials about 0 and 
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collecting terms, we see that the sequences {M,,} and {c,,} can be obtained 
recursively from 


M, a‘ M, c3k~? M3, 1 
__ + Hi SO (2) 
n! (2k)! (n + 2)! (2k — 2)! (n + 2k)! (n + 2k)! 
M, co‘** M, ck} My, Cox 1 
ey ag OE 
n! (2k + 1)! (n + 2)! (2k — 1)! (n+ 2k)! 1! (n+ 2k +1)! 
(3) 


Then, one verifies readily that if f("+?**” exists and is continuous at 0, 
lim f(x) — A,(x) _ My 1,42 
x30 x tek +? (n + 2k + 2)! 


and, therefore, A,(x) is indeed an approximation to f(x) of order n + 2k + 2. In 
fact, the following mean-value theorem holds. 


fr 2ktD(O), 


THEOREM 3. If f‘"*?)(t) exists and is continuous in [0, x], then there exists a 
number & in (0, x) such that 


n+2k 


f(x) = A,_,(x) + Max f°(8) Cay (4) 


Moreover, 0 < M,, <1 forj > 1 and 0<c,,<1 forj > 0. 


Proof. By induction on k. For k = 1, this is just Theorem 2. Let us assume that 
the result holds for k > 1. Assume also that 0 < M, j~<1 for j=1,..., k and 
O0<c,,<1 for j7=0,...,k4 —1. Finally, as part of the induction hypothesis, 
assume that g,_,(A) > 0 for 0 < A < 1, where 


(1 - hyrreen My( eo — yen My,( Co, — A) 4 
 (n+2k +1)! n\(2k + 1)! (n + 2k)! 
As usual, z, denotes the largest of 0 and z, for any real number z. (That g)(A) > 0 
for 0 < A < 1 follows from Bernoulli’s inequality.) First of all, observe that if f has 


n + 2k +1 continuous derivatives and f‘"+?**)(Q) # 0, then lim, _,),(&/x) = 
C>,, where & in (0, x) satisfies 


f(x) = A,_1(%) + My, f*P(E). 


(n+2k)! 


Therefore, 0 < c,, < 1. This, together with the induction hypotheses, implies that 
g,(0) = g,(1) = 0. Notice also that g;’(A) exists for A # c,, and 97 (A) = g,_4(A). 
This implies that g, is convex in [0, c,,] and in [c9,, 1]. 

If c,, # 0,1, then g//(A) > 0 for ’ # 0,c,,,1 implies that g, is increasing in 
[0, c,,] and in [c),, 1]. We also have that g,(0) = g,(1) = 0, which implies that g, is 
convex and increasing in [0, c,,] and it is convex and decreasing in [c),, 1]. The 
conclusion is that g,(A) > 0 in [0,1] with equality only for A = 0 and 1. 

If c,, were 0 or 1, then we could carry out the same analysis as above, using 
one-sided derivatives ‘instead. But c,, = 0 or 1 would imply that g, = 0 in (0, 1], 
which would force g,_, = 0 in [0, 1], contradicting the induction hypotheses. 
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Now we are ready to complete the proof: 
If f has n + 2k + 2 continuous derivatives in [0, x], we have 


x tte) 


7 


f(x) — A,(x) = f(x) — De Ma fea 


Now, expand each term in the right-hand side into its Taylor polynomial of degree 
2k — 2j + 1 about 0, with integral remainder, and collect terms. One gets 


4 7 x ont 24D (x _ a ae ; 
f(x) - A(x) = fF ()yoee @ 


x (cy .x _ p)7k- 2d ynti 


_ may, . (n+2k+2) t a ne, Fa 
rs, aif (1) (2k -—2j+1)! (n+2/,)! 


In each of the integrals above, substitute t = Ax. After rearranging, this gives 


f(x) ~ Ag(x) = x8? [POP (Ax) By (A) dd. 
0 


Since g,(A) > 0 in [0,1], we can apply the Mean-Value theorem for integrals to 
conclude the proof. 


Remarks. 

1. From the point of view of applications, it would be interesting to have a closed 
formula for the sequences {c,,} and {M,,} given by (2) and (3). Numerical 
computations suggest that for fixed n the sequence {c, ,} is monotonically increas- 
ing while { M, ,} decreases to 0. 

2. If f is analytic in a disk with center at the origin and radius R > 0, then one 
can use the Cauchy-Hadamard estimates to deduce that the expansion 


xt) 


(2) = Pra) + EMF ar) ay 


J 


is valid in the disk |x| < R/2. In practice, however, the region of convergence of 
this expansion appears to be at least as large as that of the power series expansion of 
f. (Cf. Remark 4 below.) The validity of this statement may depend on the behavior 
of the sequences M,, and c,; asi > ©. 

3. For n = 1, both the statement and proof of our main result are simpler: 
elementary combinatorial identities reveal that in this case c,,= 1/2 and M,, = 
1/4’ satisfy (2) and (3). This suggests the following proof for n= 1: In the 
Taylor-Lagrange formula 


, (m) (b _ a)" (m+1) (b _ a)" 
F(B) = F(a) + F(ab ~ a) + FFA FOOT 
take m = 2k. Put a= x/2 and b = x; then, put a = x/2 and b = 0, subtract and 
appeal to the Intermediate Value Theorem, to deduce that there is a € in (0, x) such 
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that 
2k 2 4 x x 2j+1 
Ks) =10)+ © arayl” »(5 [5] 


2k+1 


2 x 
4 fp RK +1) | 
(2k + pi (é) 2 
It follows that if 


fektD(E) x 12k+1 
lim Sra |s| = 0, 


then 


co 1 Qj+) x x 72/+1 
= en J — — 
f(s) = 10) +2¥ aay S\s] (5) 

4. We are grateful to W. Swift for calling our attention to the fact that if f is 
analytic at 0, then (5) is valid for all x in the interior of the Borel polygon of 
summability of f. Recall that the Borel polygon of f is constructed as follows: Let 
P be a point in the complex plane where f has a singularity and let O denote the 
origin. Draw the line Ap perpendicular to OP at P. Then, a point x is in the Borel 
polygon of summability if and only if x is in the half plane determined by A, and 0, 
for all the singular points P of f. It then follows that x is in the interior of the Borel 
polygon if and only if f is analytic at every point of the closed disk with diameter Ox. 
Therefore, if x belongs to the Borel polygon of f, then 


fod Eae(a}le- a] 


holds for all z such that |z — (x/2)| < |x/2|. As above, putting z = x and then 
z = 0 and subtracting gives the result. 


Example. lf f(x) = Ind + x), then 
fekD(x) _ (2k)! 
(1 4 x)er 
Applying (5), we obtain 
(06) x 2n+1 
In(1 + =2 ——— ; 
n( x) LY oils 


The Borel polygon of f is the half plane Re(x) > —1. Therefore, this expansion is 
valid for all x with Re(x) > —1. This should be contrasted with the circle of 
convergence of the Taylor series expansion for f(x) about x = 0. Even in the 
common region of convergence, this expansion converges faster. Since 


1 
arctan x = 5, Lind + ix) — In(1 — ix)], 
i 
we also obtain 


00 1 
arctanx = —i >), —— |(x + 2i)"" -(x- 21)" 


x |" 
,a9 22 + 1 


4+x? 
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which is valid for —1 < Im(x) < 1. In particular, for all x real, arctan x may be 
computed using 


arctan x = aL an 7 An(x) 
where 
4x 3 
A,(x) = ras A,(x) = al ; 3 | (3x? — 4) 
and 


Aol) = [za] 22? = 4) Ayal) = 24,00) 


For example, 


4+ x? 


1 1 


y (—1)" 
2= —_—_———— + 
arctan 4k +1 8k+6 


k 
k=0 4 


and 


ys y : (341) CO" 
5 fag jan 2M + 1\4F +1) 5° 


An Application. We will now use Theorem 3 to derive several numerical methods 
of approximating the solution of the initial-valued differential equation 


y=f(x,y) — y(x%o) =, 


where f (and, therefore, y) is a sufficiently smooth function. 

Fix h > 0. For each nonnegative integer j, we are seeking an approximation y,,, 
to the exact value of the solution y at x,,; = Xo + (j + Ih. Let f; = f(x,, y;) and 
denote by & the error y(x,,1) — yj41, assuming that y(x,;) = y, forO <i <j (E is 
the local discretization error). We will omit the derivation of the error term. Note, 
however, that in each case the error term can be derived using the main ideas in this 
note, namely expansion in Taylor polynomials with integral remainder followed by 
an application of the Mean-Value Theorem for integrals. 


(i). The Modified Euler’s Method: for n = 1 and k = 1, we apply (4) (in the 
interval [x;_,, x;4,]) to obtain the approximation y,,) = y,_, + 2hf, with E = 


y”(€)(h?/3). 


(ii). For n = 2 and k = 1 and the interval [x,_,, x;+,], we get 
/ 9h? I 
y(X;41) = y(x,_2) + 3hy (x;_2) + re (x;_}). 


Now, using the approximation y’(x,_,) = [y’(x;) — y(x,-2)|/2A and rearrang- 
ing, we get Ya, = Yj -2+ GhA/DF-2 + 3f,]) with the error term E = 
(3/4) yO(E)A*. 

This approximation does not appear to be one of the standard multistep methods 


b 
of elementary numerical analysis. However, applying it to approximate i] g(t) dt, 
a 
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we obtain 


[’s) dt = +b - a)} eta) + 35 - jh 


This is clearly asymmetric with respect to a and b, which suggests also considering 
the approximation 


['s(0) ae = 3(> - ofa ’ a) 


and then averaging these to obtain 


"(0 a= =(6- a)} eta) + 3e( =) 4 3¢( 2") 4 2(6)}, 


which is the well-known Simpson’s 3/8 Rule, with error —(1/6480)(b — a)°g(&). 
(ili). In (ii) above, we could equally well use the approximation y’(x,_,) = 
[y(x,) — 2y(%)-1) + y(x)-2)]/h?, which gives 


1 
Yar 5 LM y-2 — 18y,_, + 9y,| + 3hf,_>, 


with E = (3/4) y(€)h*. 
(iv). Now, we use (4) with n = 1, k = 2 and the interval [x,_3, x,,,] to get 
/ Mr (4h)° 
yxy) = y-3) +9" ty (Ga 
Approximating y “’(x,_,) as we approximated y’(x,_,) in (iil) above, we derive 
the multistep method 


4 
Yjay =Ij-3 3h [2h-2 ~ fi-r + 2f;]. 


with the error E = (14/45) yp (€)h°. 

(v). Again, we take n = 1 and k = 2, but we use the interval [x ;_,, x;,]. This 
gives y(Xj41) = y(X%);-1) + y(x,)2h + y"(x,)(2h)?/24. Using y(x,) = 
[y’(x;41) — 2y'(x,;) + y'(x;-)I/h*, we obtain 

Yjyar =Vj-1 + (h/3)| fiat + 4fj + fy], 
with E = (—1/90) y®(é)h°. Observe that this last formula, when interpreted as a 
numerical integration method, gives the well known Simpson’s Rule. Also, notice 
that the formula is implicit, since y,,, appears also in the right-hand side. However, 
using this formula in conjunction with the one derived in (iv), we obtain a 
well-known predictor-corrector method (Milne’s method): 


( 4h 
WP = ya 2 fat 2h 


h 
Yaa Yart [Ah + 4h +h], 


where IP? = I(Xj415 yfP)). 


Conclusion. All the results above arose from studying the error term in the 
Taylor’s polynomial approximation to a given function. By no means is this 
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restricted to Taylor's polynomials: in general, suppose that A(/f) is a numerical 
approximation to a functional F( f) such that F( f) = A(f) + E(/) for all f in an 
appropriate class of functions. If anything can be said about the error term E(f), 
then perhaps one should try to use A(f) + E(f) as a better approximation to 
F(f). Here, E(f) is an “estimate” of the error that does not depend on the 
particular function f. To illustrate this, consider the trapezoidal rule in numerical 
integration: 


[0 de= 9S Ta) +10), 


Now, the error of this approximation is —f”()(b — a)*/12, for some & in (a, b). It 
can then be shown that if f® is continuous and nonzero at a, then, as b > a*, c 
approaches the midpoint of [a, b], that is, 


, (c-—a) 1 
boat (b—a) (b — a) 20 


Therefore, we obtain the numerical integration method: 


[10 dex CAO a) + 4(6)) - (joo 


with error — (1/480) f “eye — a)°. Finally, we note that if we approximate 


p{S") wy [ile) 27") +709] (Oa) 


and replace, we obtain the approximation 


['(0) ae v 


elie) +a 


i.e., Simpson’s rule once again. 


*) +.7(0)}, 
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LETTERS TO THE EDITOR 
Editor: 


A search for a larger “largest known prime number” was conducted at the 
Amdahl Benchmark Center in Santa Clara, California. The search consisted of 
selecting over 350,000 prime candidates of the form h2” —1 with n> 216091, 
sieving them for prime factors less than one trillion and running a Lucasian test [1] 
on over 7000 remaining candidates. 


After over a year of searching for a prime number in this range, the prime 
(391581) x 2716193 — 4 


was found on August 6, 1989. The Lucasian primality tests take 33 minutes for 
numbers of this size on the Amdahl 1200E. The restriction n > 216091 was chosen 
so that any prime found would be the largest known, exceeding those discovered by 
Slowinski [2]. 

We wish to thank Jeff Young for confirming our result. 
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Editor: 


In the May issue of the MONTHLY, George E. Andrews and R. J. Baxter present 
“A Motivated Proof of the Rogers-Ramanujan Identities”. It seems to me that they 
missed a chance to give a simpler and better motivated approach to the proof. 

For each positive integer i, let F,(q) be the generating function for partitions 
with parts at least i and differences between parts at least 2. Then it is clear that 
F(q) =1+ O(q') 

and 


Fi(q) — Fi41(¢) = 9'Fis2(q) 
214 
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for all 7. Furthermore, it is easily seen that these equations determine all of the 
series F.(q). Thus if G,(q) is the generating function for partitions with parts of the 
forms 5m + 1 and 5m + 4, and G,(q) is the generating function for partitions with 
parts of the forms 5m + 2 and 5m + 3, and we put 


Gi42(4) = [6,(4) — Gisi(9)]/4' 


for all positive integers i, then to prove the two Rogers-Ramanujan identities, we 
need only show that 


G(q) =1+ O(q') 
for all i. The authors came to this conclusion in a less direct way. 


Raphael M. Robinson 
University of California 
Berkeley, CA 94720 


Editor: 


Indeed, if one starts out with both sides of the Rogers-Ramanujan identities, then 
one can effectively skip Sections 1, 2 and 4 of our paper. Robinson’s comments are 
essentially equivalent to those in our first paragraph of Section 5. However, starting 
with both sides begs the question of motivation. 

Our object was to start with the one side of the Rogers-Ramanujan identities 
concerning partitions into congruence classes mod 5. From there we examine a 
problem intrinsic to that side (Ehrenpreis’s question), and we are consequently 
forced to consider partitions with differences at least 2 between parts. Or as we state 
in our introduction, “In answering this question we are led naturally to the 
Rogers-Ramanujan identities themselves.” 


George E. Andrews 
Pennsylvania State University 
University Park, PA 16802 
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EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND RODICA SIMION 


A Link Between the Jordan Curve Theorem and the Kuratowski 
Planarity Criterion 


CARSTEN THOMASSEN 
Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104 
and Mathematical Institute, Bld. 303, Technical University of Denmark, 2800 Lyngby, Denmark 


1. Introduction. The purpose of this note is to show that, under certain mild 
conditions, a topological space has a Jordan Curve Theorem if and only if K, 3 does 
not embed in the space. This sheds some light on exactly what properties of the 
plane the Jordan Curve Theorem uses. 

The Euclidean plane has the following two fundamental properties: 


(1) THE JORDAN CURVE THEOREM. Every closed simple curve in the Euclidean 
plane partitions the plane into two connected components. 

(2) KURATOWSKI’S THEOREM. A graph G can be embedded in the Euclidean plane 
if and only if G contains no subdivision of either of the graphs K, or K, 3. 


All known proofs of the easy part of (2) (that K; and K;, cannot be embedded 
in the plane) are based on (1). [1] contains a short proof of the easy part of (2) based 
only on the restriction of (1) to polygonal curves. Conversely, it is shown in [2] how 
(1) is an easy consequence of the easy part of (2). Thus there is a close relation 
between the two theorems, and H. Wilf raised the question if this relationship holds 
in other topological spaces as well. Here we shall answer the question in the 
affirmative for those topological spaces that cannot be separated by a simple arc. 
For spaces of low connectivity the relationship disappears. Every graph (which is 
defined in the next section) may be thought of as a topological space where the 
edges are simple arcs. Then a nonseparating, simple closed curve is the same as what 
in [3] is called a nonseparating induced cycle. The investigations in [3] show that the 
existence of such cycles depends more on edge density than planarity (or nonpla- 
narity). One basic result in [3] says that every finite connected graph of minimum 
valency > 3 has a nonseparating induced cycle. For infinite graphs not much is 
known about nonseparating induced cycles. 


2. Terminology. A simple arc A in a topological space X is the image of a 
continuous 1-1 function f: [0,1] > X. If 0<a<b<1, then f({a, b]) is the 
segment of A from f(a) to f(b). A simple closed curve is defined analogously except 
that now f(0) = f(1). X is arcwise connected if any two elements are joined by a 
simple arc. A set Y C X separates X if X \.Y is not arcwise connected. We shall 
repeatedly make use of the following simple fact: If F,, F, are disjoint closed sets in 
X and A is an arc from an element of F, to an element of F,, then A contains a 
segment A’ from an element of F, to an element of F, such that A’ has no 
intermediate point in common with F, U F,. Furthermore, the union of a finite 
number of arcs is compact and hence closed in case X is a Hausdorff space. 
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A graph is a (finite) set V(G), called vertices, together with a set E(G) of 
unordered pairs of vertices called edges. If the edge xy is present, we say that x is 
joined to y. The number of vertices joined to x is the valency of x. The complete 
graph K,, is the graph with n vertices and all possible edges. The complete bipartite 
graph K,, , is the graph whose vertex set can be partitioned into sets A and B with 
m, respectively p, vertices such that the edges are all mp edges joining A and B. A 
graph G is embedded in a topological space X if the vertices of G are distinct 
elements of X and every edge in G is a simple arc connecting in X the two vertices 
which it joins in G. Moreover, no two edges intersect except at a common vertex. 


3. The link between curve separation and graph planarity. A special case of 
Euler’s formula for graphs in the Euclidean plane says that no simple arc separates 
the plane. (A short proof is given in [2].) The following result holds for spaces with 
that connectivity property. 


THEOREM. Let X be a Hausdorff space which is arcwise connected and which is not 
separated by any simple arc. Assume also that X is not homeomorphic to a simple 
closed curve. Then the following statements are equivalent: 

(a) Every simple closed curve separates X. 

(b) Every simple closed curve separates X into precisely two arcwise connected 
components. 

(c) K3 3 cannot be embedded in X. 

(d) Neither K,, nor K, can be embedded in X. 


Proof. Clearly (b) = (a) and (d) = (c). It is also easy to prove that (c) = (d). 
For suppose that K, is embedded in X. Let vj, v5,..., vs be the vertices of K.. Let 
P1, P> be points on the edge v,v, such that v,, py, pz, V2 are distinct. Now consider 
an arc A in K, whichconnects p, and p, and which contains all of v,, v2, V3, U4, Vs. 
Since X \_A is arcwise connected it contains an arc B from a point gq, on the edge 
U,V, to a point qg, on another edge such that BN K; = {4q,, q,}. It is now easy to 
find a K,, in K,U B. 

We shall now assume (c) and prove (b). Let C be any simple closed curve in X. 
Since X is not homeomorphic to a simple closed curve, there is a point p © X \C. 
Let A be a simple arc from p to C such that A M C consists of one point qg. Let A, 
be a segment of A from g to a point p, # p. Since X \ A, is arcwise connected it 
contains an arc A, from A\ A, to C having only its ends in common with A U C. 
Now A U A, contains a simple arc A, connecting q with another point qg’ on C such 
that A, C = {q,q’}. Let Ay, A; be the two segments of C from gq to q’. Since 
X\.A,; is arcwise connected, there is a simple arc A, joining a point r on 
A,\{q,q'} with a point r’ on A;\{q,q’} such that A, (A; U A,U As) = 
{r,r’}. Now A;\{q,q’} and A,\{r,7r’} belong to distinct arcwise connected 
components of X \C since a simple arc from A, to A, in X \C (having only its 
ends in common with A, and A,) together with A, U A, U A, U A, would be an 
embedding of K,,. So, X \C has at least two arcwise connected components. 

To see that X \C cannot have more than two arcwise connected components we 
assume (reductio ad absurdum) that p,, p>, p; belong to distinct arcwise connected 
components of X\C. Let A,, Aj, A, be pairwise disjoint segments of C. Since 
X \(C\_A,) is arcwise connected it has a simple arc from p, to p,. This arc 
contains an arc A, , from p, to a point q,, on A, such that 4), C= {qj}. 
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Similarly, there is a simple arc A; , from p; to a point g;, on A, such that 
COA, ,= {q;, ;} fori, 7 = 1,2, 3. Clearly, A, j OAK, p = D or {q;, ;} when i # k. 
We can ‘assume that A, , NA; , = {D7} when F # p. For j = 1,2,3, let B be the 
minimal segment of A " that contains 4, ;, 42; and q, ,. Then the nine arcs 
A, ,Gi, j = 1,2, 3) together with B, U B, U B; form an embedding of K, 3. This 
contradiction proves that X \C has precisely two arcwise connected components. 

We shall now complete the proof by showing that (a) implies (c). We prove this 
by contradiction. So we assume that X contains an embedding of K, ;. A k-cycle in 
K, 3 1s defined as a closed simple curve in K, consisting of k edges. We can only 
have k = 4 or 6. K3,3 has nine 4-cycles and six 6-cycles which each separate X 
(since we assume (a)). Consider now a 4-cycle C. Since K,3\C is in an arcwise 
connected component of X\C, there is another arcwise connected component 
which we denote by Q(C). Then Q(C) is also an arcwise connected component of 
X\K, 3. If A is a simple arc from Q(C) to K;, such that A has only an end q in 
common with K, 3, then we say that q is a point of attachment of Q(C) on K; 3. All 
points of attachment of Q(C) are on C. Moreover, if A is any segment of C, then 
X \(C \ A) is arcwise connected, which implies that Q(C) has a point of attach- 
ment on A. In other words, the points of attachment of Q(C) on K; , are dense in C. 

Now we may think of K;, as a 6-cycle C such that the vertices of K3 3 occur in 
the order v,, v,,..., Us (where the indices are expressed modulo 6) and such that 
K,, has the “diagonals” v,, v,,3(i = 1,2, 3). For each edge e; = v,v;,, we let C, be 
the 4-cycle 0,0,440;40;430;, and in Q(C;) ‘J e; we select an arc A; joining two 
distinct points p, and q, on e; such that A; 11 K3 3 = { p,, q,;}. In C we now replace 
the segment B, of e; between p, and q; by A;, i = 1,2, 3, 4,5, 6. The resulting simple 
curve is denoted by C’. We claim that X \C’ is arcwise connected. K3 3 has three 
4-cycles containing two diagonals of C. For each of these there is a corresponding 
component of X\ K;,, and using those we conclude that all arcs B, and all 
diagonals of C (except the ends) are in the same arcwise connected component R of 
X\ C’. In other words, K;3,\C’ is contained in R. Now suppose (reductio ad 
absurdum) that p © X\(C’ U R). Since C’\\(A, \{ Pj, 9, }) does not separate X, 
it follows that X has a simple arc D from p to A,. Since p € R, D does not 
intersect K,. But then D € Q(C,). In particular, p © Q(C,). By similar reasoning 
we prove that p € Q(C,). But Q(C,) and Q(C,) are disjoint. This contradiction 
completes the proof. 

Every open connected subset of the sphere satisfies all conditions of the theorem. 
Since a topological space satisfying (d) of the Theorem admits no embedding of 
nonplanar graphs, by Kuratowski’s theorem, it must be “spherelike” or “planelike.” 
This raises the question of whether every topological space satisfying the conditions 
of the theorem can be embedded into the sphere. 
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A New Series Representation for ¢(3) 


JOHN A. EWELL 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, IL 60115 


The Riemann zeta function ¢ is defined for each complex number s having real 
part greater than 1 as follows: 


o ] 
{(s)= Lis. 
n=1 7 
Actually the zeta function was first investigated by Euler about 250 years ago. Euler 
considered {(s) for real values of s> 1, and obtained many interesting and 
important results about the function. Among the more striking of these results is the 
now classical formula {(2) = 1°/6. 

In a recent issue of the MONTHLY [1] Boo Rim Choe gives yet another proof of 
Euler’s formula, which essentially turns on integration of the power series expansion 
of sin~'. Attempting to find a closed-form expression for {(3) by modifying Choe’s 
method, the author found instead a new series representation for §(3) that converges 
more rapidly than the defining series. This series representation is 


7 00 §(2n) 
SB)= baad CES Esa 


n=1 


(1) 


Since §(2n) > 1 as n— oo, it is clear that the typical term of the series is 


asymptotic to n~?2~?". Hence, the rate of convergence of this series is much faster 


than that of the defining series Ln~?. 


To prove (1) we consider the binomial series 


(1— x2) =14 ¥ ec, x 
k=1 


with coefficients 


c, = (-1)"{ | _ 2-24( 2k) _ Sa 


which converges for |x| < 1. Integration gives the series expansion of sin™~ x: 
fore) x 2k +1 
snot x=x+ ) c,——. 
ba, | 2k +1 


Divide the foregoing identity by x and integrate once more to obtain 


x 2ktl 


‘tl sin7l rat =x+ Yc, ———. 
J x “(2k +1) 
The change of variable u = sin~'+¢ transforms the integral on the left side of (2) 
into 


(2) 


1 

sin x 
[ ucot udu, 
0 
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so that when x is replaced by sin ¢ in (2) we get 


ucot udu = —_———3 3 
J ue 5K (2k +1) 3) 
But 
sen 
ucotu=1-2¥ ( dp ", 
qT 


n=1 


which follows easily from the well-known resolution of the cotangent into partial 
fractions [2, p. 444]. Hence, _ becomes 
far 7" Int 1 Ky “Ges 1)” 


Integrate this from 0 to 7/2 and use Wallis’ product [2, p. 223] in the form 


m/2 K+] | 
tdt = 
caf mm 2k+1 
to obtain 
a 0 ¢(2n) (1/2)"*? 0° 


ge 


_—_————_. = ———; = —§(3). 
yo, om" (2n + 1)(2n + 2) Xu (2k + 1)° 8 (3) 
Simplifying this equation we thus derive (1). 


The author would like to thank the referee for suggested improvements in the exposition. 
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The Dirichlet Problem for a Disk 


DAVID MINDA 
Department of Mathematical Sciences, University of Cincinnati, Cincinnati, Ohio 45221 


1. Introduction. The Dirichlet problem for the open unit disk D can be formu- 
lated as follows. Given a continuous function v: dD = T — R, does there exist a 
continuous function u:D — R such that u|D is harmonic and u[T = v? To say that 
u is harmonic means that u is twice continuously differentiable and Au = u,,. + u,, 
= 0. The answer is affirmative and the unique solution is given by the formula 


1 pan it it 
u(z) = wal v(e")P(z,e") dt zeéeD, (1) 


v(z) zeT. 
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Here 


| ev +z 1 — |z|? 
P(z,e") = Re = 


eit —27 le” _ z|? 

denotes the Poisson kernel; P(z, e’’) is harmonic for z © D and continuous for 
e' © T. That uw is the natural guess for the solution to the Dirichlet problem for D 
is a consequence of the Poisson integral formula. 

There are two basic steps in proving that the function u defined by formula (1) 
actually does solve the Dirichlet problem for D. First, it is necessary to show that u 
is harmonic in D. This is relatively easy to do since it is not difficult to verify that it 
is permissible to differentiate under the integral sign and obtain 


1 pan, , | 
Au(z) = ral v(e’) AP(z, e") dt = 0, 


since P(z, e’’) is harmonic as a function of z © D. The second step is to show that u 
is continuous in D. It is sufficient to show that u has the proper boundary values, 
that is, 


lim u(z) = v(e’”) 

for all e'? € T. This result is called Schwarz’ Theorem. The proof of this result is 
usually more involved because one cannot justify moving the limit operation under 
the integral sign in formula (1). It is worthwhile to outline one of the standard 
proofs of Schwarz’ Theorem ([1, pp. 169-170], [2, pp. 137-139], [3, pp. 257-—259)). 
First, one shows that it is sufficient to consider the special case e’? = 1 and 
v(1) = 0. Then one splits the integral into two parts (a small arc centered about 1 
and the remainder of the unit circle) and notes that the first term is small since v is 
close to zero on the first arc, because v is continuous and vanishes at 1, while the 
second term is small because the Poisson kernel P(z, e’’) can be made negligible if z 
is close enough to 1 and e” is kept a fixed distance away from 1. This is a standard 
“approximate identity” argument. 

The purpose of this note is to present a simpler proof of Schwarz’ Theorem by 
writing the integral in a different form so that passage to the limit under the integral 
sign is easily justified. This proof is known within the context of harmonic analysis 
and can be motivated from the viewpoint of non-Euclidean Fourier analysis [5, pp. 
97-99]. The two main points of this article are to make this simple proof better 
known and to provide an elementary motivation for the proof. 


2. Poisson integral formula. A motivation for the proof of Schwarz’ Theorem 
arises naturally from one of the standard derivations of the Poisson integral formula 
({(1, pp. 166-168], [2, pp. 134-135], [8, pp. 195—196]). Let us recall this approach. 

Suppose that u is continuous on D and harmonic in D. A simple extension of the 
mean value property for harmonic functions yields 


u(0) = = fue") dé. (2) 


From the point of view of geometric function theory there is a natural way to obtain 
an analog of formula (2) with O replaced by an arbitrary point a © D—Jjust make 
use of a conformal automorphism S of D that sends 0 to a and replace u by uo S. 
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Specifically, take S(z) = (z + a)/(1 + az) and note that S yields a homeomor- 
phism of T. Then wo S is continuous in D and harmonic in D, so (2) gives 
i@ 
e- +a 


u(a) = (ue$)(0) = fue syle) d@ = =f" aa dé. (3) 


Now, S7\(w) = (w — a)/(1 — @) and the change of variables e” = S(e!”), or 
e!? = §~1(e'), produces the Poisson integral formula 


l pe, ..1-|al? 
u(a) = al ule") i ap dt (4) 
because (S~+)(w) = (1 — Ja|?)/(1 — aw)? yields 
d@ = |(S~*)'(e”)| dt = tala" = ila" dt. 
jl — ae"|? je” — al? 
Here we have used e“e~" = 1 and |e~” — a| = |e” — al. 


For our subsequent purposes the crucial observation is that 


1 2a : 2a it it 
val y do = = J v(e") P(a, e) dt (5) 


for any integrable function v defined on T. 


ef 4 a 


1+ ae” 


3. Main result. We are now in a position to present a simple proof of Schwarz’ 
Theorem. 


SCHWARZ’ THEOREM. If uv: T — R is continuous and 


(2) = <= [ole P(z.e" a 
u(z) = = [ole z,e"') dt, 


then lim, _, .» u(z) = v(e’®) forall e'? € T. 


Proof. From the identity (5) we know that we can also write 
1 27 
u (z) = a f D 


z+ el! 


1 + ze" 


Now, for e # —e!? 
ztet . 
lim -~ = e'?. (6) 


z—el? 1+ Ze" 


Since v is continuous on the compact set T, v is bounded on T. The Lebesgue 
bounded convergence theorem implies that 


1 pon z+e"™ 1 poor. | 
— | gy = iP) dt = v(e'® 
lim u(z) i lim | To pee dt al v(e'?) dt = v(e'®). 


z—e'? 27 ze’? 


Observe that this proof of Schwarz’ Theorem works at any point of continuity of 
v if v is merely Lebesgue integrable and bounded on T. 


4. Comments. The one nonelementary aspect of this proof is that it makes use of 
Lebesgue integration. However, the bounded convergence theorem is valid for the 
Riemann integral and can be proved completely within the context of Riemann 
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integration ((4], [6], [7]). Another simple remedy can be obtained by noting that the 
convergence is almost uniform. The elementary inequality 


2|z — e'?| 


> je” + e'?| — |z — e’?| 


z+ ei?! 


1 + Ze” 


ip 


shows that the limit (6) is uniform on compact subsets of T \ {—e'”}. Since v is 
continuous at e’?, this implies that v((z + e”)/(1 + Ze’)) converges to v(e’”) 
uniformly on compact subsets of T \ {—e’”}. This observation together with the 
following elementary lemma about an almost uniformly convergent sequence of 
Riemann integrable functions yields a proof that avoids Lebesgue integration. 


LEMMA. Suppose that {f,} and f are Riemann integrable functions on [a, b]. 
Assume that there is a finite constant M such that |f,(x)| < M and |f(x)| < M forall 
x € [a, b] and that for any n > 0, the sequence { f,,} converges to f uniformly on 
[a+ 7, b — yn]. Then 

b b 
lim [hlx) dx = [fs dx. 


n—- © 
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Diagonalization over Commutative Rings 


R. BRUCE RICHTER 
Deptartment of Mathematics, Carleton University, Ottawa, Ontario, Canada KIS 5B6 


WILLIAM P. WARDLAW 
Department of Mathematics, U.S. Naval Academy, Annapolis, Maryland 21402 


1. Introduction. We are interested in diagonalizing matrices over the integers: 
given an integral matrix A, find an integrally invertible matrix P such that P~'AP 
is diagonal or show that no such matrix exists. 

In this note, we solve this problem. To obtain our characterization (given here as 
Theorem 5), we prove a variety of elementary generalizations to commutative rings 
of theorems that are well known over fields. 

There is an extensive literature on the topic of linear algebra over commutative 
rings. The texts [2, 3, 4] are all valuable references. However, we found they lack 
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many of the results we required and had expected to find in such texts. In 
particular, we could not find our Theorems 1 and 5 in the literature. 

It is our purpose to present this material in an elementary manner which is 
accessible to the reader having only a basic knowledge of linear algebra and an 
introductory knowledge of rings. 


2. Basics. There are many similarities between the development of the portion of 
linear algebra presented here and the usual development over fields. However, the 
absence of inverses and the existence of divisors of zero create some problems. 


Example 1. Let A = p >|. If P= | °| then P™! is integral and 


P~'AP is the diagonal matrix diag(1, 2). Hence A is diagonalizable over the integers 
Z. Let Z, be the integers modulo n. Then A is evidently diagonalizable over Z,, for 


' ; , then, over Z,, O-'AQ = diag(4, 5), 


so the diagonal form of A is not unique up to ordering of the diagonal elements. 
Note that Z, is the direct sum of the fields Z, and Z,. Over the former, 
diag(4, 5) = diag(2, 1), while over the latter, diag(4, 5) = diag(1, 2). Thus, in a sense, 
the diagonal elements are the same, but reordered. 

Henceforth, we shall let R be a commutative ring with unity 1 and M an 
R-module. (See [1] for definitions; we assume M is unital, 1.e., lv = v for every 
v © M.) Independent sets, spanning sets and bases in M are defined as usual for sets 
of vectors over a field. Warning: Not every module has a basis. 

(We have chosen to use the words independent and spanning rather than free and 
generating— as in [3]—to emphasize the parallel we are intending to the theory over 
fields. The definitions given for free and generating in [3] allow greater flexibility in 
certain developments with which we shall not be concerned.) 

A module is free if it has a basis. The module R””? of all n X p matrices with 
entries in R is an example of a free module. We shall be dealing chiefly with 
R”" = R®™! and its submodules. 


each integer n > 1. However, if QO = 


We begin with some relevant facts about R-modules. Suppose V = (v,,..., U;) 
and W = (w,..., w,) are sequences of elements of the module M and A = (a, ,) is 
an n X k matrix over R. Then we write V = WA if v, = Li_,a;;w;,, for j = 1,..., k. 


If M is a submodule of R4, then W can be interpreted as a d X n matrix and WA as 
a matrix product. (We speak here of sequences of vectors rather than sets of vectors, 
since the ordering is important.) 

We have not been able to find the following result in [2], [3], and [4]. On the other 
hand, [3], for example, does show that any two bases of a free module have the same 
cardinality. 


THEOREM 1. Let V and W be the sequences above and suppose V is independent and 
W is spanning. Then k <n. Moreover, k =n implies W is a basis (but V need not 
be). 


Proof. Since W spans, V = WA for some A € R”™*. If k > n, then [2, Thm. 51] 
implies there is a non-zero x © R* such that Ax = 0, contradicting the indepen- 
dence of V. 

Suppose k = n. We must show W is independent. Now x # 0 implies Ax # 0, 
since V is independent. Thus, [2, Cor. to Thm. 51] implies det( A) is not a divisor of 
zero; so [2, Thm. 50; or 6, Thm. 3] implies A has an inverse B over the quotient ring 
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Q of R. (The denominators in Q are the non-divisors of zero in R, so det(A) is a 
unit in Q.) 

If Wx = 0 for some x € R", then 0 = W(AB)x = (WA) Bx = Vu, where u = 
Bx € Q”. Multiplying u by 6 to clear the denominators yields y = du = 6Bx € R". 
Then Vy = 6Vu = 0, which implies y = 0. Now 6x = 6(AB)x = 6Au = Ay = 0, so 
x = 0, since 6 is not a divisor of zero. Thus, W is independent. 


Several corollaries to Theorem 1 or its proof can now be easily established, via 
arguments well known over fields. 


COROLLARY 1.1. Let P and Q be matrices over R, each of whose columns form a 
basis of the same subspace of R". Then there is an R-invertible matrix S such that 
P= QS. 


COROLLARY 1.2. If P is an n X n matrix over R, then the columns of P form a 
basis for R" if and only if P is invertible over R (if and only if det(P) is a unit in R). 


Let A © R”*", An element A of R is an eigenvalue of A if there is a nonzero 
vector v © R” such that Av = Av. The vector v is an eigenvector. 


COROLLARY 1.3. Let a be ann X n matrix over R. Then A is diagonalizable over R 
if and only if there is a basis of R" consisting of eigenvectors of A. 


3. Eigenvalues and eigenvectors. Corollary 1.3 is central to our discussion. Thus, 
we must find a basis of eigenvectors of A. Now A is an eigenvalue of A if and only 
if the matrix C = C,(A) = AI — A is singular (i.e., Cu = 0 for some v # 0) if and 
only if f,(A) = det[C,(A)] is a divisor of 0. 

If v is a member of a basis of eigenvectors, then {uv} is independent, 1.e., av = 0 
implies a = 0. If Av = Av, then, for C = C,(A), Cv = 0, so adj(C)Cuv = det(C)v = 
0 implies det(C) = 0. Thus, f,(A) = 0. It follows that we need only consider 
eigenvectors corresponding to zeroes of the characteristic polynomial of A. In 
summary, we have 


LEMMA 2. Let A be ann Xn matrix over R with characteristic polynomial f,(t). 
Then  & R is an eigenvalue of A if and only if f,(A) is a divisor of zero. Moreover, 
the eigenvalue X has an associated independent eigenvector only if f,(A) = 0. 


Another comment is in order. In [3] it is observed that an independent vector v 
need not be part of a basis of a free module M. (For example, 2 is independent in Z, 
but is in no basis of Z.) A vector v is unimodular if it is independent and 
M = (v) @ N for some submodule N of M, where (v) = {rv|r © R}. If v is in 
some basis of M, then v is unimodular. Thus, the “moreover” statement of Lemma 
2 might have read, “The eigenvalue A has an associated unimodular eigenvector 
only if f,(A) = 0.” 

We observe that if R has any nonzero nilpotent elements and n > 1, then the 
converse to the moreover statement of Lemma 2 fails. For example, if A = 
diag(3, 3) over Z,, then f,(1) = 0, but no eigenvector corresponding to 1 is indepen- 
dent. 

For the eigenvalue A of A, let E, = {v|Av = Xv} be the associated eigenmodule. 
Let Q(A) = {A © RIf,(A) = 0}. To find a basis of eigenvectors of A, we need only 
consider elements of the EZ, for A € Q(A). 
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Example 2. For the matrix A from Example 1, over R = Z,, every element of R 
is an eigenvalue! However, f,(A) = 0 only for A =1, 2, 4 and 5, so Q(A) = 
{1, 2, 4, 5}. For each A € Q(A), let S, = {v, } be a maximal independent subset of 
E,. Of the six pairs of these four sets, only two pairs have unions which are 
independent, namely S, US, and S, U S,;. These give us the two diagonalizations 
seen in Example 1. 

Example 2 shows the naiveté of the definition of eigenvalue. One should, 
perhaps, define to be an eigenvalue if there is an independent (or unimodular) 
vector v such that Av = Xv. In fact, in [3] an eigenvector is required to be in some 
basis of the module! 

The following is the commutative ring version of a well-known result for fields; 
the proof is the usual induction and is omitted. 


LEMMA 3. Suppose 22 C Q( A) has the property that X\, 7 €Q and X #7 imply 
\ — 7 is not a divisor of zero in R. Then the sum XU) =F) is direct. Hence, S, € Ej 
independent implies U, <gS) is independent. 


The matrix A = diag(2, 4) over Z, shows the converse of Lemma 3 is false. 
Lemma 3 explains the outcome of Example 2; more generally it provides a 
mechanism for constructing “large” independent sets of eigenvectors. 


4. Diagonalizing. Since the converse of Lemma 3 can fail, the general problem of 
building a basis of eigenvectors seems to be quite difficult. However, when R is an 
integral domain, Lemma 3 shows that the sum of the eigenmodules is direct. The 
procedure for diagonalization would seem to be: (1) find a basis of each eigenmod- 
ule; and (2) if the union of these bases (an independent set) is a basis of R”, then A 
is diagonalizable. Obviously, two problems may arise: (1) some eigenmodule may 
have no basis; and (2) the union of the particular bases we choose might not be a 
basis of R”. (We note that if R is a principal ideal domain, then every module has a 
basis, so problem (1) does not arise [4, Thm. 230].) 


y-) 


Example 3. Consider the matrix A = | | over the ring R = Z[y —5]. 


The eigenvalues of A are y —5 and 1, but the eigenmodule E, has no basis. Thus, 
this is an example of problem (1). 


0 1 


Example 4. Consider the matrix A = over Z. The reader should have 


little difficulty showing that each eigenmodule has a basis, but no union of their 
bases is a basis of Z?. This, then, is an example of problem (2). 

Perhaps somewhat surprisingly, these are the only problems that can arise in 
diagonalizing over integral domains. 


THEOREM 5. Let R be an integral domain and let A € R"*". Then A is diagonaliz- 
able over R if and only if each eigenmodule of A has a basis over R and, for every 
choice of these bases their union is a basis of R”. 


Of course, as the special case of interest, we have the following. 


COROLLARY 5.1. Let A € Z"*". Then A is diagonalizable over Z if and only if, for 
any choice of bases of the (integral) eigenspaces of A, the union of these bases is a 
basis of Z". 
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Proof of Theorem 5. If there is any basis of eigenvectors, Corollary 1.3 shows A is 
diagonalizable over R. 

Now suppose A is diagonalizable over R. Let Q~'4Q = D, where Q is invertible 
over R and D is a diagonal matrix. The eigenvalues of A and their multiplicities are 
determined in the quotient field K of R. These are the entries of D. Without loss of 
generality, we may assume that equal eigenvalues of A occur in consecutive columns 
of D in the order \,,...,A,. Write Q =[Q, --- Q,], where the columns of Q, 
form a basis over K of the eigenmodule associated with X,. 

We show now that Q, is a basis over R of the eigenmodule associated with 4 ,. If 
v © R” and Av =A,v, then v = Q(Q7'v) is the unique expression of v as an 
R-linear combination of the columns of Q (the columns of Q form a basis of R”). 
This is also the unique expression of v as a K-linear combination of the columns of 
Q. But v is in the eigenspace (over K) associated with A,, so v is a K-linear 
combination of the columns of Q ,. Hence the columns of Q, form an R-basis of the 
R-eigenmodule associated with A ,. Thus, each eigenmodule of A (over R) must have 
a basis. 

Suppose that, for each j, P, is a matrix whose columns form an R-basis of the 
eigenmodule associated with A, and let P =[P, --- P,]. By Corollary 1.1, for each 
j there is an invertible matrix S, such that P; = Q,S,. Let S be the block diagonal 
matrix diag(S,,..., S,). Evidently, S is invertible over R. Since P = QS is the 
product of invertible matrices, P is also invertible over R. It follows from Corollary 
1.2 that the columns of P form a basis of R”. Hence, the union of the arbitrarily 
selected bases P, is a basis of R”. & 


To actually find bases of the eigenmodules £,, we suppose R is a Euclidean 
domain. The bases always exist [4, Thm. 230] and can be found by the algorithm 
summarized below. 

Let C = C,(A) = AI — A. Column operations making repeated use of the divi- 
sion algorithm can be used to put the gcd of the first row of C in the (1, 1)-position 
and zero every other entry of the first row. Proceeding inductively and keeping track 
of the elementary matrices producing the column operations, we can find a matrix U 
which is invertible over R such that CU is column reduced to lower triangular form. 
CU is called the Hermite Normal Form (HNF) of C. All the zero columns of CU 
are to the right and it is not difficult to show that the columns of U in the same 
positions as the zero columns of CU form a basis of the eigenmodule EF, of A. 
(Doing this in polynomial time over the integers is described in [5]. We are grateful 
to W. J. Cook for having brought the HNF to our attention.) 

Corollary 5.1 and the procedure just described realize our goal of determining 
which integral matrices are diagonalizable over the integers and diagonalizing those 
that are. 
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Random Triangles 
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A question similar to the one that follows occurred on the final examination of a 
second-year course in probability at the University of Birmingham. One wonders if 
the setter got it right. 


A chimpanzee named Euclid is teaching himself the geometry of the triangle by 
notching thin, straight jungle twigs with his finger nail at two independent 
random points, breaking the twig into three at these points, and then noting if it 
is possible to construct a triangle from the fragments. Nearby Pythagoras, a 
colleague of Euclid’s, is learning geometry in a similar fashion; his procedure is 
the same except that after making the first notch he breaks the twig into two and 
then etches the second notch on the longer of the two resulting portions. After 
breaking this, he too tries to construct a triangle from the three fragments. 

Archimedes, a rather learned chimpanzee, strolls through the jungle clearing 
and observes his two companions. After a moment’s reflection he comments, 
“Pythagoras, you are 4/3 times as likely as Euclid to succeed in constructing a 
triangle.” 

Geometrically or otherwise evaluate the two probabilities and say whether 
Archimedes’s calculation is correct. 


Analysis. Consider an idealized twig of unit length and random break-points B,, 
B, measured from one end of the twig. 


The sample space for this experiment can be parameterized by the set of all pairs 
of break-points, (b,, b,), falling in the unit square (see Figure 1). A success, where a 
triangle is possible, occurs if and only if the sum of any two sides is greater than the 
third. When b, < b,, the lengths of the three sides are b,, b, — b, and1 — b,, anda 
success implies: (1) b, + b, — b, >1-—5),, so that b, > 1/2; Gib, + 1— b, > 
b, — by so that b, — b, < 1/2; and Gi) b, — b, + 1 — b, > by, so that b, < 1/2. 
These three linear inequalities determine one of the two triangular regions enclosed 
by the dashed lines in Figure 1(a). The other is determined by consideration of the 
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(a) Euclid (b) Pythagoras 
Fic. 1. The sample space. 


case b, > b,. Exterior to the dotted triangles, the square regions are failures as the 
two notches fall in the same half of the twig, and the triangular regions are failures 
as the two notches are too far apart. For Euclid, all points in the square are possible, 
and by symmetry equal areas are equally probable, so that his probability of success 
is 1/4. Now Pythagoras’s sample space is a subset of Euclid’s modified to exclude 
the region corresponding to making the second break in the shorter fragment. So 
from Figure 1(b), Pythagoras’s success probability is 1/3 and Archimedes is right. 

But there is a twist to the tale: under Pythagoras’s sampling scheme, equal areas 
are not equally probable. To see this, suppose they are and calculate the areas in the 
vertical strips of width 6 located on the horizontal axis at two points 0 < a < B < 
1/2. From Figure 1(b) the probability P(B, lies in a + 6/2) is greater than P(B, 
lies in B + 6/2), but this violates the uniform distribution of B, along the twig. 

To ensure a uniform distribution we have to assign a larger probability to the 
shorter interval: so given B, = b, (> 1/2) say, then B, is uniformly distributed on 
the interval (0, 5,) and not the interval (0, 1). Hence 


) 
- (b, — 0) 


and the density function is illustrated in Figure 2. Consequently Pythagoras’s 
success probability is 


6 
P| B, liesina + | Fi = b, 


1 
1 1/2 
2 —db, ) db 
I, Io vat, rf 


which simplifies to 
1 
2[° —(1 — b,)db, = 2[log b, — by]. = 2log2 — 1 = 0.38... 
1/2 5, 


and not 0.33 --- , Archimedes’s analysis was naive. 
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A New Extension of the Derivative 


DANIEL KOPEL 
10 Polo Road, Great Neck, NY 11023 


MICHAEL SCHRAMM 
Department of Mathematics, LeMoyne College, Syracuse, NY 13214 


How, in an elementary setting, can we go beyond calculating one example after 
another to learn about differentiation? Can we see what differentiation is rather 
than just how it works? What are the vital properties that an operator must have to 
be called a “derivative?” One way to consider these questions is to construct 
extensions of the derivative. In checking whether a proposed extension is what we 
want it to be, we learn more than we could by just doing examples. In this note we 
examine one such extension. The motivation for our definition is very simple, and 
might form the basis of other, similar experiments. 

The relationship of the graph of a function to its tangent line is described in 
familiar words: “Look closer and closer at the point of tangency, and the curve and 
its tangent line look more and more alike.” Most methods of extending the 
derivative retain the difference quotient as the measure of how much a curve “looks 
like” its tangent line, and proceed by manipulating the way the limit of the 
difference quotients is computed (see e.g., [1, 2]). 

We take a different approach: Reinterpret the phrase “The curve looks like the 
line.” There are many ways in which a set of points can be said to “look like” a line. 
The one given by the ordinary definition of the derivative may not be the most 
natural. On the other hand, most of us can recognize the similarity of two sets of 
points, even as we appreciate the difficulties in defining it. 

One way in which a set of points may be said to look like a line is in the sense of 
least squares. Usually used to find the line that best fits a finite set of points, the 
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method may be adapted for our purposes in this way: Let m(h) and b(h) be the 
numbers which minimize the expression 


PU — m(h)t — b(h) at 


for a given value of h (we are concerned only with m(h), the slope of the 
approximating line). Setting partial derivatives to 0, solving the resulting system of 
equations, and integrating where possible, we find 


m(h) = al [noe _ 


We say f is Least Squares Differentiable if lim, _, )m(h) exists, and call this quantity 
the Least Squares Derivative of f at x, denoted LSD, f. 

We now ask some questions concerning this operator. Some of these questions 
are specific to the Least Squares Derivative, while others can be asked, with 
appropriate modifications, of any proposed extension of the derivative (in what 
follows, words such as “differentiable” and “derivative” without qualification mean 
“differentiable in the ordinary sense” and “the ordinary derivative,” etc.). 


‘ ['roa}, 


x+r> 
2 |x 


Is it linear? Any operator that is to be called a derivative certainly must be 
linear. The integrals which define m(h) act linearly on the function f, and the limit 
acts linearly on m(h), so LSD, 1s linear. 


Is it an extension of the derivative? We want our operator to be an extension of 
the derivative, that is, any function that is differentiable should be least squares 
differentiable, and LSD, f should be the same as f(x) in that case. We will omit 
most of the details, but recall that the function f is differentiable at the point a 
with derivative L if f(x) — f(a) — L(x — a) = &x — a), where « > 0 as x > a. 
We may substitute f(a) + L(x — a) + e(x — a) for f(x) in the integrals defining 
m(h) and obtain the desired result. 


Is it a proper extension of the derivative? Are there any functions that are least 
squares differentiable but not differentiable? The affirmative answer to this is quite 
easy, and reveals our motivation for considering this extension in the first place. 
Change the value of a differentiable function f at the point at which the derivative 
is to be computed. This will render f nondifferentiable, but the integrals used to 
evaluate m(h) are not effected by such a change. The function will still be least 
squares differentiable. If we use Lebesgue integrals, we may change the function on 
a set of measure 0 without losing least squares differentiability. It would seem that 
the least squares derivative is an improvement over the derivative in this sense. This 
leads us to our next question. 


Are there any least squares differentiable functions that do not arise in the way 
just described? If there were none, our “new” derivative would be of little interest. 
The example we give here is more complicated than it need be to answer this 
question, but will be of use later: Let C, be the characteristic function of the set 

| 1 2j 1 2j+1- 

x 


‘Dia + gai <PIS Rit + gar i= O1,..-5 f= 01,...,27 — i} 


Here we have shaded, in part, the region under the graph of C,: 
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It may be shown that LSD,C, = 0, and that C, is not equal almost everywhere to a 
differentiable function. 

This example works because the integrals do not distinguish well between 
comb-like functions with very thin teeth and functions that are slowly increasing. 
They ignore some of the essential character of such a function, which is apparent in 
our next results. 


Does it satisfy other familiar differentiation rules? While we may not wish to 
make this requirement, the extent to which a new operator mimics (or fails to 
mimic) the behavior of a familiar one is certainly of interest. Here we must adjust 
our expectations. We can’t expect the product rule, LSD,( fg) = f(x)LSD,g + 
g(x)LSD_f, to hold in a setting where we may change the values f(x) and g(x) in 
any way we wish. We might try to avoid this difficulty by substituting, for instance, 


. 1 xt+h 
fim Gf Fas 
for f(x) (the relationship between m(h) and b(h) makes this seem appropriate, and 
the limit is equal to f(x) if f is continuous). But this does not help. Let C,(x) = 
xC,(x), then LSD,C, = 5. 


ee ae: 1 ee oy 
lim = C,(x)dx = 5 and lim 5 J C,(x)dx = 0. 
Now C,(x) = C,(x)C,(x), so even our modified product rule would yield >= 
LSD,C, = 4} -4+0-0= 4. Similarly, LSD, fails to satisfy the chain rule: Since 
C,(x) = C,(C,(x)), we would have 


5 = LSD,C, = LSD,(C, ) C,) = LSD,C, . LSD,C; _ 1 


4 

Further questions suggest themselves: In view of the last result, should we 
consider this attempt a failure? If we extend the derivative (thereby widening the 
class of “differentiable” functions) we should expect to lose some of its properties. 
How important are these lost properties? 

We know that “Differentiable = Least Squares Differentiable.” Is there a simple 
property that can be inserted to make “Least Squares Differentiable and 2??? => 
Differentiable” true? (“Continuous” will not work. The function C, is continuous at 
0, but is not differentiable there.) 
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How does this derivative behave with regard to other properties and uses of the 
derivative? For instance, is there a meaningful version of Taylor’s Theorem for this 
derivative? 

Most important, what does this preliminary exploration teach us about the 
derivative? While suggesting some directions for experimentation, these results 
contain a warning that familiar properties of the derivative might be more strongly 
attached to the “limit of difference quotients” than we suspect. This brings us full 
circle to one of our very first questions: What are the vital properties that an 
operator must have to be called a “derivative”? This is probably where the 
discussion should begin! 
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Remainder Estimates in Taylor’s Theorem 


G. B. FOLLAND 
Department of Mathematics, University of Washington, Seattle, WA 98195 


Suppose f: R > R is n times differentiable at a, that is, f is of class C“~) ina 
neighborhood of a and f(a) exists. Then we can form the nth degree Taylor 
polynomial and remainder of f at a: 

n tq 
P,(h, a) = ye 
0 


The versions of Taylor’s theorem in almost all calculus books, both elementary and 
advanced, give some formula for R, that involves the (m + 1)st derivative of f. 
Actually, it is quite unnecessary to invoke f‘"*” to obtain good control of R,,. This 
point is perhaps too fine for standard first-year courses, but it should be of interest 
in honors or advanced calculus courses. 

In the first place, assuming only that f is n times differentiable at a, R,(h, a) 1s 
o(h") as h — 0, that 1s, 


» R,(h,a) = f(a +h) — P(A, a). 


R,(h, a) 
lim —_—— = (0 (1) 
h->0 h 


Indeed, after an (n — 1)-fold application of ! H6pital’s rule, (1) reduces to 


fat h)— f(a) 
kin —- 
h-0 h 


_ f™(a) = 0, 
which is true by definition of f(a). (An analogous result is valid for functions of 
several variables, but it cannot be obtained quite so cheaply: see [1].) 

Under the slightly stronger hypothesis that f is of class C‘ on an interval 


containing a, we can obtain better bounds for R, as follows. We start with the 
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usual integral formula for R,,_,, 


h" n~-1 ¢(n) 
Ry a(hy a) = 7, mofo f(a th)dt, 


which is easily proved by integration by parts, and give it an extra twist: 


(n) 
R,,(h, a) = R,,-1(A, a) 7 ce, 


= ap f(a +m) -F%(@)]a. 


Now any estimate one may have for the modulus of continuity of f‘ at a can be 
translated into an estimate for R,(h, a). For example: 

(I) If f‘” is continuous on an interval J, then f(a + th) — f(a) > 0 as 
h — 0 uniformly for ¢ in [0, 1] and a in any compact subset of J; hence (1) holds 
uniformly for a in compact subsets of J. 

(II) If f“” is Hdlder continuous of exponent A, 0 <A <1, then R,(h, a) is 
dominated by |h|"**. More precisely, if 


[f(a + x) — f(a)| < Clx|" (3) 


for all x on the line segment from 0 to h, then 


oe hyrt+a a1 C\h\"** 
|R,(h, a)|< Good, @ _ t) tdt = (1 +aA)(24A)...(n 4A)’ (4) 


(To prove the last equality, integrate by parts.) In particular, this yields the standard 
estimate for R,, under the hypothesis that f is of class C+ and |f@*?| < C, for 
then (3) holds with A = 1 by the Mean Value Theorem. 

The same arguments work equally well for functions of several variables, say f: 
R?” — R. (We denote points in R? by x = (x;,..., x,).) Suppose f is of class Cc"), 
Since (d/dt) f(a + th) =h- Vf(a + th), the nth Taylor polynomial of f is 


P (ha) =» CL W®) 


j=0 J 


; (5) 


and the integral formula for the remainder analogous to (2) is 


R,,(h,a) = aay fe — t)"~*|(h -V)"f(a+ th) — (h- v )"f(a)] dt. (6) 


However, it is preferable to restate these formulas so that the partial derivatives of f 
appear more explicitly. To this end, we employ multi-index notation: a = 
(a,,...,a,) denotes a p-tuple of nonnegative integers, and 


jal] =a, +--+ +4Q,, al=a,!---: a,! 
q lal 


o% =A. nk — — > xX = x1 eee x. 
Oxft +++ Oxoe’ ; P 
P 
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With this notation, we have the “multinomial theorem,” 


k! 
Dx" = (x to +x,)%, (7) 


jaja 


which can be proved from the binomial theorem by induction on p. If we use this 
formula to expand the operator (h - v )*, (5) and (6) become 


P,(h,a) = 5 of 9) 


lal<n 


R, (ha) =n > make —t)""“*[a*f(a + th) — d°f(a)] dt. (8) 


jal =n 


Now information about the derivatives 0°f, |a| = n, can be used to estimate R,. 
The analogue of (4) works out most neatly if we use the norm on R? defined by 


KL = [xy + ++ + Lxpl 
Indeed, if we have 
\d°%F (a + x) — d%(a)| < Chx|* 
for |a| = n and x on the line segment from 0 to h, (7) and (8) yield 
hh Cih\"*? 


“| l 4 
R, (h, < h|* 1-1)" dt = ————_____- 


lal=n 


I am grateful to the referee for replacing my original argument with a simpler one. 
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Symmetry in Logos and Hubcaps 


JOSEPH A. GALLIAN 
Department of Mathematics and Statistics, University of Minnesota, Duluth, MN 55812 


In common things that round us lie 
Some random truths he can impart. 
William Wordsworth, The Prelude 


In [7] and [8] Hungarian needlework was used to illustrate the frieze groups and 
the crystallographic groups. Others have used African, Chinese, Indian, and Islamic 
art to illustrate the same families. But what about finite groups of symmetries? 
What are rich sources for illustrating cyclic and dihedral groups through symmetry? 
My favorite ones are logos and hubcaps. These ubiquitous artifacts are preferable to 
exotic artworks and crafts for educational purposes because students encounter 
logos and hubcaps daily. Leafing through the yellow pages or strolling through a 
parking lot becomes a lesson in group theory! 
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A font for logos is the fascinating book American Trademark Designs [3]. Recall 
Leonardo’s Theorem: The only finite plane symmetry groups are Z, and D,, the 
cyclic group of order n and the dihedral group of order 2n [5, p. 375]. Perusing [3] 
and the yellow pages for Madison, Wisconsin, I found corporate logos with the 
cyclic symmetry groups Z,, Z3, Z,, Zs, Z6, Z7, Zs, Z1¢, and Z,,. and the dihedral 
symmetry groups D,, D,, D3, D,, Ds, Ds, Ds, Dy, Di, Dig, Dig, Doo, and Doq. 
Many of these are illustrated in FIGURES 1 and 2. Note that although Z, and D, are 
isomorphic as groups they are geometrically distinct. 


vV ® 


at 


XS 


he 


f= 
eX 


Q 
We te XS 


FIG. 1. Logos with symmetry groups Z,, Z3, Z,, Z5, Ze, Zg, Zig, and Zr. 


Fic. 2. Logos with symmetry groups D,, D,, D,, D5, Dg, Dy4, Dye, and Dy. 


Automobile hubcaps and wheels offer an even more varied array of finite 
symmetry groups provided one makes allowances for imperfections such as the 
notch for the valve stem, the automobile name or logo, and the area where the wheel 
lugs are. Leonardo’s Theorem applies here as well since the symmetry group of the 
orthogonal projection of a hubcap onto a plane (e.g., the photographic image) 1s 
isomorphic to the three-dimensional symmetry group of the hubcap itself. (A 
reflection across a plane in three dimensions corresponds to a reflection across a line 
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in the orthogonal projection.) In one brief foray to a few parking lots I netted 
hubcaps (or wheels) with Z,, symmetry groups for n = 2, 5, 9, 10, 12, 15, 17, 18, 20, 
22, 24, and D,, symmetry groups for n = 2, 3, 4, 5, 6, 7, 8, 10, 12, 13, 15, 16, 18, 20, 
24, 30, 32, 34, 36, and 72. Some of these are shown in FIGURES 3 and 4. (Some 
photos were shot at a slight angle.) Subsequent searching turned up hubcaps with 
symmetry groups Z, for n = 3, 4, 6, 7, 8, 14, 16, 31, and 60 and D, for n = 28, 35, 
40, 76, 80, and 100 (yes, 100). 


Ford LTD Toyota Honda 


Fic. 3. Hubcaps with symmetry groups Z,, Z;, Z,, Zg, Z,, and Z,5. 


Dodge Camaro Chevrolet Caprice 


Fic. 4. Hubcaps with symmetry groups D,, D;, D,, Dj, D,5, and Dyo. 
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in closed form (in terms of a finite number of trigonometric and hyperbolic 
functions). 


E 3376. Proposed by Robert J. Blodgett, Morningside, MD. 
Prove that 


2 
i+j\[4N-—2i-2j\ | (207) 
all 2N -2j }- en +17) 


for any positive integer N. 


E 3377, Proposed by Farhoud Pouryoussefi (student), Sharif University of Tech- 
nology, Tehran, Iran. 


Suppose we consider the polygon with vertices e?7!%, e?7!%2,..., e727» in the 
complex plane, where 
0<0,<A,< -+: <0, < 27. 


Suppose a and 8 are given positive numbers with a + B = 1. Define 6 = 0, for 
i= 1,2,...,n and 


A = a8 + poser? 


for k = 1,2,..., where subscripts are taken modulo n and the angles 6{“) are taken 
modulo 27. Prove that 

lim (0%) — 0) = 2a/n 

k—0o 


for i= 1,2,...,n. 


E 3378. Proposed by Miklos Bona, Budapest, Hungary. 


Suppose the points of Z? (the set of points in the plane with integer coordinates) 
are colored with a finite number of colors. For every n > 3 prove that there exists a 
convex n-gon with vertices and centroid in Z? such that all n + 1 points have the 
same color. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Sums of Five Composite Squares 


E 3262 (1988, 351]. Proposed by R. G. E. Pinch, Emmanuel College, Cambridge, 
and C. Boyd, University of Edinburgh. 


It is known that every natural number n can be expressed as the sum of four 
squares of integers, and that three squares suffice unless n is of the form 4°(8b + 7). 
Show that every sufficiently large natural number can be expressed as the sum of at 
most five squares of composite numbers (i.e., squares of positive integers divisible 
either by the square of a prime or by the product of two distinct primes). 
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Solution I by L. E. Mattics, University of South Alabama, Mobile, AL. Let A be 
any natural number not less than 4(2 - 54 + 1)* + 144. We may choose a number x 
from the set {6,10,12} such that 


A — x? = +4(mod5). 
Since the equation 
A — x* = 4z*(mod 5) 
is solvable for z, there is an integer w such that 
A—x?=4w?(mod5*), 2<w<5*+4+1. 


(Cf. Niven and Zuckerman, An Introduction to the Theory of Numbers, New York, 
1960, 1966, 1972, 1980, §2.6.) By taking y as either w or w + 5* we may choose an 
integer y satisfying the three conditions 


A — x* = 4y*(mod5*), A —x?- 4y? #0,7(mod8), 2<y<2-54+1. 
Since (A — x” — 4y”)/5* is a non-negative integer congruent to A — x* — 4y” 
modulo 8, there are integers a, b, c such that 

(A — x? — 4y*)/54 = a? + B* + c?. 
(Cf. Dickson, Modern Elementary Theory of Numbers, Chicago, 1939, §53.) Hence 
A =x? + (2y)? + (25a)? + (25b)* + (25c)’, 
where x and 2y are composite positive integers and 25a, 25b, and 25c are either 
zero or composite. 


Solution II, prepared by the editors on the basis of a solution by the late Emil 
Grosswald, Temple University, Philadelphia, PA. We prove that every sufficiently 
large positive integer is a sum of exactly five squares of composite numbers. 

Let r,(n) be the number of solutions of x? + x3 + --- +x? =n in integers and 
let r*(n) be the number of solutions of the same equation in which |x,| is a 
composite positive integer for each i. 

It is known (Theorem 386 of Hardy and Wright) that 


r,(n) = 8 » d, 
din,4td 
so that r,(”) > 8n if n is not divisible by 4. Thus 


r(n) = » r4(n —j7) 


—yn sjsyn 

2 » r4(n — f°) 
—yn/2 <jsyn/2 
jJ=n-+ l(mod 2) 

> y 4n 


~yn/e <jsyn/2 
jJ=n+ l(mod 2) 


> 4n(~n/2 — 1) > 2n?”, 


provided n > 25. 
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Now r;(n) — ré(n) is the number of solutions of x? + x3 + x3 +x2+ x? in 


which at least one of the summands is 0, +1, or +p for some prime number p. 
Thus 


r5(n) — r#(n) < 5r,(n) + 107,(n -—1) +10 YY n(n - p?). 
psyn 
Now 


1 
r,(n) <8) d=8n), 77 O(n log log n) 
din din 


by Theorem 323 of Hardy and Wright. Hence 
r,(n) — r*(n) = O(n log log n) + O(a(vn )n log log n), 


where a(x) is the number of primes not exceeding x. By Chebychev’s estimate 
a(x) = O(x/log x) we get 


r,(n) — ré(n) = O(n?” (log log n) /log n). 
Combining the assertions of the preceding two paragraphs, we find that r*(n) > 
n°/? for all sufficiently large n. 


Editorial comment. The proposer remarked that there are exactly 256 positive 
integers which cannot be expressed as a sum of at most five squares of composite 
numbers and that the largest of these is 1167. 

Several mathematicians have pointed out to the editors that actually every 
sufficiently large positive integer can be expressed as a sum of at most four squares 
of composite numbers. This can be established in several ways, none of them 
elementary. If n = 0(mod16) and nv is expressed as a sum of four squares of 
non-negative integers, all of the integers involved are divisible by 4 and hence the 
non-zero ones among them are composite. If n is a sufficiently large positive integer 
not divisible by 16, then by the results on pp. 453-454 of [1] it follows that n is 
represented by the quadratic form 


(69x)? + (95y)* + (119z)* + (143w)’. 


Thus every sufficiently large n is expressible as a sum of at most four squares of 
composite numbers. 
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Solved also by the proposers. 


More Socks in the Laundry 


E 3265 [1988, 456]. Proposed by D. M. Friedlen, Georgia Institute of Technology, 
Atlanta, GA. 
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Suppose n distinct pairs of socks are put into a laundry, where it is assumed that 
each of the 2n socks has precisely one mate. At the completion of the laundering 
operation, socks are drawn out one at a time. Suppose the first pair is realized on 
draw number T,. 

(a) Find an explicit formula for E(T,,), the expected value of T,, and determine 


lim n~'/*E(T,). 
(b) For positive x, prove that 
lim P{T, < xn'/} 


exists and find its value. 


Solution by Doug Pryor, University of California, Santa Barbara, CA. For (a), 
we show that E(TJ,) = 27"/ (2"), from which an application of Stirling’s for- 
mula yields lim n~!/7E(T,) =v. Note that E(T,) = U"i}kP(T, =k) = 
oF C7, > k). However, P(T, > k) = (")2*/(2"), because 7, > k means that, of 


the (2" ways to select the first k socks, we have chosen exactly one from each of 
some k pairs. After canceling and rearranging terms, this yields E(T,) = 
(n!)?(2n)! “17 _G( 2n 2k These summands are precisely 27” times the probabili- 
ties in the well-known Banach match box problem (see, e.g., W. Feller, An 
Introduction to Probability Theory and Its Applications, vol. I, 3rd ed., Wiley, 1968, 
p. 166-167), and thus they sum to 2”. 

For (b), we show that lim P(T, < xVn) = 1 — e~*’/*. Let k = | xVn |. Using the 
formula obtained in (a), we have P(T, < xVn) =1— P(T,>k)=1- (")2*/(2"). 
By applying Stirling’s formula and rearranging factors approximately, we can obtain 
an asymptotic expression for P(T, > k), namely, 


akn! (2n —k)! 
(n-—k)! (2n)! 


kn” (2n—ky"* | 1—k/n \ (1—k/(2n))?)\- 
(n—k)"“ — (2n)" 1 — k/(2n) 1—k/n 
We claim that the two factors in the last expression approach e-*’/2 and e 


respectively, which will complete the proof. In fact the logarithm of the first factor 
1S 


P(T, > k) = 


x? /4 
9 


k {log(1 — k/n) — log(1 — k/(2n))} = —k?/(2n) + O(k7/n*), 
which approaches —x*/2 as n goes to infinity, while the logarithm of the second 
factor 1S 


1 —k/n + k?/(4n?) k? | k4 | 
n log) ———-_—-—_ ; 


= 4+ Qg| ——____ 
1—k/n 4(n — k) n(n— ky 
which approaches x?/4. 


Editorial comment. Many readers noted the connection with the Banach match 
box problem; others gave ad hoc proofs of the summation. Richard Groenveld 
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remarks that the asymptotic distribution P(T < x) =1 —- e~*’/4 found in (b) is 
known as the Weibull distribution. 

A related problem about socks in the laundry appeared as E 3148 [1986, 400; 
1988, 357]. 

Kent Kromarek remarked on the empirical aspects of this problem. 

Solved also by A. Adler, G. Bach (West Germany), C. Belise and J. Sarkar, D. A. Callan, 
S. Dharmadhikari, S. Ethier and P. Griffin, R. A. Groenveld, E. Hertz, L. Holst (Sweden), L. E. Mattics, 


W. A. Newcomb, G. Parker, N. Passell, D. S. Rubin, A. Shimron (Israel), T. R. Shore, P. Tzermias 
(Greece), and the proposer. 


The Limit of an Approximation 


E 3276 [1988, 654]. Proposed by D. M. Bloom, Brooklyn College, CUNY. 


D. J. Newman (A Problem Seminar, Springer-Verlag, 1982, Problem 60) shows 
that if (x,) is the sequence defined by x, = 1 and x,,, =x,+1/x, for n> 1, 


then 
v2n <x, <\2n+ 4Inn 


for n > 3. Put 
G(n) = 2n+ 4Inn — x? 


for n € Z*. Show that maximum and minimum values of G(n) exist, find these 
extrema, and show that lim G(n) exists. 


n- oo 


Solution by Kee-Wai Lau, Hong Kong. The maximum and minimum values of 
G(n) are, respectively, G(1) = 1 and G(13) = 0.2671. Computation shows that the 
value of G(n) decreases when n goes from 1 to 13, but that G(13) < G(14). We 
shall prove that G(n) increases but is bounded (by 0.305) when n goes from 14 
to oo, so that lim, _, .G() exists. 

We shall use mathematical induction to prove the more specific result that, if 
n> 14, then 


0 < G(13) < G(14) < --» <G(n) < G(13) + + (Ink)/(8k*). (1) 


So suppose n > 14 and that we know that (1) holds. From this inductive assumption 
we shall deduce that 


0< G(n+ 1) — G(n) < (Inn) /(8n7) (2) 


and consequently 
0 < G(13) < --- <G(n) < G(n +1) < G(13) + D>) (Ink)/(8k7). (3) 
k=13 
We first note that the equality x?,, = x2 + 2 + 1/x2 implies 
1 
2n+ 4Inn—- G(n)- 


G(n+1)- G(n) = sin + -| (4) 
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Since 


co 6) 


Ink o In x In12+ 1 
G(13) + — < G(13) + —~ dx = G(13) + ———— _ < 0.305 
(13) a age ~ (13) I, gx2 = G(13) 96 


our inductive assumption (1) gives 4In n — G(n) > 41n14 — 0.305 > 1; thus by (4) 
we have 


1 1 1 
+1)- > =-Infl+—-—]|- > 0 
G(n+1)— O(n) > 5 nf | n+] 


On the other hand, our inductive assumption (1) trivially gives G(n) > 0 and so by 
(4) 
1 1 1 In n 


1 1 
G(n+1)-G < —Injl + —}] - ——————- < — - —— <x - +. 
(n+ 1) — G(n) | | 2nt+4Inn  2n  2nt+iInn ~ 8n? 
Thus we have deduced (2) and (3). Thus completes the induction and establishes the 
results of the problem. 


Editorial comment. According to Jean Anglesio, the numerical value of lim G(x) 
is 0.2768576.... 
Solved also by J. Anglesio (France), D. Borwein (Canada), D. Callan, Chico Problem Group, 


Z. Franco, S. M. Gagola, Jr., S. Gaignoux (France), R. Martin (West Germany), J.-M. Monier (France), 
M. Vowe (Switzerland), and the proposer. 


A Property of Singular Matrices 


E 3277 [1988, 654]. Proposed by Tim Sauer, George Mason University, Fairfax, 
VA. 


Suppose A and B are n by n matrices and suppose there exists a nonzero vector 
x such that Ax = 0 and a vector y such that Ay = Bx. If A, is the matrix obtained 
from A by replacing its ith column by the ith column of B, show that 


>) det A, = 0. 
i=1 


Solution I by Thomas Jager, Calvin College, Grand Rapids, MI. Let a,,..., a, 
and b,,..., 5, be the columns of A and B, and let x,,..., x, and y,,..., y, be the 
entries of x and y. By symmetry, we may assume x, # 0. Since the determinant is a 
linear function of the first column, we may write 


n n 
x, , det A, = det(x,b,, a,,...,4,) + ), det(x,a,, d5,.-.5 Gj-15 Djs Gin yy +s An): 
i=1 i=2 


Since L.x,a,; = 0 and the determinant is an alternating multilinear function of all its 
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columns, for i > 2 we have 


det( x11, 45,..-5 4; Dj, Aj445+++5 Ay) 


n 
= y det(—x,a,, Ag5+--5 A;-1, 0;, Aj445--: a,) 


j=2 
= det(—x,a,, a5, ee eg aA;_1>5 b,, GAi4s ose g a,) 
= det(x;b;, 4, ---5 Gj—15 Gis Tints +++5 Gy) 


Translating Ay = Bx as Lx,b; = Ly,a,;, we have 


x, det A, = ) det(x,b,, a),...,a,) = det( )°y,a;, Ag,++-5 a,) 
= det( y,a,, a5,...,a,) = y, det A = 0, 


since the existence of x implies that A is singular. Dividing by x, # 0 completes the 
proof. 


Solution II by A. A. Jagers, Universiteit Twente, Enschede, The Netherlands. The 
condition Ax = 0 implies det A = 0 and rank A < n. If rank A < n — 1, then each 
A, is singular and det A; = 0, so we may assume rank A =n-—1, and the 
nullspace of A is spanned by x. . 

Let A be the adjoint matrix of A; i.e., the i, jth entry of A equals (—1)'*/ times 
the determinant of the matrix obtained by omitting row j and column i of A. If we 
expand each det A, using cofactors along column i, we obtain ) det A, = trace( AB). 
Now AA = (det A): 1 =0- J implies that A( AB) = (det A) - B = 0. Hence each 
column of AB is in the nullspace of A and must be a multiple of x. Since we also 
have ABx = AAy = (det A) y = 0, we conclude (AB)* = 0. This implies that every 
eigenvalue of AB is zero, so trace(AB) = 0, which completes the proof. 


Solution III by Albert Nijenhuis, Seattle. Let R be the commutative ring R[t]/(t7); 
its elements have the form a + tb, where a,b © R, t* =0, and a+ tb = Oif and 
only if a = b = 0. The advantage of computations in R is that any square matrices 
A, B with real-valued entries satisfy det(A + tB) = det A + th det A,;. Let C be 
the adjoint of A +¢B over R, defined as in the preceding solution but using 
computations in R; since R is commutative, C(A + tB) = det(A + ¢tB)-T still 
holds. The conditions Ax = 0 and Ay = Bx imply that, over R, (A + tB)(x — ty) 
= 0. Also, Ax = 0 implies det A = 0. Putting all these facts together, we have the 
following computation over R: 


0 = C(A + tB)(x — ty) = det(A + tB)(x — ty) = (det A + tX det A,)(x — ty) 
= (det A)x + ¢[—(det A) y + (Xdet A,)x] = ¢(X det A,)x. 
Because x is non-zero, this requires ); det A; = 0, as desired. 


Solution IV by Yan-Loi Wong, Colby College, Waterville, ME. As in Solution I, 
let a,,..., a, and b,,...,b, denote the columns of A and B. We may assume by 
symmetry and scaling that x, = 1. Then Ay = Bx and Ax = 0 imply for any real ¢ 
that 


det(A + #B) = det((A + tB)x, a, + tb,,..., a, + tb,) 
= tdet(Ay, a, + th,,..., a, + tb,). 
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This yields 


d 
7 det(A + tB) = det(Ay, a5,..., a,) = det(Xy,a;, a,,..., a,) = y, det A = 0. 
0 


(t= 


d 
However, we also have ah wdet(A + tB) = det A; for any square A and B, so 
t= 
det A; = 0. 


Solved also by S. J. Bernau, J. T. Bruening, D. Callan, D. G. Cantor, J. Ferrer (Spain), S. M. Gagola, 
Jr., L. N. Howard, W. Janous (Austria), L. D. Kadison, O. P. Lossers (The Netherlands), J.-M. Monier 
(France), P. Montgomery, A. Miller (Switzerland), J. H. Nieto (Venezuela), A. Pedersen (Denmark), 
P. P. Ray, P. A. Staley, J. H. Steelman, N. S. Thornber, L. J. Wallen, S. Zakeri (Iran), Western Maryland 
College Problems Group, and the proposer. Three incorrect solutions were received. 


Disjoint Arithmetic Progressions 


E 3278 [1988, 654]. Proposed by John Beebee, Bakersfield, CA. 


Let a mod m denote the arithmetic progression a + km: k © Z. Suppose b mod n 
is disjoint from amod m. What is the smallest positive integer r such that there is 
an arithmetic progression cmodr which is disjoint from both amodm and 
b mod n? 


Solution I by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. The key observation is that a mod m and bmod n are disjoint if and only if 
a # b(mod d), where d = gcd(m,n). Thus r must not be relatively prime to either 
m or n. Let p, gq, and s be the smallest primes dividing m, n, and d, respectively. 
The necessary condition just stated for r implies r > min{s, pq}. To complete the 
determination of r, we consider several cases. 

If s > 2, then the answer is r = min{s, pq}. Because a mod m and b mod n are 
contained in progressions mods and there are at least three of these, there is a 
progression cmods disjoint from amodm and bmodn. On the other hand, if 
pq <.s, then of the pq different progressions mod pq, at most q intersect a mod m 
and at most p intersect b mod n. Since p + q < pq, there is a progression c mod pq 
disjoint from both. 

If s = 2 and a= b(mod 2), then the answer is r = 2. Here both amodm and 
b mod n are contained in the same progression mod 2, and the other is disjoint from 
them. 

If s = 2 and a # b(mod 2), then the answer is r = min{s’,2¢}, where s’ is the 
smallest odd prime dividing d and ¢ is the smallest prime dividing either m/2 or 
n/2. The fact that r must not be relatively prime to m or n gives the lower bound, 
and an analysis like that for s > 2 above gives the existence of such a sequence. 


Solution II by Shippensburg University Mathematical Problem Solving Group, 
Shippensburg, PA. The answer is the smallest element of the set 
{|s,¢]:5,t> 1, s|m, t\n,[s,t] >3—-(a—,2)}, 


where [x, y] denotes the least common multiple of x and y and (x, y) denotes the 
greatest common divisor of x and y. 
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To see this, we first observe that amodmMbmodn= © if and only if 
a —b#In — km for all integers k,/, which holds if and only if (m,n) does not 
divide a — b. Let us say that r is good if there exists an integer c such that c mod r 
is disjoint from amodm and bmod n. For a good r, let d, = (r,m) and d, = 
(r,n). Then d, does not divide a—c and d, does not divide b—c. Since 
({d,, d,],m) = d, and ([d,, d,], n) = d,, it follows that [d,, d,] is good. We also 
have [d,, d,] < r, so we need consider only S = {[s, ¢]: s,¢ > 1, s|m, ¢|n}. 

Every element of S is good except possibly the element 2 when ((s, ¢], m) = 
([s, t], 2) = 2. In this situation [s, t] = 2 and 2 is good if and only if a — b is even. 

Solved also by A. Averill, D. Callan, N. Felsinger, J. Ferrer (Spain), S. Gaignoux (France), D. E. 


Manes, H. M. Marston, A. Nijenhuis, A. Pedersen (Denmark), J. L. Selfridge, R. W. Sheets, and the 
proposer. Partially solved by J.-M. Monier (France). Two incorrect solutions were received. 


An Extremal Problem for Triangle-Free Graphs 


E 3284 [1988, 762]. Proposed by Paul Erdés, Hungarian Academy of Sciences, 
Budapest, Hungary. 


Determine the minimum number of edges in a triangle-free graph on 2n vertices 
whose complement contains no complete graph on n vertices. 


Solution by Richard Stong, Harvard University, Cambridge, MA. For n < 3 there 
are no such graphs. For n = 4 the minimum number of edges in such a graph is 10, 
and for n>5 it is n+ 5. For n=4 the minimum is realized by an 8-cycle 
U},...,Ug with two consecutive diagonals v,v; and v,uy,; call this graph G,. For 
n > 5 the minimum is realized by the graph G,, consisting of n — 5 isolated edges 
and two disjoint 5-cycles. 

An independent set is a set of vertices no two of which are adjacent; forbidding 
an n-clique from the complement of G is equivalent to considering graphs G with 
no independent set of size n. For n > 5, an independent set in G, uses at most one 
vertex from each isolated edge and at most two vertices from each 5-cycle, for a 
total of at most n — 1 vertices. For n = 4, four vertices in G, are independent only 
if they alternate on the 8-cycle, but the diagonals prevent those two sets in G, from 
being independent. For n = 1,2, it is clear that there is no such graph, and for 
n = 3 it is well known that any graph on 6 vertices has a triangle or an independent 
set of size 3. It remains to show that the examples above cannot be improved. 

For simplicity, call a graph n-good if it is a triangle-free graph G on 2n vertices 
with no independent set of size n, and let f(n) denote the minimum number of 
edges in an n-good graph. Let G be an n-good graph with f(n) edges. From the 
desired properties, we may assume that G has no loops or multiple edges. 

If G has a vertex v of degree at most 1, then there is an (n — 1)-good graph, and 
f(n) = f(n — 1) + 1. To see this, let w be the neighbor of v if it has one, and 
otherwise let w be an arbitrary non-isolated vertex. Any independent set in 
G — {v,w} can be augmented in G by adding v, and deleting vertices cannot 
introduce triangles. Thus, G— {v,w} is (m — 1)-good, and the loss of edge(s) 
incident to w shows f(n) => f(n — 1) +1. 

If f(n) <n+5 and n> 5, then the average of the vertex degrees in G is less 
than 2, and G has a vertex of degree at most 1. Thus the proof is completed by 
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proving that f(4) = 10. Because there is no 3-good graph, we may assume every 
vertex of G has degree at least 2. If G, is not optimal, then the optimal G has 8 or 9 
edges and degree sequence 2, 2, 2, 2,2, 2,2,2 or 3,3, 2,2,2,2, 2,2. There are two such 
triangle-free graphs with 8 edges and 6 with 9 edges, but each of these has an 
independent set of size 4. 


Editorial comment. If the coridition “triangle-free”’ is removed from the state- 
ment, then the problem is solved by the well-known theorem of P. Turan (see, for 
example, J. A. Bondy and U. S. R. Murty, Graph Theory with Applications, 
(North-Holland, 1976), p. 109-111). Turan’s Theorem states that the maximum 
number of edges in an N-vertex graph having no complete graph on r vertices is 
attained uniquely by the complete multipartite N-vertex graph with r — 1 parts in 
which any two part-sizes differ by at most 1. If N = 2n and r = n, the complement 
of this graph is the disjoint union of two triangles and n — 3 edges, which has n + 3 
edges. 


Solved also by S. F. Barger, M. Chlebik (Czechoslovakia), R. B. Eggleton (Brunei), N. Hartsfield 
and J. Werth, and P. Tracy. Three other submissions gave the correct answer but no proof. 


A Determinant Identity with Matrix Entries 


E 3286 [1988, 763]. Proposed by William C. Waterhouse, Pennsylvania State 
University, State College, PA. 


Suppose A, B,C, D are n by n matrices and C’, D’ denote the transposes of 
C, D. In a comment on problem 6057 [1987, 1020], M. J. Pelling remarked that the 
condition 


CD’ + DC’ =0 (*) 
implies 
A B / / 
det( 4 4 = det( AD’ + BC’) (**) 


if D is nonsingular, but gave an example with singular D satisfying (*) in which one 
side of (**) is 1 and the other side is ~1. Prove that in any case (*) implies 


2 
A B\ _ , \2 
der( 4 2 det( AD’ + BC’), 


Solution I by Dennis R. Estes, University of Southern California, Los Angeles, CA. 
Let J, denote the n by n identity matrix, and let P = (; i) Then 


(dB) maid Shele(é B)e)-ole BI 4) 


C D C D C D Cc D\\c’ A’ 
_q ee( 4?” + BC’ AB’ + BA’) 
0 CB’ + DA’ 


= det( AD’ + BC’)det( AD’ + BC’) = det( AD’ + BC’)’. 
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Solution II and generalization by David Callan, University of Bridgeport, Bridge- 
port, CT. We will show that (*) implies 


aet( 4 | = (—1)det(AD’ + BC’), (***) 


where r= rank(C). Since both hypothesis and conclusion are unchanged when 
A, B,C, D are replaced respectively by Q~'4Q0, Q~'B(Q’)~1, PCQ, PD(Q’)™! for 
suitable nonsingular matrices P,Q, we may assume that (4 , has the matching 
block form 
Ay, Ay By By 
B 


Ay Ay By 22 
I 0 Dy, Dpl 
0 0 D, Dy 


where J is the r by r identity matrix. The hypothesis implies Dj}, = —D,, and 
D,, = 0. We have the following simple rules for manipulating the determinant of a 
blocked matrix. If a (matrix) multiple of one row of blocks is added to another, then 
the determinant is unchanged. If two adjacent rows of blocks (involving 7, 7, rows 
respectively of the full matrix) are interchanged, then the determinant is multiplied 
by (—1)"%. Similarly for column manipulations. Using the rules, it is easy to 


manipulate both sides of (***) into the same form: (—1)’"det D,, det( 42 Bu ~ An | 
Ag, By, — An Diu 
Editorial comment. A collaborating editor required a full handwritten page for 
each of the “manipulations” mentioned in the last sentence of Solution II. 
In [1] Marvin Marcus proved a result involving the same determinants as 


(* * *) but with (—1)’ replaced by (—1)”~*, where s is the rank of D. In fact, if (*) 
holds and if det( AD’ + BC’) # 0, then n — s =r (mod 2). 


1. Marvin Marcus, Two determinant condensation formulas, Linear and Multilinear Algebra, 22 (1987) 
95-102. 


Solved also by R. C. Alperin, D. Callan, S. Gaignoux (France), Y. Ikeda, A. A. Jagers (The 
Netherlands), B. Kirchheim (Czechoslovakia), O. Krafft (West Germany), J. R. Kuttler, O. P. Lossers 
(The Netherlands), T. L. Markham, J.-M. Monier (France), B.-S. Moon and K.-W. Jun (Korea), 
A. Nyenhuis, D. E. Otero, P. P. Ray, Univ. of South Alabama Problem Group, Western Maryland 
College Problem Group, and the proposer. 


Another Congruence Identity 


E 3297 [1988, 953]. Proposed by Gunnar Blom, University of Lund and Lund 
Institute of Technology, Sweden. 


If m and k are positive integers and m + 2k = pr, where p = k is an odd prime 
and r is a positive integer, prove that 
k-1 ; 
qmt+l > (-1)’ m +i 

j=0 


= 2’-! mod p. 
j P 


Solution by Peter Montgomery, Unisys and University of California at Los Angeles. 
If 7 <p, then (; is a polynomial in x with integer coefficients divided by a 


1990] PROBLEMS AND SOLUTIONS 251 


number /! that is relatively prime to p, which implies that the difference between (*) 


and (’ * *) is divisible by p when x is an integer. If 0 <j < k —1, then j <p, and 


we have 
(-)’ “(eh wi * (mod »). 


We can now wvaluate the sum modulo p: 


“= (2k -1 
qm+l by (- 1)/ = qm+l » | j _— gmt+ig2k—-1 79 
j=0 


my 


me 


_ qmt+2k-1 _ dpr-l = 2’-1 mod p. 


Solved also by D. Callan, J. Duemmel, S. Gaignoux (France), J. J. Kostal (student), B. Lindstrém 
(Sweden), O. P. Lossers (The Netherlands), L. E. Mattics, A. Pedersen (Denmark), J. H. Steelman, 
N. Strauss (Brazil), and the editors. 


A Characterization of the Least Common Multiple, Again 


E 3300 [1988, 954]. Proposed by H. Das Gupta, Kanchrapara, West Bengal, 
India. 


Let N be the set of natural numbers. Prove that there is exactly one mapping f 
from N X N to N having the following three properties for x, y € N: 

(i) f(x, y) = f(y x), 

(il) f(x, x) = x, 

(ii) (y — x) f(x, y) = yf(x, y — x) if y > x. 

Solution I by Glenn Powers, Western Kentucky University, Bowling Green. The 
least common multiple satisfies the given properties. If f, and f, are two appropri- 
ate mappings but /, # f,, then there exist s,t © N such that the product st is 
minimal and f(s, t) # f,(s, t). By property (i), s # t, so we may assume that 
t > s. Now property (iii) implies that f,(s, t) = [¢f/,(s, ¢ — s)|/(t — s) and f,(s, t) 
= [tf,(s, t — s)|/(t — s). This implies f,(s, t — s) # f,(s, t — 5), which contradicts 
our choice of s and t. 


Solution II by J. B. Wilker, University of Toronto, Canada. The function must be 
the least common multiple. The functional equation (ii) can be written more 
suggestively as f(x, y)/(xy) = f(x, y — x)/[x(y — x)] if y > x. Repeated appli- 
cations of this, facilitated by (1) when x > y, transform the independent variables by 
the Euclidean algorithm and therefore reduce the expression itself to f(d, d) d~?, 
where d = gcd(x, y). Property (ii) then implies that f(d, d)d~* = d~' and hence 
that f(x, y) = xyd~! = Iem(x, y). 


Solution III by Duane W. Bailey, Amherst College, MA. It suffices to give an 
algorithm for explicitly computing each value of f. It is convenient to view these 
values as an infinite matrix f(m,n). The diagonal { f(n,)} can be entered by 
property (ii). The formula f(n,1) = f(_, 2) = fC, — 1)n/(n — 1) for n > 1 pro- 
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vides a recursive formula for entering the first row and column. If we now assume 
that the first m-— 1 rows and columns have been entered, then the formula 
f(n, m) = f(m, n) = f(m,n — m)n/(n — m) for n > m provides a recursive for- 
mula for computing the remaining values in the mth row and column. 


Editorial comment. Except for a trivial change of notation, this problem is 
identical to E 2661 [1977, 487; 1978, 684]. This information came to the editors’ 
attention through a remarkably late solution of E 2661 that arrived just recently. 
Almost all solvers either noted that the function was the least common multiple and 
established the result by induction, or rephrased the problem in terms of g(x, y) = 


xy /f(x, y). 


Solved also by the proposer and 71 other readers. 


ADVANCED PROBLEMS 


6625. Proposed by Nicholas Strauss, Pontificia Universidade Catolica do Rio de 
Janeiro, Brasil, and Jeffrey Shallit, Dartmouth College. 


If k is a positive integer, let 3”) be the highest power of 3 dividing k. Put 


Hin) = (2!) 


i=o \! 
for positive integers n. Prove that 


(i) v(r(n)) = 2(n), 
(ii) v(r(n)) = »((2”)] + 2v(n). 


6626. Proposed by Bruce G. Dearden and Michael B. Gregory, University of North 
Dakota, Grand Forks. 


It is well known that if ¢ is a continuous function on R, then the composition 
oo f is Riemann integrable on [a, b] whenever f is Riemann integrable on [a, b]. Is 
the converse true? That is, suppose ¢: R > R has the property that ¢°/f is 
Riemann integrable on [a, b] whenever f is Riemann integrable on [a, b]. Does it 
follow that ¢ is continuous? 


6627. Proposed by G. Boese, Munich, West Germany. 
If n is a positive integer, put 


f(s) = Dyin. 
j=1 


J 


It is known that there are infinitely many n such that ¢, has zeros in {s: Res > 1}. 
However, prove that 

(i) if n = 1,2,3,4,5, then Ref (s) > 0 for Res > 1; 

(ii) if m = 6,7,8,9,10, then |f,(s)| > 0 for Re s > 1. 
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SOLUTIONS OF ADVANCED PROBLEMS 


Gaps in the Sequence of Integers Expressible as Sums of Two Squares 


6539 [1987, 303]. Proposed by Claudia A. Spiro, SUNY at Buffalo. 


Let b, < b, < b; < --- be the distinct positive integers expressible as sums of 
two squares of integers. Prove that for any given positive integer d the equality 
b, 4, — 5, = d holds for infinitely many n. 


Composite Solution by Douglas A. Hensley, Texas A&M University, College 
Station, and the proposer. We give a proof based on the following lemma, which is a 
consequence of the quadratic reciprocity law and Dirichlet’s Theorem on primes in 
arithmetic progressions. 


LEMMA. If N is a positive integer which is not a square, there are infinitely many 
prime numbers p such that p = 3(mod 4) and (>) = —1. 


Proof. Suppose N = 2°Q, where Q is odd. 

If a is odd and Q = 3(mod4), any prime p such that p = 1(modQ) and 
Pp = 7(mod 8) will do. 

If a is odd and Q = 1(mod4), any prime p such that p = 1(modQ) and 
Pp = 3(mod 8) will do. 

If a is even and Q = 3(mod4), any prime p such that p = 1(modQ) and 
Pp = 3(mod 4) will do. 

If a is even and Q = 1(mod 4), let g be a prime dividing Q to an odd multiplicity 
5; then we may use any prime p such that p = 1(mod Qq~°), p = 3(mod 4), and p 
is congruent to a particular quadratic non-residue modulo q. 

In each case we have selected p to be in a certain coprime residue class modulo a 
divisor of 4N; Dirichlet’s theorem then guarantees that there are infinitely many 
such p. Thus the lemma is proved. 

To prove the assertion of the problem, suppose first that d is odd. Put D = 
(d—-1)/2, ie, d=2D+1. We shall determine an arithmetic progression of 
numbers k such that 


k* + D*+j isnot asum of two squares for j = 1,2,...,2D. (1) 


Then k* + D* and k* + D? + 2D + 1 are consecutive “sums of two squares” with 
difference 2D + 1 = d. 

Since D? < D* + j<(D+1)* for 1 <j < 2D, we see that D? +/ is not a 
Square and so we may apply the Lemma to find a prime p, such that 


—D’?-j D* +j 
p, = 3(mod 4), ———]=- =1, Py # Py, Prs-++> Pj-1 
P; DP, 
for j = 1,2,...,2.D. Thus there is a number 6; between 0 and 2p, such that 
B? = —D? — j(mod p,), B?#—D?—j(mod p?) (jf = 1,2,...,2D). 
(If the smallest positive solution of x? = —D?* — j(mod p,) does not satisfy x* # 


— D* — j*(mod p?), merely add p, to it.) Finally, if k = 8 (mod p?) for j = 
1,2,...,2D, then k? + D + j is exactly divisible by the first power of p; and so is 
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not a sum of two squares (j = 1,2,...,2D). Thus (1) holds and the case of odd d is 
settled. 

The case of even d is similar but slightly more complicated in detail. Put 
D=d/2—1, ie, d=2D + 2. We shall determine an arithmetic progression of 
numbers k such that 


k? + (k + D)’ + j isnot asum of two squares for j= 1,2,...,2D +1. (2) 


But then k* + (k + D)? and k? 4+ (k + D)?+2D+2=(k -— 1)? + 
(k + D+ 1)? are consecutive “sums of two squares” with difference d = 2D + 2. 

In view of the identity 2(x? + y*) = (x + y)? + (x — y)’, it suffices to choose k 
so that the number 


2{k? + (k+D)°+j} =(2k+D)> + D? +2; 


is not a sum of two squares for j= 1,2,...,2D +1. Since D* < D?+2j < 
(D + 2)? for 7 =1,2,...,2D +1 and D? +27 #(D+ 1)? (mod 2), we see that 
D* + 2j is not a square for j = 1,2,...,2D + 1. Thus we may use the lemma to 
find a prime number p, such that 


D? +2; 


=1, Py F Py, Pas---> Pj-1 
P; 


—D*?-2j 
p, = 3(mod 4), —_ — 


} 
for 7 = 1,2,...,2D + 1. Thus there is an integer Y; between O and 4 P; such that 
vy; = —D?-—2j(modp,),  -y, = D(mod2), yy? # — D? - 2 (mod p?) 
for 7 = 1,2,...,2D. (If the smallest solution of 
x* = —D?—2j(mod p,), x = D(mod2) 


does not satisfy x7 # —D* — 2 j(mod P;)s merely add 2p, to it.) Finally if k = 
(y, — D)/2 (mod p?) for j = 1,2,...,2D + 1, then 


(2k + D)?> + D?+2j=y7+d?+2j (mod p?) 


and so (2k + D)? + D? + 2; will be divisible by p, but not by p? (j = 1,2,..., 
2D + 1). Thus (2k + D)? + D? + 2j and hence k*? + (k + D)? +j is not a sum 
of two squares for j = 1,2,...,2D + 1. Thus (2) holds and the case of even d is 
settled. 


Solved also by L. E. Mattics. 


Prescribed Zero Sets for an Entire Function and Its Derivatives 


6570 [1988, 359]. Proposed by L. A. Rubel, University of Illinois at Urbana- 
Champaign. 


(a) Let (z,) and (z/) be sequences in C, neither with a finite limit point. Assume 
that if a complex number w occurs exactly k times in (z,) where k > 1, then it 
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occurs exactly k — 1 times in (z/). (Subject to this restriction, we allow finite or 
even empty sequences.) Show that there exists an entire function f such that the 
zeros of f are exactly at the points of (z,,) and the zeros of f’ are exactly at the 
points of (z/), with the proper multiplicity in each case. 

(b) Can one similarly prescribe three sequences (z,), (27), (z;’) with correspond- 
ing assertions about the zeros of f, f’, and f’’? 


Solution by the proposer. For convenience of notation, we will assume that the z,, 
are distinct—if they are not, then the proof below can be modified in a standard 
way. For a sequence (w,,) with no finite limit points, let [](z: w,,) denote a 
Weierstrass product with (w,,) as its zero set. We will repeatedly use the classical 
interpolation theorem that for any such (w,,) and a corresponding sequence (&,,) of 
complex numbers, there exists an entire function A(z) such that A(w,,) = &,, for 
all m. 

In brief, the steps for the first part are the following: 

Step 1. Choose an entire function K(z) so that 

K(z/) _— _ I] (27:2) 
I1( 252m) 


where I’ denotes the derivative of IT. 
Step 2. Define 
K(z)I\(z:z,,) + IV(z:z,) 


R(z) = l(z:z/,) 


Step 3. Choose an entire function 6(z) so that 6(z,) = log R(z,), say for the 
principal branch of the logarithm. 


Step 4. Define 
e°) — R(z) 


M(2) = II(z:z,,) 


Step 5. Define h’(z) = K(z) + M(z)I\(z:z/,). 
Step 6. Define f(z) = e”"T](z:z,,). Then 
f'(z) = e?[h’(z)TM(z:z,,) + TV(2: 20) | 
= e"2)[{ K(z)M(z:z,,) + TV (z:2_)} + M(z)MM(z: 2), T(z: z,,)] 
= e"2T(z:2/)[R(z) + eFl2) — R(z)| = eM+92TT( 2:2"), 


and the result follows. 

For the second part, the answer is NO, as we can see on trying to prescribe Z(/), 
the zero set of f, by putting Z(f) = Z(f”) = @ and Z(f’) = {0}. By [HAY, p. 
67], we would have necessarily f(z) = e%7*? from Z(f) = Z(f”) = @, but then 
Z(f’) would also be empty. 


REFERENCE 
[HAY] Walter K. Hayman, Meromorphic Functions, Clarendon Press, Oxford, 1964. 
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An Identity Involving Sums and Products 


6575 [1988, 559]. Proposed by Richard Stanley, Massachusetts Institute of Tech- 
nology. 


Let X1,...,X,5 Yi---» ¥, be variables and for i<j set X,,= x; + X;41 
+ +++ +x, and Y,,=y,+ Yiu, +--+: +y; Also set X;,,,= Y,41,, = 0, and any 


Z 


1 n+1-i 


n I] (X;, + Y, j-1) ° I (Xia nti—k + Y, nt1—k) 


j=l 
i-1 n—-i — LL X;- 


=I (X44, 4 Y,i-1)° LT (Xia nti-s + Y; n-s) ‘SJ 
S= 


For instance, when n = 2 we get 


x(x, ty, +)M fi (x, + xX. + yj) xy 
Xx + yy Xo t+ Vy 


= XY, + XY. + Xp. 
Solution by D. E. Knuth, Stanford University. Let 
f(z) = — (x; + z) IM ( _ y/(z — X, | _ Y, j-1)). 
j= 
On a sufficiently large circle |z| = R, the Laurent expansion 


f(z) = — (x, + z) IT( — yjz — y,(X, ; + Y,j-1)2° + O(R~>)) 
j= 


Yon dcjcnX Ve 


1—- 
72 


= —(x, +z) + O(R~?) 


=-z-x,+%,+ Lo x,y,27'+ O(R”’) 
J<k 


converges uniformly. Integrating term by term gives the right side of the stated 
identity: 


1 
=— | f(z)dz = Xe: 
iz|=R jek 


QT i 


This integral is also the sum of the residues inside; and the residue at z = X, , + 
Yi ~-1 1S 


J 1, j 
9 
— Yi j-1 


X> + Yio ~ X>, /— Y, 


V(X + OY p12) [] 
e( ti PIM Sen X04 + Yi pi — X) 


J#k 


which is the kth term on the left side of the stated identity. 


oJ 


Editorial Comment. A more elementary but somewhat lengthier proof was pro- 
vided by Sidney Heller (Brookhaven National Laboratory, Uptown, NY). The 
proposer informs us that this identity arose in the study of the irreducible characters 
of the symmetric group S,. His proof involved the Young diagram and the 
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Frame-Robinson-Thrall “hook length formula.” Knuth’s solution above uses a more 
generally familiar technique, and has the additional virtue of exhibiting another 
(perhaps simpler) way of writing the left-hand side. 

Professor Anatoly M. Vershik of Leningrad State University has informed us that 
the algebraic identity of this problem occurs as formula (12) of his paper [1]. He 


used essentially the same proof as the proposer, although his original motivation 
was entirely different. 


1. A. M. Vershik, The hook formula and related identities (Russian), Zap. Naucn Sem. Leningrad 
Otdel. Mat. Inst. vol. 172, Differential Geometry, Lie Groups, and Mechanics, 10(1989) 3-20. 


The solutions by Heller, Knuth, and the proposer were the only ones received. 
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REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


A Formalization of Set Theory without Variables. By Alfred Tarski and Steven 
Givant. American Mathematical Society, Colloquium Publications, Volume 41, 
Providence, 1987. xxi + 318 pp. 


Lis GAAL 
School of Mathematics, University of Minnesota, Minneapolis, MN 55455 


Formalizations of the laws of thought and reasoning date back to Aristotle 
(384-322 B.c.), but set theory only dates back to the work of Georg Cantor 
(1845-1918) in the 19th century, yet the mathematical analysis of logic, set theory, 
and also modern algebra are now very closely intertwined and each benefits by 
advances in the other. 

The language of mathematics, like all other languages, evolved gradually over the 
centuries and the notion of variables grew with it, almost assuming a life of its own. 
After all, what exactly do we mean when we start a sentence “For all x,...” or say 
“There is an x such that x* = —1.” What are all the x’s we might have to 
consider? Variables are much like pronouns and so ought to be expendable. Omar 
Khayyam stated mathematical problems beautifully using poetry but no variables. 
Aside from the fact that few mathematicians are as adept at poetry, this method 
results in very convoluted linguistic tangles when applied to modern problems. The 
work of Boole and DeMorgan early in the 19th century not only showed that the 
union, intersection and complementation of sets follow specific algebraic laws, but 
also that sets can model the language and syllogisms of Aristotle. For example, if 
A(x) and B(x) are statements about some object x, and A = {all x’s for which 
A(x) is true} and B = {all x’s for which B(x) 1s true}, then the sentence 


Vx(A(x) A B(x)) & (WxA(x) A VxB(x)) 
is equivalent to the set-theoretic statement 
ANB=12e(A=1&B=1). 


The new statement does not contain any quantifiers ‘Vx’ nor ‘4x’ and it is made up 
from equations only. Moreover it seems clear that there must be some relation 
between the Boolean algebra of sets and logical statements which works both ways: 
The Boolean sentence is true if and only if the corresponding logical statement is 
also true. 

Cantor developed set theory as a basis for the natural number system and for a 
mathematical theory of the infinite, a subject heretofore considered untouchable. 
Although paradoxes were discovered in Cantor’s version of set theory, later systems 
amended its shortcomings and Russell and Whitehead’s monumental Principia 
Mathematica made set theory the basis of all mathematics. To carry out this task it 
is necessary to represent ordered pairs, n-tuples, relations and functions by sets: If 
we write ‘{ x,,x,,...}’ to denote the set containing x,,x,,...then the ordered pair 
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(X1,X2,...) and n-tuple (x,,..., x,) can be defined by 
(1X2) = { {x1}, { 215% } , 
(X15%25%3) = ((X1,X2),X3) 
(Xp ees Xa) = CCX py ees Mpa) Xn): 


A binary relation R between elements of a set A is then a set of ordered pairs 
(X1,X2) whose elements are in the relation R: for example, the relation denoted by 
“ < ” between integers consists of the set of all ordered pairs (x,,x,) in which the 
integer x, is less than the integer x,. Analogous definitions apply to relations on n 
elements. If R is a binary relation it is customary to write ’x,Rx45 whenever (x,,x,) 
belongs to the relation R. Functions are special kinds of relations, namely those for 
which any given element x, occurs in at most one pair in R. Numbers can also be 
thought of as sets of a special kind. For example, one can let 


O = {} (the empty set), 


1 = {0}, 
2 = {0,1}, 
3 = {0,1, 2}, 


and begin to build up the number system and mathematics from there. 

The title of this book may lead one to believe that it concerns a special kind of 
set theory which needs no variables. This is however not so. Instead it is a pointer to 
one of the important theorems proved: Every one of the familiar axiom-sets for set 
theory, as well as those of a great many other axiom systems, can be equivalently 
formulated in a language L* which has no variables. 

In mathematics, variables are used primarily to serve as pronouns and to 
designate those places in a sentence which are all to be filled in with the name 
indicated in the corresponding quantifier “for all...” or “for some....” For 
example the rather awkward sentence “there is a real number which when cubed 
gives the same result as when it is multiplied by three’ is customarily written 


4x (x is a real number and x? = 3x). (1) 


This makes the content of the sentence much clearer. But there are some difficulties 
with variables. For instance, when you say “for all x,” do you really mean “for all 
objects in the universe” or “for all real objects in the universe and all abstract 
concepts that you, I or anyone else may ever conceive’? The latter presents a pretty 
tall order which may lead to some deep philosophical discussions. If it could be 
shown that quantifiers and variables are not really necessary one could avoid these 
deep (although often fascinating) problems. In this simple example it is easy to get 
rid of the variable x: Let Cu be the set of all pairs (r, r?) of real numbers and let Tri 
be the set of all pairs (r, 3r) of real numbers. These are now fixed sets, defined once 
and for all. Their meaning will not change with the context. Instead of (1) we can 
then write 


Cu Tri # @ (2) 


260 LISL GAAL [March 


(where © is the empty set) and the (false) statement “every real number when 
multiplied by three is equal to its cube” becomes 


Cu = Tri. (3) 


This sentence is now free of variables. 

Even in only slightly more complicated cases, especially when there are several 
different variables in one sentence, more technical tricks are necessary to carry out 
their elimination and the resulting sentences become very contorted and difficult to 
handle. This is partly due to the fact that whenever several variables are involved 


one must use ordered n-tuples (x,,...,x,) and their inverse functions 
fi( Xp +s Xy--+>X,) = X, (for k fixed, 1 < k <n). Since ordered n-tuples are 
easily definable in terms of iterated ordered pairs (1.¢.,(x),...,%X,) = 
((X4,X95--+>X,-1)9X,)), Only the pairing function Q(x, y) =z and its inverses 


A(z) =x and B(z) = y need be present. (A very lucid informal exposition of the 
elimination of variables will be found in the article “Variables explained away” by 
Willard V. Quine (Proc. of the Amer. Philos. Soc., 104 (1960) 343-347.) This book’s 
result is, however, very much stronger than the one in this article. 

The many known paradoxes of set theory have taught us to treat sets with kid 
gloves, so for safety’s sake the first step must be the development of a suitable 
formal logic L. This is essentially a formalization of first order predicate logic with 
equality between individuals used throughout mathematics, except that only one 
constant binary predicate EF is included, intuitively to correspond to the member- 
ship relation. The system L includes the sentential connectives A (and), V (or), 1 
(not) and the quantifiers V (for all) and 4 (for some) as well as the customary laws 
and rules governing these. A set theory such as the Bernays-Gédel system can then 
be formulated in L by adding a suitable collection of axioms involving the predicate 
E. The simple examples above already suggest that it might be possible to avoid the 
use of variables and quantifiers by appropriate translation of the sentences of L 
into equivalent sentences about sets. 

Again thinking of binary relations R, S,...as sets of ordered pairs and writing 
‘xRy’ whenever the ordered pair (x, y) is a member of R, we then add the new 
symbols +,-— ,©,~ and = to L and also the following axioms which tell us how 
they are used: 

Let R, S be any binary relations, then 


Vx, y{x(R + S)y @ (xRy V xSy)} 
Wx, y{xR-y @ 1(xRy)} 
Vx, y{x(ROS)y © dz(xRz A zSy)} 
Vx, y{xRYy @ yRx } 
R=S @Vx,y{xRy @ xSy}. 


Since these axioms are equivalences, the result is a definitional expansion of L, so it 
is not surprising that L and L* are equivalent. By “equivalent” we mean that if a 
set U of elements is given and also a set E C U X U, then U and E will satisfy all 
the axioms and theorems of L if and only if they also satisfy the axioms and 
theorems of L*, provided x,y,...range over U and E is interpreted as E. The 
relation = then becomes the equality of the elements of U. For convenience we let 
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1 = {all pairs(x, y)}. The non-equality (2) is then equivalent to the equation 
10(Cu Nn Tri) ©1 = 1, (4) 


since 1 R 1 = 1 whenever R is a non-empty set of pairs, that is whenever Ix, y 
{xRy}. What’s more, this equation does not contain any variables. The rule of 
substitution of equals for equals in equations of L* can be proved to hold, so the 
set of equations of L* forms a natural subsystem L™* of L* and is closed under the 
substitution rule. However even the simple statement that there exist more than 
three elements cannot be expressed in the form of an equation in L~. This is a result 
of Korselt cited by L. Lowenheim (Uber Méglichkeiten im Relativkalkiil, Math. 
Ann., 76 (1915) 470.) So L* is not nearly as powerful as L*. In fact it turns out that 
L* is equivalent precisely to the fragment of L which has only three variables. Yet, 
despite the weakness of L* and almost paradoxically in spite of Korselt’s result, it 
is adequate for the formulation of the standard set theories and many others as well. 

To state this more precisely, we call a system of axioms in L a Q-system if there 
are formulas in L which define a pairing function Q so that Q((x,y)) =z is 
uniquely determined and also corresponding inverse functions A and B which allow 
us to recover x and y when z is given. The title theorem then states that every 
Q-system S in L is equivalent to some system S’ in L*. But there is much more: 
For example, S’ is decidable if and only if S is decidable, S’ is finitely axiomatiz- 
able if and only if S is finitely axiomatizable. Since the standard set theories are 
Q-systems, these results can be applied. Many other systems are also Q-systems, 
most importantly the Peano Axioms, so they are also covered by these theorems. 
The preceding is perhaps an oversimplified account of the theorem which gives the 
book its title. There are several hundred other theorems in this volume and they are 
far from trivial. The book covers far more than just the standard set theories. 

Algebraists began to study Boolean algebras in the 19th century, and it was also 
still in the 19th century that E. Schréder [Vorlesungen tiber die Algebra der Logik 
(exakte Logik), Leipzig, 1891] began the study of relation algebras. A relation 
algebra is a Boolean algebra whose elements are binary relations and with addi- 
tional operations ©, U, and a distinguished element 1’ in A such that for all 
relations R, S, T in A we have 


(ROS) OT=RO (SOT) 
(RUS) OT=(ROT)U(SOT) 
ROV=R 
R’Y’=R 
(RUS)~=RVYUSY 
(ROS)~=(RY OS”) 
(RY ©(ROS) ) US=1. 


These axioms are all algebraic in nature, in fact they are all equations. Models of set 
theories, as well as models of many other systems, are relation algebras provided the 
membership relation is among the elements of A. The equational axioms which 
characterize relation algebras are another example of the parallelism between logical 
and algebraic results, for one of the results proved here is the fact that these are the 
axioms of an undecidable theory and this result is essentially equivalent to Markov’s 
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theorem [Theory of Algorithms, The Israel Program for Scientific Translations, 
Jerusalem, 1961] that the word problem for semigroups is unsolvable. The books by 
Henkin, Monk, and Tarski on cylindrical algebras [Cylindrical Algebras, North 
Holland Publishing Co., Amsterdam, Part I (1971), Part II (1985)] and Halmos’ 
beautiful collection of papers on polyadic algebras [ Algebraic Logic, Chelsea Pub- 
lishing Co., New York, 1962] are more evidence of the close relationship between 
algebra and logic. Together with Tarski-Givant’s book they also provide an excellent 
bibliography of algebraic logic. Tarski-Givant is the newest of these books and very 
likely not the last, if the many open problems stated throughout are any indication. 
It is not a book for the casual reader. It requires a lot of attention to detail and 
technicalities, as well as familiarity with related literature to appreciate the results. 
Many of these will probably lead to further important applications. 


Complex Analysis. By Joseph Bak and Donald J. Newman. Undergraduate Texts in 
Mathematics, Springer-Verlag, New York, 1982, x + 244 pp. 


Invitation to Complex Analysis. By Ralph Philip Boas. The Random House/ 
Birkhauser Mathematics Series, Random House, New York, 1987, xiv + 348 pp. 


LAWRENCE ZALCMAN 
Department of Mathematics and Computer Science, Bar-Ilan University, 52900, 
Ramat-Gan, Israel 


By and large, mathematical disciplines get the texts they deserve. Certainly this 
has been the case with complex function theory. That “showpiece of nineteenth 
century mathematics” has spawned some of the great books of twentieth century 
analysis—a short list would include Hermann Weyl’s The Concept of a Riemann 
Surface, Polya and Szegé’s Problems and Theorems in Analysis, and Nevanlinna’s 
Analytic Functions, classics all, fresh as the day they were written, fifty, sixty, 
seventy years ago and more—as well as a host of superb graduate texts. Indeed, the 
instructor in a first graduate course in complex variables faces a genuine embarrass- 
ment of riches: a selection of treatments distinguished not only for their mastery of 
the subject matter but also for the freshness and distinctiveness of their point of 
view. One need only mention, among the older texts, such masterpieces as 
Carathéodory’s Theory of Functions and Analytic Functions by Saks and Zygmund. 
The current market seems to be dominated, deservedly, by Ahlfors’ authoritative 
Complex Analysis and Rudin’s extremely elegant Real and Complex Analysis. Nor 
should one pass in silence over such valuable recent additions to the literature as 
Narasimhan’s beautiful Complex Analysis in One Variable or the late Peter Henrici’s 
three-volume Applied and Computational Complex Analysis, which history may well 
mark as a meisterwerk of enduring value. 

Alas, so far as undergraduate texts are concerned, the situation is very different 
indeed. Here a kind of Gresham’s Law for textbooks applies, with stale texts driving 
out fresh—on those rare occasions that a fresh one manages to get written. The 
result (to exaggerate only slightly) has been a drab and colorless phalanx of 
Churchill clones, presenting the same material in the same way in the same order. 
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All that is changed now with the appearance of the books under review. Written 
by men who really understand complex variables, they are, quite simply, better than 
anything else in the field; and if, despite somewhat varying aims, they seem 
remarkably similar in their choice of material (even beyond the canonical core of a 
first course in complex analysis), that is simply a reflection of a shared assessment 
of what is genuinely important in complex function theory. 

The stated aim of Complex Analysis by Bak and Newman (based on lectures 
delivered by the second author at Yeshiva University in the early seventies) is “to 
present the theory of analytic functions with as little dependence as possible on 
advanced concepts from topology and several-variable calculus... also to highlight 
the authentic complex variable methods and arguments as opposed to those of other 
mathematical areas.” That aim is realized splendidly. While there are certainly 
novelties of exposition (of which the development of the properties of entire 
functions before a detailed study of the local theory is perhaps the most notable), 
what is really exciting about this text is the nonstandard material it contains. This 
applies even, and especially, to the discussion of familiar results. Thus Liouville’s 
Theorem is given not only in its usual version (“a bounded entire function is 
constant”) but also in extended form, in which an inequality of the form | f(z)| < 
A + B\z|*, or even a one-sided inequality like Re f(z) < A|z|* for sufficiently large 
z, implies that f is a polynomial of degree at most k. (These results are also 
presented in Boas. For an alternate development, available quite early in a first 
course, see [12].) A related result, less likely to be familiar to the reader, is that an 
entire function which satisfies | f(z)| < 1/|Im z| must vanish identically. The maxi- 
mum modulus principle is accompanied by an “anti-calculus proposition,” at- 
tributed to Erdés, according to which a nonconstant function f analytic on a closed 
disc cannot take its maximum at a boundary point at which /f’ vanishes. (Boas also 
states this proposition, which he, citing Polya-Szeg6, attributes to Loewner.) New- 
man’s treatment of sums and integrals via residue methods ranges beyond the usual 
examples to a discussion of the evaluation of indefinite integrals of rational 
functions (also discussed by Boas) as well as the calculation and (in general, much 
more difficult) estimation of sums involving binomial coefficients. It is shown, for 
instance, that 


lim YS (-1)! (i) = 0; 


n> k= 


try proving that without complex variables. 

In addition to the standard material of a first course through the Riemann 
mapping theorem and Poisson’s formula, various special topics are developed. 
Particularly noteworthy is the discussion of theorems of Phragmén-Lindeléf type, a 
topic absent even in Ahlfors. These results are used to derive Iversen’s Theorem (a 
nonconstant entire function tends to infinity along some path tending toward 
infinity) as well as to prove that a solution of f(z) = —f(z/2) analytic at 0 must 
be an entire function unbounded on every ray. The final chapter, devoted to 
mathematical applications of function theory, shows how complex variable methods 
can be applied to partition problems and the study of infinite systems of equations 
and even used to calculate the total variation of (sin x/x)* on the line. Also 
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included here is a very slick proof of the uniqueness theorem for Fourier transforms 
on the line. The book ends on a high note with Newman’s own streamlined proof of 
the Prime Number Theorem. Complex Analysis by Bak and Newman is a book that 
lives and breathes the excitement of complex variables. 

Ralph Boas’ Invitation to Complex Variables is every bit as good, and better. The 
distillation of an intimate acquaintance with the field that extends over half a 
century, it is, in my opinion, the best undergraduate text ever written on the subject. 
The years of thought that must have gone into the making of this book are evident 
not simply in its organization and choice of topics (both of which are exemplary) 
but also in many little touches. Boas is the only author I know who approaches 
Morera’s theorem from the proper direction (viz. Green’s formula); and even when 
he comes to prove a result as familiar as the fundamental theorem of algebra, he has 
something interesting to show us. Again and again, one encounters illuminating 
discussions of matters ignored entirely by most other authors. Cases in point include 
the section on changing variables in improper integrals and the discussion of 
methods for calculating harmonic conjugates. Another nice feature of the book are 
the notes, which contain well-chosen references to the literature, attributions, 
pointed remarks, an occasional reminiscence (note 1, p. 116), and some amusing 
speculations (note 2, p. 116; note 2, p. 225). 

As Boas observes early on, “complex analysis was originally developed for the 
sake of its applications.” In addition to the usual applications to irrotational flows 
of incompressible, nonviscous fluids, there are discussions of the Plemelj formulas, 
Nyquist diagrams, and asymptotic series. But make no mistake about it: this is a 
mathematics book. The reader will encounter here the Vivanti-Pringsheim theorem, 
the Hadamard gap theorem, and a particularly detailed discussion of Phragmén- 
Lindeléf theorems (all of which are also in Bak-Newman) as well as Tauber’s 
theorem, Vitali’s theorem, Jensen’s formula, and the elementary theory of univalent 
functions (which are not). 

Indeed, there is so much material here beyond what is contained in the usual 
undergraduate semester of complex variables that it makes more sense to ask what 
is missing. The general (homology) form of Cauchy’s theorem, for one thing. Quite 
rightly, neither Boas nor Newman believes in getting bogged down in topology in a 
first course in function theory. (As to the possibilities in this direction, see [5], where 
45 pages are devoted to the proof of Cauchy’s theorem!)' For similar reasons, the 
monodromy theorem is not so much as mentioned in either text. Less easy to 
understand or, in view of the simplicity of the proof, to justify is the absence of the 
Mittag-Leffler theorem on prescribing principal parts of meromorphic functions. I 
was also surprised not to find the symmetric version of Rouche’s theorem (“if f and 
g are analytic on and inside the simple closed curve I, and | f + g| < |f| + |g| on I, 
then f and g have the same number of zeroes inside I°”’); true, no other text of my 
acquaintance includes it either, but somehow one comes to expect more from Boas 
and Newman. Both authors discuss infinite products, prove Wallis’ formula and 


“Of course, there are various shortcuts; and Dixon’s elegant approach [3], developed in extenso in [9, 
pp. 217-220], removes the last excuse for not including this material in a first graduate course. Still, I 
would argue that Cauchy’s theorem in its most general form is precisely the sort of thing one should leave 
out in an undergraduate course. 
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derive the product representation for sin 7z. Newman proves the Weierstrass prod- 
uct theorem; Boas, remarking that the Weierstrass representation “is so general that 
it is not very useful,’ merely states it but then compensates the reader by also 
stating (again without proof) the Hadamard factorization theorem. Boas also omits 
any mention of the Riemann zeta function. This last omission, I think, was a 
mistake: even in the absence of any application to the Prime Number Theorem, 
discussion of the properties of the zeta function offers an ideal opportunity to 
illustrate many of the basic ideas of function theory in a setting that is of special 
interest. But by far the most serious omission is Picard’s Theorem, which is stated in 
passing in both texts but otherwise ignored. Actually, in view of [12], there is no 
reason not to include (a version of) this high point of complex analysis even in a 
course for beginners. One imagines that in subsequent editions (of which there 
should be many) some of these gaps will be filled. 

Exercises are a strong point in both of these texts. They are well-chosen and 
advance the student’s understanding of the material. In addition, Boas has a unique 
feature: 75 pages of complete solutions to the exercises. This enables him to leave 
certain more-or-less routine verifications (as well as further developments) as 
exercises and makes the book particularly suitable for self-study. Also included are 
some supplementary exercises, for which solutions are not given. 

Both volumes are nicely produced (as one would expect) and excellent value for 
the money. A cursory reading turned up only a couple of slips in Bak-Newman: on 
page 88, line 9, read “set” for “domain”; and on page 196, lines 2 and 3, read m for 
k. Boas contains a number of misprints, a (partial) list of which is available from the 
author. Most are harmless, but the example used to illustrate the definition of line 
integral on page 36 contains an error likely to confuse beginners. One should also 
omit the last complete sentence on page 171 as well as the phrase “with e > 0” on 
page 246, line 4. In note 1 on page 172, read 25E for 26E; and on page 219, line 7, 
change “two” to “one.” There is a genuine (though inessential) slip at the bottom of 
page 257, where “nonvanishing” should be inserted before “functions” on line 6; 
note 1 on page 261 needs corresponding modification (or deletion). The solution to 
exercise 35.2(a) is incorrect. 

Such minor blemishes should not be given undue weight. The books here 
reviewed are the texts of choice for an undergraduate course (or seminar) in 
complex analysis for mathematics majors. They belong on the bookshelf of every 
analyst and, for that matter, of anyone who ever plans on teaching a course in 
function theory—at any level. An even better place for them is in the hands of the 
student who wants to know what complex variables is really all about. 
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I am very ambivalent about fractals becoming a field because my work was inspired and 
supported by a very strong belief that the division of science into fields has been extremely 
harmful. 


Benoit B. Mandelbrot, 1983 [1] 


Occasionally it is possible to coax an opinion out of Benoit Mandelbrot. As 
proud as the “Father of Fractals’ may be of his brainchild, he looks toward its 
domestication with trepidation. Known as a polymath, Mandelbrot has carried his 
fractal message anywhere that people might listen, collecting academic appoint- 
ments in mathematics, applied mathematics, economics, engineering, physiology, 
and medicine [2]. His success has been singular. 

Fractal images have become a commonplace of math department offices and 
computer centers. Fractal techniques have become acceptable in cosmology, dynam- 
ical systems, and computer graphics. Not only is fractal geometry becoming a 
respectable tool, it is, despite Mandelbrot’s concerns, becoming a topic in its own 
right. 

The evidence of this is everywhere (along with the fractals themselves). Since the 
last edition of Mandelbrot’s original fractal manifesto [6], we have been treated to 
lavish picture books [7], best-selling popularizations [5], meticulous treatises with yet 
more pictures [8], and philosophical collaborations between physicists and psycholo- 
gists [4]. 

Among these varied offerings we have the present book, Fractals Everywhere, by 
Michael Barnsley of the Georgia Institute of Technology and Iterated Systems, Inc. 
It is a textbook for a course on fractals. However, Benoit need not fear that his 
creation is being set up for codification and sterile abstraction. Barnsley treats 
fractals as the unifying thread in his presentation of metric spaces, mappings, 
dynamical systems, image compression [3], and graphical models. 
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Fractals Everywhere is yet another picture book. This is good, because anyone 
who ignores the visual side of fractals abandons not only its esthetic dimension, but 
also its ability to represent what we otherwise cannot see. Mandelbrot’s The Fractal 
Geometry of Nature 1s in part a casebook of missed opportunities, where he points 
out how inadequate visualization of complex systems obscures their nature (see [6, 
p. 179] in particular). 

Barnsley’s cover illustration, a depiction of an Andes Indian girl, is a collage of 
160 iterated maps. The emphasis on visuals is a fundamental part of the book and 
not, as in so many other cases, a sugar-coating with little relation to what’s inside. 
Barnsley’s notation is often iconic; that is, specialized symbols provide strong visual 
reminders of what they represent. Given the inherent complexity of the material, it’s 
nice to see a heavy dot representing a closed disk in R? and a filled square standing 
in for the unit square; that’s easier than having to remember what D and S stand 
for (or whatever). Even more striking are the icon for a Sierpinski triangle and the 
little mummy representing a frozen cadaver (a useful R° subset that Barnsley calls 
“Body space’). 

These playful elements pervade the book, and a good thing, too. Barnsley uses 
them as memory devices to minimize the mental baggage one must carry during his 
fractal trek, leaving more room for the booty to be gathered as the trip progresses. 

Barnsley’s fractal geometry course at Georgia Tech was the basis for his text. The 
prerequisite is two years of calculus, which makes the entry requirements rather low; 
a healthy dollop of mathematical maturity would be a good idea. Ramping up from 
an elementary discourse on the properties and topology of metric spaces, Barnsley 
moves into contraction mappings and defines the iterated function system (IFS), a 
tool that recurs time and again in all that follows. The probabilistic Chaos Game is 
contrasted with the Deterministic Algorithm for fractal generation. Barnsley intro- 
duces dynamical systems and fractal dimension. 

By this time the atmosphere is getting rarefied as Barnsley tracks through fractal 
interpolation, Julia sets, and measures on fractals. In actual practice, Barnsley 
reserves the latter chapters of his text for special topics that round out the basic 
introductory material. The rich selection ensures that each instructor will be able to 
tailor his or her own course in many different ways. 

Fractals Everywhere has many praiseworthy features. The prose is clear and 
generally straightforward, though with enough passive voice so we don’t forget we’re 
in a textbook. Exercises abound—at all levels of difficulty. Practically every page 
has at least one illustration, and two-page spreads of pure text are almost impossible 
to find. (In fact, given Barnsley’s predilection for iconic notation, several of the text 
pages could be said to be illustrated as well.) However, some of the figures are 
rather crudely drawn, and they suffer by comparison with the slick computer 
graphics that dominate the illustrations. 

The author occasionally forgets that his readers may not be as familiar with 
fractal terminology as he is. Thus he mentions the Chaos Game or Chaos Algorithm 
in a few places, but spends a lot more time talking instead about the Random 
Iteration Algorithm. They’re one and the same, but I never saw where Barnsley 
expressly said so. Unfortunately, the index references for the two names are disjoint, 
with no cross-references to each other. 

The book may claim to have 9 chapters, but it really has only 8. I regret that the 
author refers to his five-page introduction as Chapter 1, as it is not even a member 


268 ANTHONY BARCELLOS [March 


of the same species as the other chapters. The concept of Chapter 0 was invented for 
chapters of measure zero like this one. This is, of course, a trifling matter, but so is 
Chapter 1. 

We can probably expect that fractal geometry will become a fixture in the college 
mathematics curriculum. As a descriptive tool it has great power, providing a new 
model for natural processes that has already proved fruitful in many fields. Fractals 
Everywhere takes Mandelbrot’s thesis of fractal ubiquity and frames it with suffi- 
cient structure to make classroom presentation possible. Barnsley’s text will inspire 
rival books, but I think that Fractals Everywhere will be commonly referred to as 
the “classic” in its field. 

Perhaps, somewhere, someone is writing (or has written) a textbook on fractals 
that begins with a definition in terms of Hausdorff-Besicovitch dimension and 
proceeds to adduce therefrom a numbered sequence of theorems and corollaries. 
This is not that book. Benoit would be pleased. 
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General, S, L***. The World of Mathematics. 
James R. Newman. Microsoft Pr, 1988, $13.95 (P) 
each (ISBN: 1-55615-148-9] set. Volume One, xxx + 
711 pp; Volume Two, ix + 677 pp; Volume Three, viii 
+ 602 pp; Volume Four, viii + 449 pp. A “faithful 
reproduction” of Newman’s 1956 masterful anthol- 
ogy of mathematics, scanned and retypeset with the 
aid of modern computers. In this edition an enlarged 
60-page comprehensive index is printed in each vol- 
ume. Opens with a new Foreword by Philip and 
Phyllis Morrison. A superb resource for school and 
college libraries and for any teacher of mathematics. 
Perhaps its republication will inspire a worthy sequel 
focused on contemporary mathematics. LAS 
Precalculus, T(13). Precalculus: Functions and 
Graphs, Fifth Edition. M.A. Munem, J.P. Yizze. 
Worth, 1990, xii + 534 pp, $38.95. (ISBN: 0-8791- 
418-0] Standard precalculus text, including polyno- 
mial, rational, exponential, logarithmic, and trigono- 
metric functions as well as systems of equations and 
inequalities, conic sections and analytic geometry, 
Sequences and series, and mathematical induction. 
Occasional diagrams are poorly drawn or printed. 
(Fourth Edition, TR, November 1986.) FR 


Education, P, L*. Mathematicians and Educa- 
tion Reform. Eds: Naomi Fisher, Harvey Keynes, 
Philip Wagreich. CBMS Issues in Math. Educ., V. 
1. AMS and MAA, 1990, viii + 217 pp, (P). (ISBN: 
0-8218-3500-9] First in a new series of CBMS re- 
ports on mathematics education. This volume—the 
proceedings of a July 1988 workshop—contains pa- 
pers on projects (by Tom Berger, Frank Demana, Uri 
Treisman, Herb Clemens, R.O. Wells, et al.) and 
short notes on issues (recruitment of women, nontra- 
ditional instruction, etc.). Focus is on contexts that 
enable university mathematicians to work effectively 
in mathematics education. LAS 


Linear Algebra, T(13: 1), S, C, L. Matrices. 
North Carolina School of Science and Mathemat- 
ics. NCTM, 1988, vi + 115 pp, $10 (P). [ISBN: 0- 
87353-270-8] A unit designed to show how matrices 
can be used to model real-world phenomena and as 
discrete data structures. The first three sections in- 


troduce matrix arithmetic and then studies the Leon- 
tief model, the Leslie model, Markov chains, and 
computer graphics. Well-written with lots of good 
examples and exercises. Includes easy-to-use soft- 
ware that allows static and dynamic matrices of up 
to 15 by 15 and is able to perform standard matrix 
computations. CEC 


Group Theory, P. The Classical Groups and K- 
Theory. Alexander J. Hahn, O. Timothy O’Meara. 
Grund. der math. Wissenschaften, B. 291. Springer- 
Verlag, 1989, xv + 576 pp, $119. [ISBN: 0-387- 
17758-2] Covers algebraic properties of the classical 
groups over rings. Presents the development of topics 
which have evolved since Dieudonne’s La Géometrie 
des Groupes Classiques. In two parts: the first covers 
the simple situation of the linear groups, and the sec- 
ond then does a parallel study for the other classical 
groups. Contains several new results. RH 


Algebra, T(16: 1), S, P*, L. Handbook of Boolean 
Algebras, Volume 1-3. Ed: J. Donald Monk, Robert 
Bonnet. North-Holland (US Distr: Elsevier Science), 
1989, $309.75 set [ISBN: 0-444-87291-4]. Volume 1, 
xix + 323 pp, $68.25 (ISBN: 0-444-70261-X]; Vol- 
ume 2, xix + 415 pp, $112.25 [ISBN: 0-444-87152-7]; 
Volume §, xix + 650 pp, $161. (ISBN: 0-444-87153- 
5] Treats those areas of the theory of Boolean al- 
gebras of most interest to pure mathematicians, 1.e., 
set-theoretical abstract theory, applications, and re- 
lationships to measure theory, topology, and logic. 
Volume 1 is a self-contained treatment of the subject 
including exercises. Volumes 2 and 3 consist of arti- 
cles that are more or less independent of each other 
which expose most of the frontiers of research in the 
subject. CEC 


Algebra, T(18: 1, 2). Commutative Ring Theory. 
Hideyuki Matsumura. Transl: M. Reid. Stud. in 
Adv. Math., V. 8. Cambridge U Pr, 1989, xiii + 
320 pp, $24.95 (P). {ISBN: 0-521-36764-6] Changes 
from the First Edition (TR, November 1987) include 
correction of mistakes, improvements of some theo- 
rems, and additional topics added in the form of ap- 
pendices to individual sections. The section entitled 
“Kunz’ Theorems” in the First Edition has been re- 
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placed, and the bibliography has been enlarged. RH 


Calculus, T(13-14: 1). Brief Calculus with Appli- 
cations. Raymond F. Coughlin, David E. Zitarelli. 
Saunders College, 1990, xvi + 476 pp. [ISBN: 0- 
03-031597-2] A one-semester calculus text for busi- 
Ness, social, and biological science students. Indexed 
applications, gender-neutral problems, clear graphs, 
readable text. Includes referenced problems to sup- 
port student projects or in-depth study. FR 


Calculus, T(13-14: 2). Applied Calculus, Second 
Edition. Dennis D. Berkey. Saunders College, 1990, 
xix + 681 pp, $38. (ISBN: 0-03-031267-1] First year 
calculus text for business, economics, and social sci- 
ence students. Indexed applications, gender-neutral 
problems, clear graphs, readable text. BASIC appli- 
cations programs in appendix. Includes infinite se- 
ries, differential equations, probability theory. FR 


Numerical Analysis, P. Lecture Notes in Mathe- 
matics-1386: Numerical Methods for Ordinary Dif- 
ferential Equations. Eds: A. Bellen, C.W. Gear, E. 
Russo. Springer-Verlag, 1989, vii + 136 pp, $13.50 
(P). (ISBN: 0-387-51478-3] Proceedings of a work- 
shop held in L’ Aquila, Italy, September 16-18, 1987. 
Eight papers describe recent work on the develop- 
ment of improved general-purpose codes and algo- 
rithms for parallel computers. AO 


Numerical Analysis, T(16-18: 1, 2). Matriz 
Computations, Second Edition. Gene H. Golub, 
Charles F. Van Loan. Ser. in Math. Sci., V. 


3. Johns Hopkins U Pr, 1989, xix + 642 pp, 
$59.50; $29.95 (P). [ISBN: 0-8018-3772-3; 0-8018- 
3739-1] A revised and expanded version of this im- 
portant text and reference. New material has been 
added throughout and some stylistic improvements 
have been made. The most significant change is the 
addition of new material (including a new chapter) 
on parallel matrix computation. (First Edition, TR, 
January 1985.) AO 


Numerical Analysis, S(15-16). Guide to Nu- 
merical Analysts. Peter R. Turner. Math. Guides. 
Macmillan Education (US Distr: Scholium Intern), 
1989, x + 208 pp, $22.50 (P). (ISBN: 0-333-44947-9] 
A brief introduction to many of the ideas and tech- 
niques discussed in an introductory numerical analy- 
sis course together with some of the relevant back- 
ground material from first-year calculus. Designed 
to be used as a supplementary source of information 
for students taking a numerical analysis course. AO 


Functional Analysis, P. Nonlinear Functional 
Analysis and its Applications. Eberhard Zeidler. 
Transl: Eberhard Zeidler, Leo F. Boron. Springer- 
Verlag, 1990. JI/A: Linear Monotone Operators, 
Xvili + 467 pp, $124 [ISBN: 0-387-96802-4]; JJ/B: 
Nonlinear Monotone Operators, xv + 733 pp, $115. 
[ISBN: 0-387-97167-X] The second of a five-volume 
series on nonlinear functional analysis. The theory of 
monotone operators generalizes Hilbert-space meth- 
ods (for linear partial differential equations) to non- 
linear problems. The first subvolume (JJ/A) is an 
introduction to the functional analytic theory of lin- 
ear partial differential equations; the second (JJ/B) 
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deals with the generalizations. (JJ/B may be read 
independently.) MLR 

Analysis, P. Thirteen Papers Translated from the 
Russian. S.D. Berman, et al. AMS Transl. Ser. 2, 
V. 144. AMS, 1989, viii + 140 pp, $58. [ISBN: 0- 
8218-3125-9] 


Algebraic Geometry, P. Singularities. Ed: 
Richard Randell. Contemp. Math., V. 90. AMS, 
1989, xv + 359 pp, $42 (P). (ISBN: 0-8218-5096-2] 
The proceedings of an international conference on al- 
gebraic geometry held in July 1986 at the University 
of Iowa. Includes 19 papers on singularities in alge- 
braic and analytic varieties, together with a report on 
a session devoted to problems and open questions. PZ 


Differential Geometry, P. Lecture Notes in 
Mathematics-1410: Differential Geometry. Eds: F.J. 
Carreras, O. Gil-Medrano, A.M. Naveira. Springer- 
Verlag, 1989, 308 pp, $28.60 (P). [ISBN: 0-387-51885- 
1] Proceedings of the Third International Sympo- 
sium held in Pefiscola, Spain, June 1988. MPR 


Operations Research, T(15-18: 1, 2), P, L. 
The Theory of Choice and Decision Making. Igor 
M. Makarov, et al. MIR (US Distr: Imported Pub), 
1987, 328 pp, $13.95. (ISBN: 0-8285-4017-9] A well- 
written translation from the Russian. The text is or- 
ganized into three parts: the theoretical background 
for the choice of alternatives, procedures and algo- 
rithms for decision making, and optimal control with 
multiple criteria. The book is suitable for advanced 
undergraduates studying mathematics. Extensive 
bibliography, although about two-thirds of the ref- 
erences are in Russian. There are exercises scattered 
throughout the text, most rather difficult. The price 
of this book is extremely reasonable. SM 


Operations Research, T(15-16), L. Readings in 
Decision Analysis. Simon French. Chapman & Hall, 
1989, ix + 210 pp, $25 (P). [ISBN: 0-412-32170-X] 
This is an introduction to decision analysis. The 
author uses a lively combination of his own nar- 
rative and previously published papers to lead the 
reader through the theory and applications in the 
field. Reprints of six case studies are included. The 
mathematical sophistication required is minimal: the 
most intimidating symbols are summation signs and 
functions denoted by Greek letters. No exercises. SM 


Operations Research, T(17-18). Multiple Crite- 
ria Optimization: Theory, Computation, and Appli- 
cation. Ralph E. Steuer. Robert E Krieger, 1989, 
xx + 546 pp, $57.95. [ISBN: 0-89464-393-2] Writ- 
ten as a text for graduate students in management, 
this book requires modest mathematical background 
(linear algebra with linear programming, use of cal- 
culus for optimization, willingness to use computer 
software). Good exposition that stays close to geo- 
metric representation throughout. Reprint of 1986 
Wiley edition (TR, November 1987). AWR 


Optimization, P. Simulated Annealing (SA) & Op- 
timization: Modern Algorithms with VLSI, Optimal 
Design, & Missile Defense Applications. Ed: Mark 
E. Johnson. Amer. J. of Math. & Management 
Sci., V. 8, Nos. 3 & 4. American Sciences Pr, 1988, 
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245 pp, $89.75 (P). [ISBN: 0-935950-18-4] An an- 
notated bibliography and six papers. Three of the 
papers describe applications in the areas of VLSI, 
optimal design, and missile defense. The other three 
Papers discuss computational aspects of the tech- 
nique. AO 

Optimization, T(18: 2), P, L. Methods of Dy- 
namic and Nonsmooth Optimization. Frank H. 
Clarke. CBMS-NSF Reg. Conf. Ser. in Appl. Math., 
V. 57. SIAM, 1989, v + 90 pp, $15.75 (P). [ISBN: 
0-89871-241-6] A short and succinct monograph. 
Beginning with a chapter on nonsmooth analysis 
and geometry, the text continues through nonsmooth 
problems in the calculus of variations, verification 
functions, dynamic programming, and optimal con- 
trol. The book ends with an index and a substantial 
list of references. No exercises; inexpensive. SM 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-123: Knowledge-Based Control 
with Application to Robots. C.W. de Silva, A.G.J. 
MacFarlane. Springer-Verlag, 1989, x + 196 pp, 
$30.30 (P). [ISBN: 0-387-51143-1] A research-level 
monograph describing recent work using machine in- 
telligence techniques in the design of robot control 
mechanisms. AO 


Control Theory, P, L. Lecture Notes in Control 
and Information Sciences-124: Control of Partially- 
Known Dynamical Systems. A.A. Bahnasawi, M.S. 
Mahmoud. Springer-Verlag, 1989, xi + 228 pp, 
$30.30 (P). (ISBN: 0-387-51144-X] The research in 
this monograph is primarily focused on reduced- 
order adaptive control schemes and deterministic 
stabilizing control for uncertain dynamical systems. 
Both discrete systems and continuous systems are 
discussed. SM 


Control Theory, P, L. Modern Optimal Control. 
Ed: Emilio O. Roxin. Lect. Notes in Pure & Appl. 
Math., V. 118. Marcel Dekker, 1989, xiii + 437 pp, 
(P). (ISBN: 0-8247-8168-6] A collection of twenty- 
nine papers offering both an historical perspective on 
thirty years of developments in the study of optimal 
control and dynamical systems, and significant ex- 
amples in modern control theory. An index of topics 
is provided. SM 


Control Theory, T(18), P, L. Lecture Notes 
in Control and Information Sciences-138: Robust 
Controller Design Using Normalized Coprime Fac- 


tor Plant Descriptions. D.C. McFarlane, K. Glover. . 


Springer-Verlag, 1990, x + 206 pp, $30.30 (P). 
[ISBN: 0-387-51851-7] This book discusses the ro- 
bust stabilization problem which involves finding a 
single feedback controller for all members of a model 
which are close, in some sense, to a nominal model. 
The motivation for studying this problem is the re- 
alization that modeling is an inexact art and param- 
eter values are seldom known precisely. A robust 
controller will stabilize any physical system which is 
close enough to the system represented by a mathe- 
matical model. SM 


Systems Theory, $(18), P. Lecture Notes in Con- 
trol and Information Sciences-135: Three Decades of 
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Mathematical System Theory. Eds: H. Nijmeijer, 
J.M. Schumacher. Springer-Verlag, 1989, vi + 562 
pp, $69 (P). [ISBN: 0-387-51605-0] Citing 1960 pa- 
pers by Kalman, Bellman, and Pontryagin as the be- 
ginnings of modern control theory and a shift in 1958 
in the approach to filtering, the editors have assem- 
bled papers that commemorate what they take to be 
the entire history of system theory as the discipline 
is now defined and practiced. AWR 


Systems Theory, T(18), P. Nonlinear Control 
Systems: An Introduction, Second Edition. Alberto 
Isidori. Communic. & Control Engin. Ser. Springer- 
Verlag, 1989, xii + 479 pp, $49.50. (ISBN: 0-387- 
50601-2] First appeared in 1983; is here rearranged 
on the basis of classroom use, supplemented with 
two new chapters, and given more of a differential 
geometry orientation. Supposes familiarity with lin- 
ear system theory; intended as a graduate text and 
reference for professionals in the field. AWR 


Probability, T(17: 2), P. Topics in Advanced 
Econometrics: Probability Foundations. Phoebus 
J. Dhrymes. Springer-Verlag, 1989, xii + 379 pp, 
$39.95. (ISBN: 0-387-97178-5] Contains probabil- 
ity theory that is needed for rigorous study of non- 
linear econometrics. Measure theory, LLN and LLT 
for independent sequences and Martingales, ergodic 
theory. TH 


Stochastic Processes, T(17-18). Stochastic Dif- 
ferential Equations: An Introduction with Applica- 
tions, Second Edition. Bernt ®ksendal. Universi- 
text. Springer-Verlag, 1989, xv + 186 pp, $29.95. 
(ISBN: 0-387-51740-5] Changes from First Edition 
(TR, April 1986) include a reorganization of material 
and changes in presentation. MLR 


Statistics, P. Latent Variable Path Modeling 
with Partial Least Squares. Jan-Bernd Lohmoller. 
Physica-Verlag (US Distr: Springer-Verlag), 1989, 
283 pp, $58.40. [ISBN: 0-387-91363-7] Theoret- 
ical exposition, with algorithms and applications, 
of Herman Wold’s “soft modelling” method of con- 
structing linear-relation models, where “soft” refers 
to the distributional assumptions. Extensive bibliog- 
raphy. RSK 


Statistics, P. Lecture Notes in Statistics-58: De- 
composition and Invariance of Measures, and Sta- 
tistical Transformation Models. Ole E. Barndorff- 
Nielsen, Preben Blaesild, Poul Svante Eriksen. 
Springer-Verlag, 1989, 147 pp, $19.50 (P). [ISBN: 
0-387-97131-9] Theoretical monograph presuming 
some background in topology, group theory and dif- 
ferential geometry, in addition to parametric statis- 
tics. Presents the relevant mathematics of decom- 
position or disintegration of measures and construc- 
tion of invariant measures and uses these results 
to present the key aspects of transformation mod- 
els. RSK 

Statistics, P. Lecture Notes in Statistics-60: Non- 
parametric Curve Estimation from Time Series. 
Lazl6 Gyéorfi, et al. Springer-Verlag, 1989, vili + 
153 pp, $19.50 (P). [ISBN: 0-387-97174-2] Theoret- 
ical treatment of nonparametric estimation methods 
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(primarily kernel estimates) for nonlinear time se- 
ries and regression analysis for density, distribution 
function, and hazard rate estimation with time-series 
(not i.i.d.) data. TH 


Statistics, P. Ezperimental Design in Biotechnol- 
ogy. Perry D. Haaland. Stat.: Textbooks & Mono- 
graphs, V. 105. Marcel Dekker, 1989, xv + 259 
pp, $89.75. [ISBN: 0-8247-7881-2] Uses real data 
to illustrate how experimental design can be applied 
effectively to biotechnological problems. Primarily 
concerned with small screening experiments (using 
fractional factorial designs) and response surface ex- 
periments for optimization. Appendix contains a 
tabulation of useful designs (35 screening and 9 re- 
sponse surface). Note price. RSK 


Statistics, P. Lecture Notes in Statistics-54: Theory 
of Optimal Designs. Kirti R. Shah, Bikas K. Sinha. 
Springer-Verlag, 1989, viii + 171 pp, $19.50 (P). 
[ISBN: 0-387-96991-8] Theoretical monograph pre- 
senting state-of-the-art results and recent advances 
in discrete optimal designs. RSK 

Statistics, T(16-17: 1, 2), P*, L. Statistical 
Methodology in the Pharmaceutical Sciences. Ed: 
Donald A. Berry. Stat.: Textbooks & Monographs, 
V. 104. Marcel Dekker, 1990, viii + 578 pp, $125. 
[ISBN:0-8247-8117-1] Collection of 16 articles pre- 
senting a variety of statistical methods useful in phar- 
maceutical research. Each starts from first principles 
and progresses to state-of-the-art techniques, most il- 
lustrated with real data. Presumes an undergraduate 
course in statistical theory and methods. RSK 


Computer Graphics, P, L. Computational Ge- 
ometry: Curve and Surface Modeling. Su Bu-qing, 
Liu Ding-yuan. Transl: Chang Geng-zhe. Academic 
Pr, 1989, x + 295 pp, $39.95. [ISBN: 0-12-675610- 
4] Translation of the 1979 Chinese version. Con- 
cerned with computer-based representation analysis 
and synthesis of two- and three-dimensional shapes. 
Basic theories, geometric properties, and numeri- 
cal algorithms of curves and surfaces widely used 
in computer-aided design and manufacture. Spline 
functions, parametric cubic spline curves; Bézier 
curves; spline surfaces; nonlinear splines; plane 
curves and “fairness.” Readers need analytical ge- 
ometry, calculus, and linear algebra. Approximately 
170 papers published since 1979 have been added to 
an already lengthy reference list. JDEK 


Artificial Intelligence, P. Integration of Visual 
Modules: An Eztension of the Marr Paradigm. John 
(Yiannis) Aloimonos, David Shulman. Academic Pr, 
1989, xvi + 322 pp, $39.95. [ISBN: 0-12-053020-1] 
An overview of the field of computer vision and what 
we can do with it from two researchers at the Com- 
puter Vision Laboratory and Computer Science De- 
partment at the University of Maryland. JAS 


Computer Science, T(17-18: 1), P. Topology via 
Logic. Steven Vickers. Tracts in Theoret. Com- 
puter Sci., V. 5. Cambridge U Pr, 1989, 200 pp, 
$39.50. (ISBN: 0-521-36062-5] Ignore the points of 
a set, consider a family of subsets, and wallow in the 
parallelism between intersection/union and conjunc- 
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tion/disjunction to discover topology in logic. Such 
is the flavor of this foundation for certain aspects of 
formal language theory which is aimed at graduate 
students in computer science. A scattering of exer- 
cises and interesting bibliography. JAS 


Computer Science, S. Advances in Computers, 
Volume 29. Ed: Marshall C. Yovits. Academic Pr, 
1989, xi + 359 pp, $59.95. (ISBN: 0-12-012129-8] 
The twenty-ninth volume in a series begun in 1960 
addressing long-range research issues in the field of 
computer science. This volume addresses five issues, 
four of which are technical and one of which is social. 
The technical issues are computer security, human- 
computer interaction, network protocols, and com- 
puter chess. The social issue is a study of the state 
of computing in the Soviet Union and how it com- 
pares with the United States. GMS 


Computer Science, T. Object-Oriented Systems 
Analysis: Modeling the World in Data. Sally Shlaer, 
Stephen J. Mellor. Comput. Ser. Prentice-Hall, 
xil + 144 pp, (P). [ISBN: 0-13-629023-X] An infor- 
mal presentation suitable for a practitioner-oriented 
workshop in information theory. JAS 


Computer Science, T(16), P. Object-Oriented 
Analysis. Peter Coad, Edward Yourdon. Comput. 
Series. Prentice-Hall, 1990, xi + 232 pp, $29. [ISBN: 
0-13-629122-8] The two most popular techniques 
for the specification and design of large software sys- 
tems has been until recently functional decomposi- 
tion and data flow analysis. Now a newer and sig- 
nificantly improved technique is being studied which 
appears to offer significant benefits. This method, 
called object-oriented systems analysis, decomposes 
the system into a collection of objects, functions on 
objects, and classes of objects. This decomposition, 
it is claimed, is more natural and easy to under- 
stand for users and programmers. The authors give 
a complete introduction to this new software design 
method. GMS 


Computer Science, P, L. Mathematics for Large 
Scale Computing. Ed: J.C. Diaz. Lect. Notes in 
Pure & Appl. Math., V. 120. Marcel Dekker, 1989, 
xi + 345 pp, (P). [ISBN: 0-8247-8122-8] Twelve pa- 
pers arising from a special session of the AMS meet- 
ing held in Denton, Texas in the fall of 1986. Cov- 
ers specific research applications, development, and 
analysis of numerical algorithms and performance 
evaluation of algorithms on advanced architectures. 
The editors seek to give a representative picture of 
several aspects of large scale computing. SM 
Computer Science, T(16-18: 1, 2). Object- 
Oriented Software Construction. Bertrand Meyer. 
Ser. in Comput. Sci. Prentice-Hall, 1988, xviii + 
534 pp. (ISBN: 0-13-629049-3] A software engineer- 
ing perspective using the language Eiffel (designed by 
the author with Simula for inspiration) as a notation 
for conveying the concepts of objected-oriented pro- 
gramming. A serious and detailed study of concepts, 
not a programming manual. Numerous open-ended 
discussion problems for the student. JAS 


Computer Science, T(16-17: 1), L. Introduction 


1990] 


to Parallel Programming. Steven Brawer. Academic 
Pr, 1989, xi + 422 pp, $39.95. (ISBN: 0-12-128470-0] 
An informal, practical introduction to parallel pro- 
gramming techniques for use on shared-memory mul- 
tiprocessors. Does not assume a particular hardware 
or software platform. Stresses concepts and funda- 
mental techniques. AO 


Computer Science, P. Parallel Processing and 
Medium-Scale Multiprocessors. Editor: Arthur 
Wouk. SIAM, 1989, 207 pp, $31.50 (P). (ISBN: 0- 
89871-238-6] Proceedings of a research workshop 
held at Stanford University in January 1986. The 
twelve papers included in this volume cover top- 
ics in systems programming, languages for paral- 
lel programming systems, and applications program- 
ming. AO 


Applications, T(16-17: 1), L. A Concrete 
Approach to Mathematical Modelling. Michael 
Mesterton-Gibbons. Addison-Wesley, 1989, xx + 
597 pp, $48.50. (ISBN: 0-201-12910-8] Aimed at 
students with no prior background in mathematical 
modelling. Deterministic and probabilistic models 
are given roughly equal treatment. Models are cho- 
sen from a wide variety of applications areas. Mathe- 
matical prerequisites include calculus, linear algebra, 
ordinary differential equations, and probability and 
statistics. AO 


Applications (Biological Science), S(16), L. 
Some Mathematical Questions in Biology: Models in 
Population Biology. Ed: Allan Hastings. Lect. on 
Math. in the Life Sci., V. 20. AMS, 1989, vil + 
123 pp, $28 (P). (ISBN: 0-8218-1170-3] In this work 
five papers appear which deal with different aspects 
of population biology: cultural evolution, multilocus 
population genetics, spatially structured population 
genetics, chaos and the dynamics of epidemics, and 
the dynamics of ecological systems. Common to all 
four works is the use of nonlinear difference equations 
and nonlinear differential equations. SM 


Applications (Biological Science), P. Applied 
Mathematical Ecology. Eds: Simon A. Levin, 
Thomas G. Hallam, Louis J. Gross. Biomath., V. 18. 
Springer-Verlag, 1989, xiv + 491 pp, $59.50. [ISBN: 
0-387-19465-7] Proceedings of the Second Autumn 
Course on Mathematical Ecology held November and 
December 1986 at the International Centre for Theo- 
retical Physics in Trieste, Italy. Builds on the topics 
covered in an earlier publication Mathematical Ecol- 
ogy, edited by T.G. Hallam and S.A. Levin (TR, 
April 1987). Includes application of mathematical 
ecology in ecotoxicology, resource management, epi- 
demiology, demography, and population ecology. RH 


Applications (Economics), P. Theoretical Mar- 
ket Areas under Euclidean Distance. Pierre Hanjoul, 
Hubert Beguin, Jean-Claude Thill. Mono. Ser. In- 
stitute of Mathematical Geography (2790 Briarcliff, 
Ann Arbor, MI 48105), 1988, 162 pp, $15.95 (P). 
A research-level monograph describing a theory of 
market areas in the case of a duopoly competing in- 
side the Euclidean plane endowed with Euclidean dis- 
tance. AO 
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Applications (Engineering), P. Symbolic Com- 
putation: Applications to Scientific Computing. Ed: 
Robert Grossman. Frontiers in Appl. Math., V. 
5. SIAM, 1989, x + 185 pp, $24.50 (P). (ISBN: 0- 
89871-239-4] The seven papers in this volume are 
from the NASA-Ames Workshop on the Use of Sym- 
bolic Methods to Solve Algebraic and Geometric 
Problems Arising in Engineering which took place 
January 15-16, 1987. These papers describe appli- 
cations of computer algebra systems in the study 
of chaotic systems, fluid dynamics, nonlinear control 
systems, and robotics. AO 


Applications (Information Theory), S*(14), 
L*. Information Theory: Saving Bits. William 
Sacco, et al. Contemp. Appl. Math. Janson, 1988, 
vi + 58 pp, $7.95 (P). (ISBN: 0-939765-25-X] A brief 
monograph in workbook format which explores ef- 
ficient storage and transmission of information and 
error control coding. A very useful introduction to 
these two areas of information theory. CEC 
Applications (Management), T(16: 1), S, C. 
Managerial Decision Analysis. Danny Samson. 
Richard D Irwin, 1988, xiv + 743 pp, $43.95. [ISBN: 
0-256-06162-9] Focuses on the modeling and imple- 
mentation issues of supporting managerial decisions 
with rigorous analysis. The treatment is not very 
mathematical, requiring only secondary school alge- 
bra. Includes the foundations of the subject along 
with chapters on sensitivity analysis, behavioral as- 
pects of decision analysis, implementation issues, de- 
cision support systems, and complementary decision 
aids. Includes a decision analysis software package 
and documentation called ARBORIST that provides 
a tool for solving and formulating decision tree mod- 
els. This package runs on a PC and takes some ef- 
fort to understand how it works. The text does not 
depend on the software which serves as a supplemen- 
tary tool. CEC 


Applications (Physics), P. Dynamics and Con- 
trol of Multibody Systems. Eds: J.E. Marsden, P.S. 
Krishnaprasad, J.C. Simo. Contemp. Math., V. 
97. AMS, 1989, xvii + 468 pp, $46 (P). [ISBN: 0- 
8218-5104-7] Proceedings of the 1988 AMS-IMS- 
SIAM Joint Summer Research Conference. MU 


Applications (Physics), P. Lecture Notes in 
Physics-345: Schrodinger Operators. Eds: UH. 
Holden, A. Jensen. Springer-Verlag, 1989, 458 pp, 
$47.60. (ISBN: 0-387-51783-9] Fourteen papers on 
Schrédinger operators including short introductory 
lectures, longer survey articles, and current research 
topics. MPR 


Reviewers 


CEC: Clifton E. Corzatt, St. Olaf; RH: Rhonda Hatcher, 
St. Olaf; TH: Timothy Hesterberg, St. Olaf; RSK: Richard S. 
Kleber, St. Olaf; JDEK: Joseph D.E. Konhauser, Macalester; 
SM: Steve McKelvey, St. Olaf; AO: Arnold Ostebee, St. Olaf, 
MLR: Margaret L. Reese, St. Olaf; MPR: Matthew P. Richey, 
St. Olaf; FR: Flauren Ricketts, Macalester; AWR: A. Wayne 
Roberts, Macalester; GMS: G. Michael Schneider, Macalester; 
JAS: J. Arthur Seebach, Jr., St. Olaf; LAS: Lynn Arthur 
Steen, St. Olaf; MU: Milton Ulmer, Carleton; PZ: Paul Zorn, 
St. Olaf. 
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Moone can use 


This book is not just for numerical an- 
alysts, it is accessible to anyone with 
some mathematical foundation who 
wants to taste selections from a nu- 
merical analysis menu. The papers do 
a good job in motivating and introduc- 
ing their subjects, and some provide 
a nice historical background. | recom- 
mend the articles in this book for any- 
one interested in learning more about 
the current research interests in numer- 
ical analysis. They may excite their 
readers to investigate further this rel- 
evant field of mathematics. 


David C. Arney in Mathematics and Com- 
puter Education 


Order from: 


STUDIES IN NUMERICAL ANALYSIS 
Gene H. Golub, Editor 


Golub set high standards in the lead arti- 
cle on numerical solution of polynomials by 
J.H. Wilkinson which won the Chauvenet 
Prize as the best piece of mathematical ex- 
position to appear in 1985. These papers 
describe the wide range of research in nu- 
merical analysis. A variety of problems and 
of computational tools is treated. Some of 
the computations require only a hand-held 
calculator, while others require a powerful 
computer. 

422 pp., 1985, ISBN 0-88385-1 26-1 

List: $46.50 MAA Member: $34.50 


Catalog Number MAS-24 


The Mathematical Association of America 


MAT, 
ner Ca 


, 1529 Eighteenth Street, N. W. 
Washington, D. C. 20036 
(202) 387-5200 


A briefer version of Coughlin and 
Zitarelli’s acclaimed Calculus with 
Applications—the most applied 
calculus text available 


BRIEF CALCULUS WITH 
APPLICATIONS 


Raymond F. Coughlin and 
David E. Zitarelli 
both of Temple University 


Coughlin and Zitarelli’s Calculus with 
Applications was an overnight success when it 
was introduced earlier this year. Instructors 
praised its fact-based, real-world case 
studies...its abundant applications exercises... 
its unusually thorough coverage. 

Now these same great features are avail- 
able in a text for one-semester courses. Brief 
Calculus with Applications is the text profes- 
sors have been asking for ever since the 
authors’ longer book first appeared. 


The mathematically honest approach 
to applied calculus is back in a new 
second edition... 


APPLIED CALCULUS 


Second Edition— Dennis D. Berkey, 
Boston University 


Students need basic calculus skills to 
succeed in business, the social sciences 
or the life sciences. Through numerous 4 
worked examples, extensive exercise 
sets, abundant applications, and a 
uniquely clear, straightforward 
presentation, Berkey delivers 
these skills. 


Also from Dennis D. Berkey 
CALCULUS FOR 


MANAGEMENT, SOCIAL 
AND LIFE SCIENCES 


Second Edition 


This brief version of Applied Calculus, Secerd 
Edition, is perfect for the one-semester cou'se 
it includes seven appropriate chapters from 
Applied Calculus. 


SAUNDERS COLLEGE PUBLISHING .+# 
THE POWERS °F MATHEMATICS 


USA MATHEMATICAL OLYMPIADS 


Every year 100 of the most mathemati- 
cally talented high school students in the 
country compete in the USA Mathemati- 
cal Olympiad (USAMO). The USAMO is 
the third stage of a three-tiered mathe- 
matical competition for high school stu- 
dents in the United States and Canada 
that begins with the AHSME taken by over 
400,000 students, continues with the Amer- 
ican Invitational Mathematics Exam involv- 
ing 2,000 students, and culminates in the 
100-contestant USAMO. 


Order from: 


(202) 387-5200 


USA MATHEMATICAL OLYMPIADS 
1972-1986, PROBLEMS AND 
SOLUTIONS 


Compiled by Murray S. Klamkin 


People delight in working on problems “be- 
cause they are there, ” for the sheer plea- 
sure of meeting a challenge. This is a 
book full of such delights. In it, Murray 
S. Klamkin brings together 75 original USA 
Mathematical Olympiad (USAMO) problems 
for years 1972-1986, with many improve- 
ments, extensions, finger exercises, open 
problems, references and solutions, often 
showing alternative approaches. The prob- 
lems are coded by subject and solutions 
are arranged by subject as an aid to those 
interested in a particular field. Contains 
a glossary of frequently used terms and 
theorems, and a comprehensive bibliogra- 
phy with items numbered and referred to 
in brackets in the text. The problems are 
intriguing and the solutions elegant and in- 
formative. Students and teachers will en- 
joy working these challenging problems. In- 
deed all those those who are mathemat- 
ically inclined will find many delights and 
pleasant challenges in this book. 


180 pp., 1988, ISBN-0-88385-634-4 
List: $13.50 MAA Member: $12.50 
Catalog Number NML-33 


The Mathematical Association of America 
1529 Eighteenth Street, N. W. 
Washington, D. C. 20036 


Proot 
Positive 


1. of our commitment to Mathematics 


"bho 78 
° AN INTRODUCTION TO THE HISTORY OF 
' MATHEMATICS, SIXTH EDITION 
3 Howard Eves, University of Maine and University of Central Florida 
“, : This classic introduction to the history of mathematics is specially 
-? e designed for undergraduate students. The completely revised sixth 


: edition includes new information on women in mathematics, more 
Pee .. oS illustrations and portraits, additional problem studies, and new 

ane * “Cultural Connections” that show students how mathematics is part of 
e.g our cultural heritage. 


Pa Sag: '- ABSTRACT ALGEBRA: AN INTRODUCTION 
ae De ors Se Thomas W. Hungerford, Cleveland State University 
ce pe - as. ’ a . - Written specifically for undergraduates, Abstract A/gebra can be used 
~ wf ‘ch hale c@ee’ st from one to three terms. Presenting rings before groups, this text 
oo '(' + provides clear explanations, many examples, and abundant exercises, 
° oo ee making this subject both interesting and accessible. 


MATHEMATICAL LOGIC: APPLICATIONS AND THEORY 
Jean E. Rubin, Purdue University 


To make mathematical logic understandable for undergraduate 
students, this text teaches a specific proof writing system, introduces 
new topics with illustrative examples, incorporates a large number of 
exercises, and features abundant applications of logic to mathematics. 


INTRODUCTION TO PROOF IN ABSTRACT 
MATHEMATICS 
Andrew Wohigemuth, University of Maine 


In this original new text, inference rules are used as guides in setting 
up proofs. Students learn about proofs by actually doing them, 
following a logical sequence of definitions, rules, and theorems. 


aa Saunders College Publishing 


THE POWERS 9 MATHEMATICS 
The Curtis Center, Independence Square West, Philadelphia, PA 19106-3399 


MATHEMATICS & BIOGRAPHY 


ATHEMATICS 


Queen&Servant 
ot Science 


ERIC TEMPLE BELL 


MATHEMATICS: 
QUEEN AND 
SERVANT OF SCIENCE 


E.T. Bell 


An absorbing account of pure and applied 
mathematics from the geometry of Euclid to 
that of Riemann and its application in Ein- 
stein's theory of relativity. The twenty chap- 
ters treat such topics as: algebra, number 
theory, logic, probability, infinite sets and 
the foundations of mathematics, rings, ma- 
trices, transformations, groups, geometry, 
and topology. Republished in 1987 with cor- 
rections and an added Foreward by Martin 
Gardner. 


454 pp., ISBN 0-88385-446-3 
Paperbound 


List: $15.95 MAA Member: $11.95 
Catalog Number QAS 


The book deserves a place in today’s 
market It is a much more popular work 
than most histories of the subject, and 
that is exactly what makes it acces- 
sible to students as well as to non- 
mathematicians. It is rewarding read- 
ing for. . . teachers and students at 
all mathematical levels. 


Morris Kline of The Courant Institute 


Order from: 


| WANT TO BE A MATHEMATICIAN 
An Automathography in Three Parts 


Paul R. Halmos 


This is a book to be read with interest 
by all those who know, or might want to 
know, what mathematicians and mathemat- 
ical careers are like. Paul R. Halmos be- 
gins with his school days and carries the 
reader swiftly through a career that has sus- 
tained itself at a high level since his first 
post-doctoral days at the Institute for Ad- 
vanced Study in 1939, where he worked 
with John von Neumann among others. Still 
going strong in 1988, Halmos has con- 
tributed much to logic, operator theory, er- 
godic theory, and the literature in general. 


442 pp., 1988, Paper, ISBN 0-88385-445-7 
List: $18.00 MAA Member: $15.00 


Catalog Number IWM 


MATHEMATICIAN 


It is a truly unique book, which nobody 
but Paul Halmos could have written. 
| think it will be a classic. 


Constance Reid 
The book its exciting, witty, and well 
worth the time invested in its study. It 


communicates what it means to be a 
mathematician. 


John Dossey in The Mathematics Teacher 


The Mathematical Association of America 


LV » 1529 Eighteenth Street, N.W. 
« e Washington, D.C. 20036 


(202) 387-5200 


Studies in Mathematical Economics 


Volume 25 in the MAA Studies in Mathematics 
Edited by Stanley Reiter 


420 pp. Hardbound 
ISBN-0-88385-027-X 


List: $42.00 
MAA Member: $31.00 


tanley Reiter, as editor, has brought togethera including the Kuhn-‘Tucker and related condi- 
distinguished group of contributors in this tions. The third contribution, by Mas-Collell, 
volume, in order to give mathematicians and uses the tools of differential topology, including 
their students a clear understanding of the is- Sard’s theorem, to study the competitive equi- 
sues, methods, and results of mathematical eco- libria of whole families of economies using a dif- 
nomics. The range of material is wide: game the- ferentiable point of view. Next Kuhn, building on 
ory; optimization; effective computation of equi- the work of Scarf, shows how methods based on 
libria; analysis of conditions under which Sperner’s lemma can be used to compute equi- 
economies will move to the greatest possible effi- libria. 
ciency under various forces, and the require- The next two chapters by Reiter and Hurwicz 
ments for the flow of information needed to explore the properties of systems that are not 
achieve efficient markets. purely competitive. They bring analytical and to- 
The material is interesting at all mathe- pological tools to bear to determine what condi- 
matical levels. For example, the initial article tions on the exchange of information are needed 
shows how even mathematically simple, concrete, to allow such markets to become optimally effi- 
two-person, nonzero sum gamesipresent us with cient. 
the complexities and dilemmas of choices in real Radner addresses one consequence of what 
life. At the other extreme, the final article, by De- Herbert Simon calls “bounded rationality.’ Man- 
breu, begins by using the power of Kakutani’s agers neither know all the facts nor do they have 
fixed point theorem to prove the existence of eco- unlimited ability to calculate. How should they al- 
nomic equilibria. In between, the reader will find locate their time? The tools used to answer this 
beautiful uses of calculus, topology, combina- question are fittingly probabilistic. 
torial topology, and other topics. In the final chapter, Debreu gives four exam- 
The chapters of this volume can be read inde- ples of mathematical methods in economics. 
pendently, although they are related. The book These four examples alone give a sense of the 
begins with Meyerson’s chapter on game theory breadth and nature of the field. 
and its theoretic foundations. The second chap- In this study, Reiter and his other contribu- 
ter, by Simon, starts with the familiar criteria for tors show the reader the subtlety and complexity 
maxima from calculus and goes on to develop of the subject along with the precision and clarity 
more general tools of mathematical economics, that mathematics bring to it. 
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Proceedings of the Colloquium held in October 1987 in Wash- 
ington, D.C. to discuss calculus reform. The mathematical com- 
munity is challenged to make the introductory calculus course 
into a pump that feeds more students into science and engi- 
neering, not a filter that cuts down the flow. These proceed- 
ings, with contributions from over eighty authors, show the 
full sweep of concems and approaches of all the groups involved 
in calculus reform, including those currently teaching traditional 
and innovative courses, those whose students or employees 
need to use calculus as a tool, and the department chairs, deans, 
and others who must mobilize the resources needed for this 


reform. 
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Carus Mathematical Monograph, #19 
Field Theory and its Classical Problems, 


by Charles R. Hadlock 
323 + xvi pp. Hardbound 
List: $28.00 MAA Member: $22.00 


Field Theory and its Classical Problems is one of those rare monographs that will cap- 
ture and hold a mathematically prepared reader (one reasonably familiar with basic cal- 
culus and linear algebra) until the last page. Mathematicians will be intrigued by the 
development which places Galois Theory in an historical perspective and allows it to 
unfold from the emergence of the classical construction problems to the discovery of 
their solutions. The book presents some of history’s greatest and most elegant mathematics. 
Read what some reviewers say about this monograph: 

‘‘Hadlock has produced a pedagogic masterpiece . . . His idea of beginning 

with the three Greek problems and letting them fire the reader’s spark plugs 

is brilliant . . . (his) ability to inject vitality and enthusiasm into mathematical 

text is remarkable. (George Piranian, University of Michigan) 


‘‘In the preface, the author claims to have written the book for himself. . . 
Fortunately for us, he chose to share his work with the mathematical commu- 
nity. I suggested the book as collateral reading in a one-semester course in Galols 
Theory and the students found it very readable and most helpful in establish- 
ing a motivation and direction . . . This is a delightful book for both student 
and teacher.’’ (John D. Leadley, Mathematical Reviews) 


Table of Contents 
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Constructible Lengths; Doubling the Cube; Trisecting the Angle; Squaring the Circle; 
Polynomials and Their Roots; Symmetric Functions; the Transcendence of T. 
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Constructions Revisited; Roots of Complex Numbers; Constructibility of Regular 
Polygons I; Congruences; Constructibility of Regular Polygons II. 
Chapter 3—Solution by Radicals 
Statement of the Problem; Automorphisms and Groups; The Group of an Exten- 
sion; Two Fundamental Theorems, Galois’ Theorem; Abel’s Theorem; Some 
Solvable Equations. 
Chapter 4—Polynomials With Symmetric Groups 
Background Information; Hilbert’s Irreducibility Theorem; Existence of Poly- 
nomials over Q with Group S.. 
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Casson’s Invariant for Oriented Homology 3-Spheres 
An Exposition 
Selman Akbulut and John D. McCarthy 
In the spring of 1985, A. Casson announced an interesting invariant of homology 3-spheres via con- 

structions on representation spaces. In the fall of that same year, Selman Akbulut and John McCarthy 
held a seminar on this invariant. These notes grew out of that seminar. The authors have tried to re- 
main close to Casson’s original outline and proceed by giving needed details, including an exposition 
of Newstead’s results. 


Mathematical Notes, 36 Paper: $19.50 ISBN 0-691-08563-3 


Analytic Pseudodifferential Operators 
for the Heisenberg Group and Local Solvability 


Daryl Geller 
Many of the operators one meets in several complex variables, such as the famous Lewy operator, 
are not locally solvable. Nevertheless, such an operator L can be thoroughly studied if one can find a 
suitable relative parametrix—an operator K such that LK is essentially the orthogonal projection onto 
the range of L. The analysis is by far most decisive if one is able to work in the real analytic, as op- 
posed to the smooth, setting. Here the author develops an analytic calculus for the Heisenberg group. 
Mathematical Notes, 37 Paper: $29.50 ISBN 0-691-08564-1 
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Carus Mathematical Monograph No. 20 


THE GENERALIZED 
RIEMANN INTEGRAL 


Robert M. McLeod 
275 + xiii pages. Hardbound. 


List: $24.95 MAA Member: $22.00 


Until recently the most powerful and beautiful tools of integration theory have been 
accessible only to the privileged few whose studies extended through the Lebesgue 
integral. Now a new integral, a generalization of the familiar Riemann integral, has been 
discovered which has all the power and range of the Lebesgue integral but which can 
be readily understood by anyone who has studied calculus through the sophomore level. 

The Generalized Riemann Integral is the first book-length presentation of this 
exciting new development in integration theory. Because of the clarity and organiza- 
tion of the exposition and the inclusion of exercises designed to actively engage the 
reader in the material, it is eminently suitable for use as a textbook at the advanced 
undergraduate or beginning graduate level. Furthermore, because it presents within 
a single volume results which were previously scattered throughout many research 
publications, it will undoubtedly also be of considerable interest to specialists in inte- 
gration theory. 


List of Contents: Definition of the Generalized Riemann Integral « Basic Properties of the Inte- 
gral « Absolute Integrability and Convergence Theorems « Integration of Subsets of Intervals « 
Measurable Functions ¢ Multiple and Iterated Integrals « Integrals of Stieltjes Type « Compari- 
son of Integrals ¢ Appendix: Solutions of In-Text Exercises 
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NEW & CLASSIC 
TEXTBOOKS 


Principles of Real Analysis 
Second Edition 


Charalambos D. Aliprantis and 

Owen Burkinshaw 

Purdue University, Indianapolis, Indiana 

“A well-written introduction to measure and 

integration with ample exercises and examples. 

... The text emphasizes and uses the algebraic 

structure of the various function spaces con- 

sidered and this certainly enhances the appeal.” 
—THE AMERICAN MATHEMATICAL MONTHLY 

An ideal text for senior-undergraduate and 

first-year graduate courses, this book treats all 

topics rigorously, with illustrated examples and 

more than 450 exercises of varying degrees of 

difficulty. 

January 1990, c. 304 pages, $39.95 

ISBN: 0-12-050255-0 


Problems in Real Analysis 
A Workbook with Solutions 


Charalambos D. Aliprantis and 
Owen Burkinshaw 


Although based on the major textbook, 
Principles of Real Analysis, this book can be 
used as an independent source. It will be an 
invaluable tool for students who wish to develop 
a deep understanding and to acquire proficiency 
in the use of integration methods. 


January 1990, c. 296 pages, $29.95 
ISBN: 0-12-050256-9 


Elements of Differentiable 
Dynamics and Bifurcation 
Theory 

David Ruelle 


Institute des Hautes Etudes Scientifiques 
Bures-sur- Yvette, France 


Written by an eminent researcher in the field, 
this introductory text presents the elements of 
the theory of differentiable dynamical systems 
to students of mathematics and the natural 
sciences in a lucid and yet rigorous way. 

1989, 200 pages, $27.50 
ISBN: 0-12-601710-7 


Mathematics for Dynamic 
Modeling 


Edward J. Beltrami 

State University of New York at Stony Brook 

“This is an excellent book aimed at under- 
graduates studying mathematical models which 
involve differential equations....The author has 
read a lot, and produced a very interesting 
variety of models to illustrate his basic themes 
of dynamical modelling. Within his self- 
imposed restriction to an emphasis on the 
dynamic aspects, the book is very successful, 
and provides a useful source for undergraduate 


courses.” —MATHEMATICAL REVIEWS 


1987, 277 pages, $35.00 
ISBN: 0-12-085555-0 


Geometric Measure Theory 
A Beginner’s Guide 


Frank Morgan 
Williams College 
Williamstown, Massachusetts 


“Morgan's book is a pedagogical master- 
piece. —ELLIOTT LIEB 


1988, 153 pages, $24.50 
ISBN: 0-12-506855-7 


@ 


ACADEMIC PRESS 
Harcourt Brace Jovanovich, Publishers 
Book Marketing Department #803030 
1250 Sixth Avenue, San Diego, CA 92101 


CALL TOLL FREE 


1-800-321-5068 


Quote this reference number for free 
postage and handling on your 
prepaid order =» S03030 


or 
Call 619-699-6400 for textbook 
adoption information. 


Prices subject to change without notice. ©1990 by Academic Press, Inc, 
All Rights Reserved. TS —S03030 


Se a 
Random Walks and 
Electric Networks, 


by J. Laurie Snell and Peter Doyle 
xiii + 159 pages. Hardbound 
List: $26.50 MAA Member: $21.00 


In this newest addition to the Carus Mathematical Monographs, the authors 
examine the relationship between elementary electric network theory and ran- 
dom walks, at a level which can be appreciated by the able college student. We 
are indebted to them for presenting this interplay between probability theory 
and physics in so readable and concise a fashion. 

Central to the book is Polya’s beautiful theorem that a random walker on an 
infinite street network in d-dimensional space is bound to return to the starting 
point when d= 2, but has a positive probability of escaping to infinity without 
returning to the starting point when d= 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
networks, establishing the connection between the electrical concepts of cur- 
rent and voltage and corresponding descriptive quantities of random walks re- 
garded as finite state Markov chains. Part II deals with the idea of random walks 
on infinite networks. 


Table of Contents 
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Random Walks on More General Networks 
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A broad spectrum of math texts 
for every level of learning! 


Calculus like you’ve never 
seen it before! 


Calculus 

Ross L. Finney and George B. Thomas Jr., 
both of Massachusetts Institute of 
Technology 


Calculus with Analytic Geometry, 
Third Edition 
John B. Fraleigh, University of Rhode Island 


Calculus and Analytic Geometry, 

Seventh Edition 

George B. Thomas Jr. and Ross L. Finney, 

rtsbe) Massachusetts Institute of Technology 
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We’re leading the way 
in precalculus! 


Algebra and Trigonometry, 

Fifth Edition 

M. L. Keedy and M. L. Bittinger, Purdue 
University, and Indiana University-Purdue 
University at Indianapolis, respectively. 


Precalculus: Functions and Graphs 
Franklin Demana and Bert K. Waits, both of 
The Ohio State University 


Precalculus: Functions and Graphs 
Jimmie Gilbert and Linda Gilbert, both of 
University of South Carolina-Spartanburg 


Also on our math list 
for 1990... 


Elementary Algebra: Concepts and 
Applications, Third Edition 

M. L. Bittinger and M. L. Keedy; 
Indiana University-Purdue University at 
Indianapolis, and Purdue University, 
respectively. 


Intermediate Algebra: Concepts and 
Applications, Third Edition 

M. L. Bittinger, M. L. Keedy, and David 
Ellenbogen; Indiana University-Purdue 
University at Indianapolis, Purdue University, 
and St. Michael’s College, respectively. 


Linear Algebra, Second Edition 
John B. Fraleigh, University of Rhode Island 


Foundations of Higher Mathematics: 


Exploration and Proof 
Daniel Fendel with Diane Resek, both of 


San Francisco State University 


Applied Combinatorics with Problem 
Solving 

Bradley W. Jackson and Dimitri Thoro, 
both of San Jose State University 


Elementary Geometry from an 
Advanced Standpoint, Third Edition 
Edwin Moise, Emeritus, Queens College of the 
City University of New York 


Vector Analysis for Engineers and 
Scientists 

P. E. Lewis and J. P. Ward, both of 
Loughborough University of Technology (U.K.) 
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V LE STUDIES IN 
MATHEMATICS 


Studies in 
Computer Science 


Edited by Seymour Pollack 
408 pp. Hardbound List: $33.50 MAA Member: $25.00 


STUDIES IN COMPUTER SCIENCE continues the tradition of excel- 
lence of the respected series MAA Studies in Mathematics. Written by 
computer scientists for mathematicians, the book presents a readable 
and balanced discussion of the role of mathematics in computer sci- 
ence, and the contributions of computing to mathematics. 

Opening with an historical overview of the development of com- 
puter science, the narrative continues with three articles on the nature 
of computer programs and the programming process, a discussion of 
computational complexity, and a concluding group of articles on the 
impact of computer science on artifical intelligence, numerical analy- 
sis, statistics, and simulation. 

In the introduction the editor expresses his hope that the book will 
provide ‘an interesting look at an explosive field in the process of be- 
coming.’’ What an understatement! The process is well advanced; the 
impact of computer science on mathematics and its teaching is already 
immense. And this book is not just interesting, it is exciting and impor- 
tant as background reading for anyone interested in the interactions 
between mathematics and computer science. Teachers will find that it 
enriches their teaching and students will find it rewarding supplemen- 
tary reading. Order your copy today! 
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College Algebra, 3rd Edition 
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Instructor’s Manual, Student Problem 
Solving Guide, Transparency Masters, 
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Algebra: A Course for 

College Students, 2nd Edition 
by Arthur Goodman (Queens College 
of the CUNY) and Lewis R. Hirsch 
(Rutgers University) * Supplements: 
Instructor’s Manual, Study Guide, 
Geometry Supplement to Accompany 
Understanding Elementary Algebra, 
Tutorial Software, Natural Language 
Mathematics Software by Mathens, Inc., 
Instructional Videotapes, Westest 


Understanding Intermediate 
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' College Students, 2nd Edition 
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Supplement on Sequences and Series, 
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THREE IMPORTANT BOOKS ON 
MATHEMATICS EDUCATION 


RESPONSES TO THE CHALLENGE: 
KEYS TO IMPROVED INSTRUCTION 

BY TEACHING ASSISTANTS AND PART- 
TIME INSTRUCTORS 


The Committee on Teaching Assistants 
and Part-Time Instructors, 
Bettye Anne Case, Chair 


The committee that prepared this volume 
has been gathering information on policies, 
practices, successes, failures, and goals 
connected with the use of teaching assis- 
tants and part-time instructors. In this vol- 
ume the committee presents and analyzes 
data showing who these teachers are, the 
extent and nature of their teaching duties, 
and the efforts made to assimilate them into 
the faculties. This volume will help you 
to see how your department compares na- 
tionally, to decide what steps you and your 
school should take, and to understand what 
additional resources might be needed. 


280 pp., 1988, ISBN-0-88385-061-3 
List: $15.00 
Catalog Number NTE-11 


GUIDELINES FOR THE CONTINUING 
MATHEMATICAL EDUCATION 
OF TEACHERS 


Committee on the Mathematical Education 
of Teachers 


These guidelines will be very useful to 
school teachers and supervisors, to college 
administrators who plan continuing edcu- 
cation programs, to the college teachers 
who design and teach courses for teach- 
ers, and to school administrators who must 
think about requirements for continuing ed- 
ucation of teachers. The guidelines are rich 
in specifics on course content, giving clear 
objectives for all courses. Teachers who 
want to dig out material for themselves or 
in order to enrich their classes will find the 
more than 500 references provided here un- 
der various topics an invaluable aid. 


90 pp., 1988, ISBN-0-88385-060-5 


List: $8.00 
Catalog Number NTE-10 


THE USE OF CALCULATORS IN THE 
STANDARDIZED TESTING OF MATH- 
EMATICS 


John W. Kenelly, Editor 


The calculator is a universal tool for all those 
involved in quantitative work from science 
and engineering to business. Routine use 
of calculators is part of the training and test- 
ing of students headed for these fields. But 
this is not yet the case in mathematics. This 
symposium, jointly sponsored by the MAA 
and The College Board, sets out clearly the 
theoretical and practical issues that must be 
addressed as calculators are brought more 
fully into the mathematics curriculum. This 
is a practical group concerned with specific 
tests and test items. General theoretical 
considerations are set off by the specifics 
of individual test items and students’ suc- 
cess rates on them. The Ohio Early College 
Mathematics Placement Test is reported on 
in detail by Joan R. Leitzel and Bert K. 
Waits. James W. Wilson and Jeremy Kil- 
patrick examine the theoretical issues in 
the development of calculator-based tests. 
John Harvey, now Chair of the MAA’s Com- 
mittee on Placement Examinations looks at 
the issues surrounding calculator use on 
placement examinations, as well as giving 
an overview of the symposium and a survey 
of developments through 1988. 


vi + 50 pp., 1989, Copublished by the MAA 
and The College Board. LC No. 88-064- 
100 
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Greed Is (Sometimes) Not Enough 


ANATOLE BECK,* University of Wisconsin, Madison, WI 53706 


ANATOLE BECK: I was born in the Bronx and educated at Stuyvesant High 
School, Brooklyn College, and Yale University, where I studied with Shizuo 
Kakutani. After two postdoctoral years at Tulane and in Europe, I joined the 
faculty of the University of Wisconsin in 1958. I left Wisconsin in 1973 with 
the intention of spending my remaining years as Professor of Mathematics at 
the London School of Economics, returning after two years for personal 
reasons. I have retained my professional and personal ties to the LSE, 
however, and work there whenever my schedule permits. 


Introduction. Many analyses in economic theory or in real life attempt to achieve 
global optimization by piecewise response to local considerations. Examples include 
the regulation of whole economies through the self-seeking strategies of individual 
participants. In the real world such theories are the subjects of heated debates [3, 6], 
not to say global thermonuclear confrontation [4,7]. In the world of mathematics, 
we sometimes find examples in which the (plausible) argument that local advantage 
produces global optimization is not sustained by the facts. 

We will deal with some geometric situations in which we seek to optimize a gain 
by a sequence of steps. In some cases, the choice of the best move at each step does 
not yield a global optimum. A simple example concerns the removal of non-overlap- 
ping squares from a rectangle so as to maximize the total area removed. 


DEFINITION. In the removal of areas from a geometric figure, the greedy strategy is 
the one which prescribes at each step the removal of the largest allowable area. 


DEFINITION. The square of side s, in whatever position, will be denoted as Q.,. 


Example. In removing five squares from a 6 X 7 rectangle, the greedy strategy 
gives 1 X Q,.,4 x Q,, for a total area of 40. By contrast, a combination of 
1X Q,,2 X Q,,2 X Q, gives a total of 42, the maximum possible. 


DEFINITION. A figure S is called gross if for every n © N, the removal of n figures 
of a specified sort is always best achieved by the greedy strategy. If, for anyn © N, 
some other choice is better, then S is called subtle. In that case, the number n is called 
a subtlizer for S. 


A natural question, which has been circulated by word of mouth for a long time, 
is whether a square is gross or subtle with respect to the removal of discs [1]. We will 
not attempt to answer that question here. However, in the dual problem, there are 
infinitely many subtlizers for a disc with respect to the removal of squares. 

The 6 X 7 rectangle is not unique for subtlety. In the first place, every rectangle 
close enough to it has the same property. More significantly, almost every rectangle 
has infinitely many subtlizers, as we shall see below. 


*The research of this paper was supported in part by the Wisconsin Alumni Research Foundation 
and in part by the London School of Economics. 
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Triangles. We will begin with the problem of removing squares from an isosceles 
right triangle. Depending on the number of squares removed, it might pay to invest 
in a special arrangement for the first few squares. Consider FIGURE 1, where we see 
3X Qi, 3 X Qi, and 6 X Q; 2, for a total of 12 squares. This removes all the 
triangle of side 1 except for 12 triangles of side 1/12 with a total area of 
12(1/2)(1/12)? = 1/24. For the same triangle, the greedy strategy gives 1 x Q, /) 
2X Qi, 4 X Qig, and 5 X Q) 16. This leaves 3 triangles of side 1/8 and 10 of 
side 1/16, for a total of 3(1/2)(1/8)? + 10(1/2)(1/16)? = 11/2°, which exceeds 
1/24. Thus, 12 is a subtlizer for the isosceles right triangle. Note that we have not 
proven that the alternative strategy is the best, but only that it is more effective than 
greed for n = 12. 


Fic. 1 


We consider next the right triangle whose legs are 1 and 1 + e, and whose area is 
A = (1+ €)/2. The largest square it contains has side a = 1/2 + ¢/(2e + 4), and 
the removal of Q, leaves two similar triangles whose hypotenuses are in the ratio a 
to b=1/2 — e/(2e + 4). Their areas are a7A and b7A respectively. The greedy 
strategy now chooses the largest square, which is in the first of these, and is 0,2. We 
will assume that e is small enough so that a? < b’, which will have as corollary 
a* < b. Then the third greedy square will be Q,,, from the smaller triangle. The next 
four greedy squares will be 1 X Q,;, 2 X Q,2,, and 1 X Q,,2, and the next 5 (since 
a* < ab’) will be 1 X Q,4, 3 X Q,3,, and 1 X Q,2,2. The removal of these twelve 
squares will leave triangles of the following sizes: 2 X ab*A, 1 x b7A, 1 x a‘A, 
4 x a*bA, 4 X a*b’A, and 1 X ab3A. Thus, the total left by the greedy strategy will 
be the sum of those terms, which converges to 11/2° as e > 0. On the other hand, 
the same squares chosen before by the alternate strategy (all lying within the 
isosceles right triangle) will leave over 1/24 + ¢/2, which converges to 1/24. Thus, 
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for e small enough the right triangle with legs 1 and 1 + e¢ is subtle, with subtlizer 
12. 

The number 12 is not unique. A similar phenomenon occurs at 24, 48, etc. In 
addition, the greedy strategy gains another Q, ,,. for n = 25, while the alternate 
gains only Q, /4g, but it is not enough to close the gap, nor is it for n = 26, but at 
n = 27, they are equally good. Similar analyses hold for n = 49, etc. 

Actually, 3 - 2” + k is a subtlizer as long as 


1/(3 . 2"t1) _ k/(3 . gn+ty? < 1/2"*3 4 1/2"*4 _ (k 4 1)/2?"*4, 


Le., 0O< k < (3 - 2” —9)/5. A similar calculation shows that 3 - 2” — k is a subtl- 
izer so long as 


1/(3 -2"*1) + k/(3- 2")? < 1/2743 +1284 + (k + 1) /22"44, 


Le. O< k < (2" + 3)/19. Thus, all the numbers between (56 - 2” — 3)/19 and 
9(2”*' — 1)/5 are subtlizers, Vn € N. Dividing by 3 - 2”*1, we see that the average 
density of subtlizers among the first N numbers exceeds 1/10 for N large enough. 

The role of the number 3 in this alternative strategy can be assigned to 5 as well. 
We can start with seven squares, 1 X Q,,5 and 6 X Q, /s, to leave five triangles of 
leg 1/5. Then 5 X Qj, 10 X Qj /99, 20 X Qy 49 and 40 X Q, 1g, leaving 80 trian- 
gles of side 1/80, for a total area of 1/160. Pursuing greed for 82 squares leaves 21 
triangles of side 2~* and 22 of side 27°. These have a total area of 53/2!?, which is 
more than 1/160. We can construct examples based on all odd numbers. 

Our next example concerns a convex figure generated from the triangle with legs 
1 and 1 + ¢ by replacing the hypotenuse by an arc of a circle. If the curvature is 
sufficiently small, the first n squares given by the greedy strategy are arbitrarily 
close to those in that triangle, as to both size and location. (The smallness of the 
curvature required depends on n and the degree of approximation desired.) Thus, if 
e is small enough to make 12 a subtlizer for the triangle, then it will serve for the 
figure as well, when the curvature is very small. As an alternative to making the 
curvature small, we could take the legs small, but in the same ratio. Keeping 
the same value for e (one which makes 12 a subtlizer for the triangle), we can choose 
a small value of 6 > 0 and draw a chord of length 5¥2 + 2e + e” in a circle of 
radius 1. Completing the triangle with legs of length 6 and 6 + de, we get a figure 
similar to one of the previous ones, in which 12 is a subtlizer if 6 is small enough. 


Discs. We consider next the removal of squares from a disc. The first square 
removed is inscribed in the disc and its sides determine the orientation of all the 
other squares. Following the greedy strategy, the next square is one of the four 
which have one side included in a side of the first square, with the opposite side a 
chord of the circle, and the third, fourth, and fifth are the other three. The removal 
of these five squares leaves in the disc twelve areas, four of which are segments of 
the circle, while eight are figures of the sort described in the previous section. The 
sixth square is chosen from one of these by being placed so that two sides are 
included in the straight edges of the figure, while one vertex lies in the curved side 
(there is only one way to do this). The next seven squares are chosen from the seven 
figures congruent to that one in the same way, so that now twenty figures remain, 
four of which are segments. In this way, the greedy strategy selects its squares. 
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As the remaining pieces become smaller, we are especially interested in those in a 
particular portion of the circumference. Consider the four vertices of the first 
square, and observe that for each of them there is an arc in which the tangent to the 
disc at any point of the arc makes an angle with one side of the square whose 
tangent m is so near to 1 that the right triangle with sides 1 and m is subtle with 
subtlizer twelve. As the greedy strategy continues, the figures left over are just such 
triangles, modified by replacing the hypotenuses with arcs of the original circle. 
When these are small enough, and in the part of the disc where the slope of the 
circle is close enough to 1, then that figure will be subtle, with subtlizer twelve. 

If we now choose one of these figures, 7, and find the first value of n for which 
the greedy strategy has taken twelve squares out of 7, then a change of those twelve 
squares into twelve others will increase the area extracted, making n subtlizer for 
the disc. Clearly, this happens for infinitely many values of n. 

It follows that the greedy strategy will eventually leave subtle figures as compo- 
nents along the indicated arcs. For any such figure, greed will eventually select 
twelve squares, and these will be the twelve which it would select if there were not 
other places to choose from. The fact that other squares in other parts of the disc 
will be chosen between these is irrelevant. However long it takes to choose the 
twelfth, that number is a subtlizer for the disc since just replacing those twelve 
(while keeping all the others) is already more efficient than greed. 

Of course, this analysis gives no hint of which number zn is the first subtlizer, and 
indeed, no actual subtlizer is known, but it is clear that there are infinitely many. 


Rectangles. If discs are complicated (those unruly round edges!), then perhaps 
rectangles are simple. We have already seen, in the 6 X 7 rectangle, that they are not 
all gross. In general a rectangle (n* +n) X (n? +n +1) can be exhausted by 
2n +1 squares (1X Q2, nXQ,, and nXQ__,), but not if one is greedy (in 
which case one gets only 1 X Q,2,, and 2n X Q,). But these are very special cases. 
Note that the rectangle 6 X (7 — «) is also subtle providing that ¢« is small enough. 
Indeed, we can cover all but 6e — e* by 1 X Q,, 2 X Q,, and 2 X Q,_.,. The greedy 
strategy (1 X O,¢,4 X Q,_,) covers all but (2 + 4e)(1 — e) = 2 + 2e — 4e?, which is 
more unless e > (¥10 — 2)/3, which is quite a bit. That is, every rectangle with a 
ratio of sides between 6/7 and d = 18/(19 + 10) is subtle, and similar analyses 
apply to the other rectangles (n* + n) X (n? + n + 1) mentioned above. 

On the other side, we can examine rectangles (6 — ¢) X 7, where the combination 
1X O4-seyar2 X Qore/ar2 X Q3--/2 covers all but (4 — 5e/4)(7e/4) = Te — 
35e*/16. Greed gives 1 X QO, _, and 4 X Q,,,, which leaves over (1 + 22)(2 — Se) 
= 2 — 3e — 10e?, which is more unless « > 4(V26 — 4)/25, that is, for rectangles 
with ratio between c = (6.64 — 4/26 /25)/7 and 6/7. Thus, every rectangle with 
ratio between c and d is subtle. 

However, there are other subtle rectangles. If 0 < a < 1 is the ratio of the sides 
of a rectangle, and 1 = na + B, where 0 < B < a, then if the rectangle a X B is 
subtle with subtlizer n,, it follows that the original rectangle has n+ n, as a 
subtlizer. Similarly if the rectangle a X (a + B) is subtle with subtlizer n,, then 
n+n,-—1 is a subtlizer of the original rectangle. These observations allow us the 
following conclusions: We know that a rectangle R is subtle if its ratio x lies 
between c and d. Also, if 1/x =n, + 7, where 0 <r, <1, then R is subtle if 
c<7,<d. Recursively, 1/r,,;=n,+7, where 0<7r,<1, and R is subtle if 
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c <4r,<d for any j © N. We now need to remark that for almost all x,c <r,<d 
for infinitely many 7 © N. This follows from the fact that the transformation 
T(x) = 1/x — [1/x] is an ergodic transformation in [0,1) with respect to the 
measure p generated by p((0, ¢]) = In(1 + ¢)/In 2 [3]. Thus, for almost all0 < x < 1, 
we have c < r, < d fora set of 7 © N with asymptotic density (In(1 + d) — In(1 + 
c))/In2. A fortiori, there are infinitely many subtlizers for each such x, since 
ny + +++ +n, + 518 a subtlizer for each 7 € N with c < 7, < d. Unfortunately, the 
density of numbers j for which the removal of j squares by greed leaves a rectangle 
of the indicated ratio is 0, since the expected value of n, is infinite [2]. More to the 
point, I conjecture that the set of subtlizers of almost every rectangle has density 0, 
which would be true if it were known that for every rectangle x x 1 with x small 
enough, and for every n EN, the optimal choice of squares includes at least 


one Q,. 


Discs and spheres. We return now to an old question. In removing discs from a 
square, is the square gross or subtle? To the best of my knowledge, the answer is not 
known. Even the fact that the greedy strategy will remove 100% of the area is an 
interesting problem, and has an elementary solution: 


THEOREM. If nonoverlapping open discs are removed from a square by the greedy 
strategy, the sum of the areas of the infinitely many discs removed will equal the area of 
the square. 


Proof. Among the discs removed by the greedy strategy are (first) the disc 
inscribed in the square and then the four discs in the four corners followed later 
(but not immediately) by the four in the corners, and (even later) by the four in the 
corners, ad infinitum. These clearly do not exhaust the area of the square, but leave 
infinitely many “trianguloids,” bounded on two sides by tangent circles, and on the 
third by a circle or line tangent to the other two. The result we wish follows from 
the fact that if each such trianguloid has discs removed from it as shown in FIGURE 
2, these will add up to more than half its area, leaving a new generation of 


FIG. 2 
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trianguloids to be treated similarly. Since all the discs we are removing are included 
among those taken (eventually) by the greedy strategy, we see that infinitely many 
generations of this process leaves a set of area 0. 

To show that the discs removed are more than half the area, we need only prove 
the fact for the “quadroid” marked in FiGure 3. If all the sides of the original 


FIG. 3 


trianguloid were arcs of circles, then the fact would follow from the lemma below, 
whose proof we leave as an amusement for the reader: 


LEMMA. If, in the triangle ABC, the inscribed circle meets the triangle in the points 
X, Y, and Z (AZ=AY, BZ = BX, CX = CY), then Area( XYZ) < 
(1/2) Area( AYXB). 


If one side of the trianguloid is a line segment, then the same fact follows as a 
limit (or by an analogous argument). Thus we have a proof of this theorem by 
elementary geometry. QED 


The method shown does not generalize to the removal of spheres from a cube, 
nor to higher dimensions. It is true, but not obvious, that the theorem holds in these 
dimensions, but the only proof I can construct is one which mimics the Vitali 
Covering Theorem (mutatis mutandis). A more elementary proof would be of 
interest. 


Conclusion. In geometry, greed is sometimes not the best strategy. The lesson for 
other aspects of life is obscure. 
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DRAGAN JANKOVIC: I received my Ph.D. from the University of Belgrade, 
Yugoslavia, in 1983. Since then I have spent one year at the University of 
Arkansas, Fayetteville, and three years at Texas Tech University, Lubbock. 
At present I have a year’s study leave from my position at East Central 
University, Ada, Oklahoma. 


T. R. HAMLETT: I received my B.A. (71) from Arkansas College and my 
M.S. (73) and Ph.D. (75) from the University of Arkansas under the 
direction of Paul Long. After three years military duty at Wright-Patterson 
AFB, Ohio, I spent three years at Arkansas Tech University, one year at 
Arkansas College, and am concluding my eighth year at East Central 
University, Ada, Oklahoma. 


1. Introduction. The subject explored in this paper has a lengthy and interesting 
historical development along two lines which seem to converge in the 1975 paper of 
Samuels [26]. One line is a generalization, or unification, of the concept of closure 
point, w-accumulation point (limit point in 7, topological spaces), condensation 
point, and point of the second category. The other line is a study of the properties 
which spaces have “locally,” and which, in several interesting cases, implies that the 
space has “globally.” This last line is probably best exemplified by the 1945 paper 
of Vaidyanathaswamy [31]. 

Although the topic of ideals in general topological spaces is treated in the classic 
text by Kuratowski ((16], [17]) and also in [30], it is not found in most modern texts. 
Consequently, we have written this paper as a supplement to an advanced under- 
graduate, graduate, or topics course in General Topology, as well as a topic of 
interest to a specialist in General Topology. In addition, our subject has an excellent 
potential for application in other branches of mathematics. The Cantor-Bendixson 
Theorem (Theorem 5.2), Example 4.3, and Theorem 7.9 exemplify this potential. 

In what follows, we will attempt to develop this topic in a logical, systematic 
fashion, and offer some new results, improvements of known results, and some 
applications. We have included examples liberally which are intended not only to 
reinforce the abstract concepts, but also to point out that ideals are present in some 
familiar places which the reader may have already visited. 

We will use the following notational conventions: a “space” will always mean a 
topological space; given a space (X,7) and a point x € X, W(x) will denote the 
open neighborhood system at x (i.e... W(x) = {U € t:x © U}); cl, (A) (resp., Int, 
(A)) will denote the closure (resp., interior) of A with respect to tT (we will omit 7 if 
there is no possibility of confusion). The following important ideals in a space 
(.X, 7) will be denoted as follows: 

JF, — the ideal of finite subsets of X, 

F,— the ideal of countable subsets of X, 
JF ,— the ideal of closed discrete sets in (X, 7), 
SF, —the ideal of nowhere dense sets in (X, 7), 
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JF, —the ideal of meager sets in (X, 7), 
JS —the ideal of scattered sets in (X, 7), and 
JF,—the ideal of relatively compact sets in (X, 7). 
We will use the abbreviations “iff” for “if and only if’ and “— ” to mean 
“implies.” The symbol ™ is used to indicate the end (or omission) of a proof. 


2. Fundamentals. Let us begin by defining an ideal. 


Definition 2.1. A nonempty collection % of subsets of a set X is said to be an 
ideal on X, if it satisfies the following two conditions: 

(A) AG ¥ and BCA —>BE-F heredity). 

(B) AG ¥ and BE F%> AUB EF (finite additivity). 


Note that the collection of complements of sets in a proper ideal (one not 
containing the space) will be a nonempty collection of nonempty sets closed under 
the operations of superset and finite intersection. Such a collection is called a filter; 
hence an ideal is sometimes called a dual filter. 

It is a common practice to say that a space “has a property F locally at a point 
x” if x has some neighborhood U such that U has the property. Extending this idea 
to subsets, we say that a subset A has the property F locally at a point x if x has 
some neighborhood U such that UN A has the property [16]. Note that the point x 
may or may not be in the subset A. Typically, ideals are defined as a collection of 
subsets with some particular property. The set of points in a space at which a subset 
does not have this property is a fundamental concept in what follows. 


Definition 2.2 [16]. Let (X,7) be a space with an ideal .% on X. Then A*(¥%, 7) 
={xExX:AN UEF for every UE V(x)} is called the local function of A with 
respect to ¥ and T. 


When there is no ambiguity, we will write A*(#%) or simply A* for A*(.%, 7) and 
call it the “local function of A.” We remark that in Definition 2.2 W(x) may be 
replaced by the neighborhood system of x. 

The simplest ideals are {@} and A(X) = {A:A C X}. Observe that A*({$¢}) = 
cl(A) and A*(PA(X)) = for every A C X. Now let 4% be the ideal of finite 
subsets of X, and let A® be the set of w-accumulation points of A C X(x © A® iff 
U - A is infinite for every U € W(x)). It is not difficult to check that A® = A*(.F). 
If % is the ideal of countable subsets of X, A*(.%,) is precisely the set of 
condensation points of A(x € X is a condensation point of AC X if UNA is 
uncountable for every U € W(x)). Thus A*(.#) is a generalization of closure point, 
w-accumulation point and condensation point. We will consider these examples 
again later in this section. 

The following theorem ([16], [31]), contains many basic and useful facts concern- 
ing the local function. 


THEOREM 2.3. Let (X,7) be a space with ¥ and Y ideals on X, and let A and B 
be subsets of X. Then 

(a) AC B—> A* Cc B*, 

(b) $C F—> A*X(F) C A*(F), 

(c) A* = cl(A*) C cl(A) (A® is a closed subset of cl(A)), 

(d) (A*)* ¢ A®, 

(e) (A U B)* = A* U B*, 
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(f) A* — B* =(A — B)* — B* C(A —- B)*, 
(g:) Ver>UNA*=UN(UN A)* C(UN A)*, and 
(h) TE %>(AUI)* = A*=(A-J)*. 8 


Let P(X) be the class of all subsets of X. If ( )°: A(X) > A(X) is a function 
satisfying (1) ¢ = 9, (2) AGC A(X) PACA’, (3) AECA(X), BEA(X) > 
(A UB) =A°U BS, and (4) A € A(X) — (A‘)S = AS; then ( )° is called a 
Kuratowski closure operator and {A © A(X):A = A‘} is a collection of closed sets 
for a topology on X. 

It is not difficult to check that if d: A(X) — A(X) is a function satisfying 

(1) d(o) = 4 

(2) d(A U B) = d(A) U d(B), and 

(3) d(d(A)) € d(A) 
then ( )‘ : A(X) ~ A(X) defined by A‘ = A U d(A) is a Kuratowski closure 
operator on A(X). Note that d does not necessarily coincide with the derived set 
operator in the generated topology. 

Since *: A(X) > A(X) satisfies all the required conditions for the function d 
(Theorem 2.3, (e), (d); 6* =), we have that cl*(A) = A U A®* is a Kuratowski 
closure operator. It is not difficult, however, to verify directly that cl* is a closure 
operator. We will denote by TS ) the topology generated by cl* where 7 is the 
original topology on X, that is, T*(%) = {U C X: cl *(X — U) = X — U}. When 
no ambiguity is present we will simply write 7* 

We have already observed that if %= {¢}, then A* = cl(A). Hence in this case 
cl*(A) = cl(A) and r* = +. If = A(X), then A* = o for every A C X and hence 
7*(S#) is the discrete topology. The ideals {¢} and A(X) illustrate extreme cases 
where +* = 7 and r* = discrete, respectively. Since for every ideal . on X we have 
{6} C ¥C P(X), it follows from Theorem 2.3(b) that tC 7* C discrete. More 
specifically, if ~ and Y are ideals on ( X, 7) such that %C Y, then r*(%) C 7*(F) 
(apply Theorem 2.3(b)). 

Our next example illustrates the possibility that #4 A(X) and yet 7*(#) is 
discrete. 


Example 2.4. Let us denote by N the set of natural numbers. If we endow N 
with a “partition” topology 7 generated by the base {{2n — 1,2n}: n © N} and 
consider the ideal 4%, then it is clear that N *(F) = = g. Therefore 7*(.4%,) is discrete. 
More generally, X (S, ) = piff(X,7) is nearly discrete (i.e., each x € X has a 
finite neighborhood) itt T™(¥%,) is discrete. 

Let us agree to denote by "44 (resp., A%”) the derived set of A in (X, 7) (resp., 
(X,7*(.£)) (x € A%, that is, x is a limit (or accumulation) point of A iff (U — {x}) 
A # @ for every UEWN(x)). 

Obviously, A” C A4 and also x € A” iff x € cl*(A — {x}) iff x © (A — {x}) 
U (A — {x})* iff x © (A — {x})*. This means that x © A®” iff (U— {x})N A 
€ # for every U © W(x). Clearly, A” ¢ A*. If {x} ©, then by Theorem 2.3 
(h), x € A” iff x € A*. The ideals in the following two examples have the property 
that {x} © ¥ for each x € X. 


Example 2.5. Let ¥% be the ideal of finite subsets of a space (X,7). Since 
{x} © ¥% for each x © y, A* = A®". As we have already observed that A* = A®, 
we conclude that w-accumulation points of A in (X, 7) are precisely the limit points 
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of A in (X, 7*(.4%)). Since (X, r*) is T, ({x} € 4 > {x}* =o > cl*{x} = {x}) 
and the concepts of w-accumulation and limit point coincide in 7, spaces, we have 
that the sets of w-accumulation points of A C X in (X,7) and (X, 7%) coincide. 
Note that, A*(.4%) = Af iff 7 = 7™(¥,) iff (X, 7) is Ty. 

In Example 2.4 we exhibited a nearly discrete space and noticed that T™*(F) 1S 
discrete. If we start with the indiscrete space (X, w), then A*(F) =XifAEs#, 
and A*(¥%,) = @ if A ©. %. Therefore, cl*(A) = X if A € 4% and cl*(A) = A if 
A © ¥,, This means that y*(.%,) is the well-known cofinite topology. 


Example 2.6. Let %. be the ideal of countable subsets of a space (X,7). We have 
A* = A and since A*, as mentioned earlier, is the set of condensation points of A, 
we conclude that condensation points of A in (X, 7) are precisely limit points of A 
in (X, T*(4%,)). 

If we start again with the indiscrete space (X, W), it is easy to see that ~*(.4,) is 
the cocountable topology. Let us consider another familiar example. Let (R, 7) 
be the real line with the usual topology. Then the topology 7*(.4,) is nothing other 
than the countable complement extension topology [29, Example 63]. Note that 
7*(¥%,) is the smallest topology generated by 7 U ~*(¥4); ie, the supremum 
tT V W*(F%,) of + and W*(4). 

Now, it seems that topologies generated by ideals are useful in constructing 
counterexamples. We state an important basic property of such topologies that 
justifies the previous remark. Recall that a set A in a space (X,7) is closed and 
discrete iff A¢ = 9. 


LEMMA 2.7. Let (X, 7) be a space with an ideal ¥ on X. If I = ¥, then I is closed 
and discrete in (X,T*). 


Proof [€ $> I*=¢ 717 =6.5 


Example 2.8. Let (X,7) be a space. Then it is easy to check that 4,= {AC 
X:A% = $} is an ideal on X and that A? C A*. Therefore r* = 7. (We remark that 
in general +*(.%) = riff every J © ¥ is closed in (X,7).) Moreover, Lemma 2.7 
implies that J, is the largest ideal on X with the property 7* = 7. Finally, 
A* = A®@ iff (X,7) is Ty. 


Example 2.9. Let (X,7) be a space and AC X. Observe that %(A) = 
{Bc X:B C A} is an ideal on X. We offer two examples of topologies generated 
by (A). 

Let (X, W) be the indiscrete space and p © X. Then 4(X — {p})={ACYX: 
p € A} generates a simple topology ~* = {A C X;p € A} U {¢} known as the 
particular point topology [29, Examples 8-12]. 

For a more “sophisticated” example of a topology generated by an ideal, let 
(R,7) be the real line with the usual topology and A = {1/n:n © N}. Then 
7*(.£(A)) is known as Smirnov’s topology [29, Example 64]. 

In our next example we consider the important ideal of nowhere dense sets. 


Example 2.10. Let (X,7) be a space and let %, = {A C X: Int(cl(A)) = o}. 4%, 
is the collection of nowhere dense subsets of (X, 7). It is a simple task to show that 
F, is an ideal on X and that this important ideal has a nice property [30]; namely 
the local function of a set A C X can be given “explicitly”: 


A*(¥,) = cl(Int(cl(A))). 
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Here we have that 
cl*(A) = A U cl(Int(cl(A))) 


which gives a nontrivial finer topology than 7 in general. In [21] a topology 7% has 
been introduced by defining its open sets to be sets A C X with A C Int(cl(Int(A))). 
Since closed sets in (X, 7%) are sets A C X with cl(Int(cl(A))) C A; ie., A*(4%, 7) 
C A, we have that r* = r*(4%, 7). 

We also point out that in [7] it was shown that a class of topologies [rT] on X (7 is 
a topology on X) defined as {0:0 is a topology on X and for every A C X, AC 
cl_(Int,(A)) iff A C cl,(Int,(A))} possesses a finest topology for the class, de- 
noted by F(r). It is left to the reader to establish that F(r) = 7*(4,, 7). 

Our next example is likely the most important one and it was a primary 
motivation for the introduction of a local function. First we need the following 
definition. 


Definition 2.11. An ideal ¥ is said to be a o-ideal if it is countably additive; that 
is, if J, € ¥ for eachn EN, then U{T,:n & NY} EY, 


Example 2.12. A subset of a space is said to be meager (or of first category) if it 
is a countable union of nowhere dense sets. Clearly, the collection of all meager 
subsets of a space is a o-ideal. We denote this ideal by 4%,. Since non-meager sets 
are also called the sets of second category, the points of A*(¥%,), where A is a set in 
a space, are the points of second category of A [17]. In [3] Blumberg defines a point 
x in a space to be “inexhaustibly approached” by a set A if x € A*(4,). 

In the case of #%, the ideal of nowhere dense sets in a space (X,7), we 
established that A*(.%,) = cl(Int(cl(A))). We point out that A*(.%,) has the follow- 
ing significant property: A* = cl(Int(A*)). (The reader is invited to show this by 
using A*(.%,) = cl(Int(cl(A))), Theorem 4.12 and Theorem 4.6, (b).) 


3. The Open Sets of +*. We have defined r* in terms of the closure operator 
cl*(A) = A U A*. However, a simple basis for the open sets of +* can be described. 

Let (X, 7) be a space, % an ideal on X, and observe that A is 7*-closed iff 
A* C A. Now we have UeEr* iff X—U is 1*-closed iff (X — U)* Cc 
X — U iff UC X —(X — U)*. Therefore x € U > x € (X — U)* > there exists 
a V © N(x) such that VN (X — U) € ¥. Now let J = VM (X — U) and we have 
x € V—IC U, where V € 7 and J € ¥. We shall denote 


B( 4,7) ={(V-T: Ver, TEF}, 
and, following our convention, we will simply write B(.%) or 8 when no ambiguity 
is present. These remarks suggest the following result. 


THEOREM 3.1 [30]. Let (X, 7) be a space and ¥ an ideal on X. Then B is a basis 
for T™. 


Proof. In view of the preceding remarks, we need only show that B is closed 
under finite intersections. This follows immediately from De Morgan’s laws and the 
finite additivity of %. & 


If we now return to Example 2.6 we see that the remark 7*(.4%.) = 7 V ~*(4) is 
not at all surprising in the light of Theorem 3.1. We have the following theorem. 
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THEOREM 3.2. Let (X,7) be a space and ¥ an ideal on X. Then t*(%) = 7 V 
~*(S£) where W denotes the indiscrete topology. ™ 


A question which should occur to us at this point is: Given a space (X, 7) and an 
ideal ¥ on X, we can form [r*(.%)]*(%) = 7**. We know that r** is finer than 
7™*, but is it ever true that 7r** is strictly finer than +*? The following theorem will 
lead to an answer of this question in the negative. But first, observe that if % and ¥ 
are ideals on X, then NY and ¥V Y= {J UJ:1 EF and J EY} are also 
ideals. 


THEOREM 3.3. Let (X, 7) be a space with ¥ and J ideals on X, and A < X. Then 
(a) [31] A*(4N FZ) = A*(F) UA*(F). 
(b) AX( FV J, 7) = AX(I,7*(F)) NAY, T*(F)). 


Proof. (a) is clear. To show (b) assume that x ¢ A*(%V JY, 7). Then there exists 
aUeEW(x)suchthat UN AEYVY. Let ]e€ ¥ and J EY such that UN A = 
I UJ. Because of the heredity of %, we may assume J] J = ¢. Thus we have 
(UN A)-—I=J and (UN A)-J=H=I-(U-I)NA=JEY and (U—J)N 
A=IE%>x ¢€A*(Y, T*(F)) or x € AX*(F, T*(FZ)) (x could be in J or J, but 
not both). We have shown 

(1) A*(Y, *(F)) 0 AMF, THF) CAM IV YS, 7). 

Now assume x ¢ A*(.%, 7*(¥)). This implies there exist UE W(x) and J EY 
such that (U — J) 1 A © ¥. We may assume, because of the heredity of Y, that 
J CA. Now define J = (U —J) QA, and wehaveUNA=IUJEIVJ-xE 
A*(S%V Y, 7). We have shown 

(2) AM(FV J, 1) CANS, T*(F)). 

Similarly, we have that 

(3) A*(FV J, 1) CA*(S, T*(F)). 

(1), (2), and (3) establish the result. m 


By taking %=¥Y in the above theorem, the following corollary answers the 
question about the relationship between +* and r**. 


COROLLARY 3.4. Let (X, 7) be a space and ¥ an ideal on X. Then A*(¥, 7) = 
A*(%,7*) and hence t* = r** (cf. [22]). & 


Note that it is possible to establish Corollary 3.4 by using Lemma 2.7 and 
remarks in Example 2.8. 
Theorem 3.3 also gives a straightforward proof to the following result. 


COROLLARY 3.5. Let (X,7) be a space with ¥ and JY ideals on X. Then 

(a) [26] 7*(4¥ VF) = [T*(F)N*(F) = [7*(AI"(F) 

(b) T*(FV FZ) =T*(F)V T*(F) 

(c) [26] t*(9NF) =7*(F)NT*(F). 

Proof. (a) follows from Theorem 3.3, (b). To show (b) we have r*(%V JY) = 
[7*(.£)]*(Y) by (a) and hence r*(.%V YZ) = 7*(F%) V W*(Y) by Theorem 3.2. 
Since 7*(.%) V W*(F) = T*(F)V TV w*(Y) = 7*(%)V T*(Y), by Theorem 
3.2, we have that (b) follows. (c) follows from Theorem 3.3, (a). & 


We have shown that for a given space (X, 7) and ideal ¥ on X, B 1s a basis for 
7*. If B is itself a topology, then we have B = r* (as in the case of the cocompact 
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and cocountable topologies) and all the open sets of r* are of the simple form 
V—I where Ver and J €Y¥. The following example shows that B is not a 
topology in general. 


Example 3.6. Let (X,7) be the real numbers with the “left-ray” topology, 1.e., 
T= {(-0,a):a EX} U{X, O}. Let % be the ideal of finite subsets of X. 
Consider the collection of subsets of B defined as U, — I, where U, = (— 0, n + 
1/2) and J, = {1,2,...,n}. Then U{U,- I: n © N} = X — N whichis not in B. 
Thus £ is not closed under arbitrary unions, and hence is not a topology. 

In the following section, we consider a natural condition relating + and ¥ that 
will guarantee B is a topology and hence all sets in r* will be of simple form. 


4. Compatibility of + with #%. Let us begin with the following definition. 


Definition 4.1 [22]. Let (X, 7) be a space with ¥ an ideal on X. We say the 
topology 7 is compatible with the ideal 4%, denoted 7 ~ Y, if the following holds for 
every A C X: if for every x € A there exists aU © W(x) such that VN A EY, 
then A € ¥. 


Compatible ideals are called “supercompact” in [31] and “adherence ideals” 
in [30]. 

Since ideals are usually defined as a collection of subsets of X having some 
property #, the condition 7 ~.#% then means that whenever a subset has the 
property F “locally,” it necessarily has the property “globally.” This condition may 
appear to be quite restrictive upon first consideration. However, the condition holds 
in many useful, important, and quite general situations. The next theorem gives 
sufficient conditions for tr ~ %, but first we need some preliminary definitions. 

A space is said to be hereditarily Lindeléf if every subspace has the Lindel6f 
property. Second countable spaces are hereditarily Lindeié6f but the converse is not 
true (consider (R, /*(.4,)) or (R, 7*(.4)) in Example 2.6). 


THEOREM 4.2. Let (X, 7) be a hereditarily Lindelof space and ¥ a oa-ideal on X, 
then tT ~ J. 


Proof. Let A C X and assume that for every x © A, there exists a U. € W(x) 
such that U.N A © ¥. Now {U1 A: x © A} is an open cover of A (relative to the 
subspace A) and hence has a countable subcover {V, 1 A: n © N}. Since ¥ is a 
o-ideal, A = U{V,N A: ne N})isin ¥. @ 


Example 4.3. Let (R, 7) be the real numbers with the usual topology and ¥ the 
o-ideal of subsets of Lebesgue measure zero. Clearly, tr ~ %. The topology 7*(.7 ) 
has the following nice property: 7*-Borel sets are precisely the Lebesgue measurable 
sets of t. For further details see [27]. 

The significance of compatibility is shown by the following theorem. 


THEOREM 4.4 [22]. Let (X, 7) be a space with ¥ an ideal on X. If t ~ ¥, then B 
is a topology (and, hence, B = 1%). 


We will return to the proof of this theorem after establishing some preliminary 
results. 
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THEOREM 4.5. Let (X, 7) be a space with ¥ an ideal on X. Then the following are 
equivalent: 

(a) T~ JF. 

(b) [31] If A has a cover of open sets each of whose intersection with A is in ¥, then 
A isin J. 

(c) [31] For every AC X, AN AX =H OP AES. 

(d) [31] For every A C X, A — A* EF. 

(e) For every r*-closed subset A, A — A* © F¥. 

({) For every A C X, if A contains no nonempty subset B with B C B*, then 
AES, 


Proof. (c) > (d); Let A CX, then observe that (A — A* N (A — A*)* = 9, and 
apply (c). 

(e) — (a). Let A C X and assume that for every x © A, there exists a U € W(x) 
such that UM A € ¥. Then AM A* =@ and since AU A® is 7*-closed, we 
have (A U A*) —(A UA*)* EY, and (A U A*) — (AU A*)*® = (AU AB) 
(A* U A*®**) = (A U A*) — A® =A. Therefore A € ¥. 

(d) — (f) Let A CX and assume that A has no nonempty subset B with 
Bc B*. Since A — AX € ¥, AM A* C (AN A*)* (See Theorem 4.6 (c)) and 
hence A ~™ A* = ¢. Therefore A = A — A* and AE Y. 

(f) > (d). Let A C X. Since (A — A*) N (A — A*)* = 6, A — A®* contains no 
nonempty subset B with B C B*. Hence A — A* € ¥. 


THEOREM 4.6. Let (X, 7) be a space with ¥ an ideal on X. Then the following are 
equivalent, and are implied by t ~ ¥. 

(a) For every AC X, AN AX =$6—> A®* = 6. 

(b) For every A C X,(A — A*)* = o. 

(c) For every A C X,(A 1 A*)* = A*. @ 


Theorem 4.4 will be established as a corollary to the following theorem. 


THEOREM 4.7 [22]. Let (X,7) be a space and ¥ an ideal on X compatible with +. 
A set is closed with respect to +™ iff it is the union of a set which is closed with respect 
to T and a Set in JF. 


Proof. Let A be r*-closed, then A* C A > A =(A — A*) U A*®. Now A — A* 
© % by Theorem 4.5, (d) and A® is r-closed by Theorem 2.3, (c). 

Conversely, if A = B UJ where B is t-closed and J € ¥, then A* = B* U I* 
= B* c cl(B) = BCA. Thus A* C A and A is 7*-closed. 


Theorem 4.4 is now immediate since every closed set in r* (under the hypothesis 
of Theorem 4.4 that 7 ~ #) is of the simple form BUT with B 7-closed and 
I © ¥. Thus every r*-open set will be the complement of such a set, and hence have 
the simple form U — I where U € 7 and JE Y. 

While compatibility is a sufficient condition for B to be a topology and 7* = 8B, 
the following example shows that compatibility is not a necessary condition. The 
example also shows that the equivalent conditions of Theorem 4.6 are weaker than 
compatibility. 


Example 4.8. Let (X, 7) be the real numbers with the discrete topology and ¥, 
the o-ideal of countable subsets of X. Then we have rt = B = r* but 7 is not 
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compatible with 4%. For example, X is certainly locally countable at each point but 
X is not countable. 
In this space, A* = @ for every A C X. Thus condition (a) of Theorem 4.6 holds. 


Remark 4.9. The previous example points out an error in [31] where the author 
claims condition (a) of Theorem 4.6 is equivalent to compatibility. We point out, 
however, that if an ideal satisfies ““A* = ¢ > A © ¥” for every A C X (called 
“compactness” in [31]) then the conditions of Theorem 4.6 are equivalent to 
compatibility. Instead of “¥% is compact” we will say that 7 is weakly compatible 
with #%, denoted t ~ %. To see how strong ~ is compared with ~ , note that in 
the case of ¥,, 7 ~ ¥, iff (X, 7) is countably compact [31]. It is claimed in [30] that 
for T, spaces, 7 is never compatible with 4%, except in the trivial case when X is 
finite and hence ¥ = A(X). This is not correct, however, since t ~ 4 iff (X, 7) is 
hereditarily compact. The situation is more interesting in the case of 4%. We obtain 
that + ~ ¥. iff (X, 7*) is x,-compact (a space is x,-compact iff every uncountable 
set in a space has at least one limit point). Note that (X, 7) being Lindelof > 7 ~ ¥, 
— (X, T) iS %,-compact. 

The case tr ~ ¥, is important since it gives the following converse of Theorem 
4.2. 


THEOREM 4.10. A space (X, 7) is hereditarily Lindelof iff t ~ J. 


Proof. We have only to show that the condition 1s sufficient. Suppose that (X, 7) 
is not hereditarily Lindel6f. Then there exists an uncountable set A C X such that 
for each x € A there exists aU € W(x) with UN A countable [18]. This simply 
means there is an A ¢ ¥, with A  A*(4) = 9. This contradicts t ~ %, (Theorem 
4.5, (c)). # 


Let us return now to Example 2.10 to establish that all the closed (and open) sets 
in (X, 7*(¥%,)) are of simple form. 


THEOREM 4.11. Let (X, 7) be a space and %, the ideal of nowhere dense subsets of 
X. Then t ~ F%,. 


Proof. We will show that condition (c) of Theorem 4.5 4s satisfied. Assume that 
AN A*(%,) = 6 > AN cl(Int(cl(A))) = ¢ — AN Int(cl(Int(cl(A)))) = ¢ > A 
AM Int(cl(A)) = @ > cl(A) A Int(cl(A)) = ¢ > Int(cl(A)) = 6. & 

A more difficult task is to establish that 7 ~ 4%. For the proof, we refer the 
reader to [23] (Theorem 16.1 of [23] is equivalent to tT ~ #,). 


THEOREM 4.12. (Banach Category Theorem). Let (X,7) be a space and 4%, the 
ideal of meager subsets, then tT ~ 4. & 


Remark 4.13. We conclude this section by noticing that compatibility and weak 
compatibility are properties shared by 7 and r*, namely, 7 ~ ¥ (resp., T ~ #) iff 
7* ~ SF (resp., T* ~ F£) (cf. [22]). 


5. Ideals containing %,. We have observed in §2 that if {x} © for each 
x © X, then A* = A” for every A C X. Since the converse is obvious, we state the 
following theorem, for reference. 


THEOREM 5.1. Let (X,7) be a space with ¥ an ideal on X. Then {x} © # for 
each x © Xiff A* = A” for every AC X.@ 
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Now, recall that a subset A of a space (X, 7) is dense-in-itself if A C A“; that is, 
A is without isolated points. A set which is dense-in-itself and closed is said to be 
perfect. We need this concept to state an old result proved by Cantor and Bendixson 
independently in 1883 for subsets of the real line. 


THEOREM 5.2 (Cantor-Bendixson). A second countable (moreover, hereditarily 
Lindelof ) space can be represented as the union of two sets, one of which is perfect and 
the other countable. & 


Let us see now the proof of this theorem in a general setting. 


THEOREM 5.3 [9]. Let (X, 7) be a space with an ideal ¥ on X such that t ~ ¥ and 
{x} © ¥F for each x © X. If a set A C X is closed with respect to t*, then A is the 
union of a set which is perfect with respect to t and a set in ¥. 


Proof. Let A ¢X be closed with respect to 7r*, then by Theorem 4.5 (d), 
A=A*UI with TES (and A* NI =¢). Since A* — A** C (A — A*)* by 
Theorem 2.3, (f) and (A — A*)* = $ by Theorem 4.6 (b), we have A* = A**. By 
Theorem 5.1, A* = (A*)* and consequently A* ¢ (A*)4. So, A* is perfect with 
respect to 7 and the result follows. @ 


To see that Theorem 5.3 generalizes the Cantor-Bendixson Theorem, set A = X 
and %= ¥, in Theorem 5.3. 

Recall that a set A is discrete if A (\ A?’ =. The collection of all discrete 
subsets of a space (X,7) fails to form an ideal on X; however, there is a larger 
collection that does form an important ideal. A set A C X is said to be scattered if 
A contains no nonempty dense-in-itself subset. The proof that the collection of 
scattered sets in a 7, space is finitely additive is in [17]. For further properties of 
this ideal, denoted by 4, see [30] and [24]. We point out that discrete sets are 
scattered while the converse is not true in general. In Lemma 2.7 we established that 
sets in % are closed and discrete in (X,7*(.%)) and hence are scattered in 
(X, 7*(#)). What is a condition that forces scattered sets of (X,7*) to be in 4? 
Obviously, {x} is discrete, and hence scattered in (X,7*) for each x € X, and is 
not necessarily in %. Consequently, we need {x} © ¥# for each x © X. The 
additional condition is, of course, compatibility. 


THEOREM 5.4. Let (X,7) be a space and ¥ an ideal on X. The following are 
equivalent. 

(a) T~ % and {x} © F for eachx € X. 

(b) Scattered sets in (X,7*) are in F¥. 

(c) Discrete sets in (X,7*) are in F. 


Proof. (a) — (b). Let A C X, then A* = A” by Theorem 5.1 and (b) follows by 
Theorem 4.5, (f). 

(b) > (c). Clear. 

(c) > (a). Obviously, {x} © ¥ for each x € X. Let A CX, then A — A® is 
discrete in (X, T*) by Theorem 5.1 since (A — A*) MN (A — A*)* = 6. Now, Theo- 
rem 4.5, (c) implies t ~ 4. 


Combining Theorem 5.4, Remark 4.9, and Remark 4.13, we have that a space 
(X,7) 1s hereditarily compact iff CX, T™(F)) is hereditarily compact iff CX, T™(F)) 
has no discrete infinite subspace. 
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In a similar way, we obtain the following result. 


COROLLARY 5.5. Let (X,7) be a space and ¥%, be the ideal of countable subsets of 
X. The following are equivalent. 

(a) (X, 7) is hereditarily Lindelof. 

(b) (X, 7*(.4%,)) is hereditarily Lindeléf. 

(c) If I is scattered in (X,7*), thenI € F. 

(d) If I is discrete in (X,7*), thnI € %.. & 


As a consequence of Corollary 5.5, we have for instance, that (R,7*(.4)) 
(Example 2.6) is a hereditarily Lindeléf space and that scattered sets in (R, T*(.4,)) 
are precisely the countable subsets of R. As an application of Corollary 5.5 we show 
that 7, P-spaces with discrete subspaces countable are hereditarily Lindelof (cf. 
[25]). First, recall that a space in which the intersection of a countable collection of 
open sets is open, is called a P-space. It is clear that in a 7, P-space (X, 7), 
IF, C F,,. Therefore r*(.4,) = 7*( 44) = 7. If we assume that discrete subspaces of 
(X, 7) are countable, (X, 7) is hereditarily Lindel6f by Corollary 5.5, (d). 


6. Spaces for which X = X*. An open subset U of a space (X, 7) is said to be 
regular open if U = Int(cl(U)). The regular open sets in (X, 7) form a basis for a 
new topology on X, known as the semiregularization of 7, denoted 7,. The topology 
7, 1s coarser than 7, and 7 is said to be semiregular if + = 7,. 

Given a space (X, 7) and an ideal .¥ on X, a hypothesis used by Hayashi [12] is 
that X = X* and a hypothesis used by Samuels [26] is 77 .%= {o}. These 
equivalent conditions imply 7, = (7*),, as we shall see. 


THEOREM 6.1. Let (X,7) be a space with ¥ an ideal on X. The following are 
equivalent. 

(a) X = X*, 

(b) TA F= {4}, 

(c) If I © ¥, then Int(I) = ¢, and 

(d) For every UE 7, UC U*. & 


Since X*(.%, 7) = X*(.%, r*) by Corollary 3.4, we may replace t by 7* in (b) of 
Theorem 6.1, “Int(A) = 6” by “Int *(A) = $” in (c) and “U Er” by “UE +*” 
in (d). 

The condition X = X* is satisfied in the case of the ideal .%, of nowhere dense 
sets in (X, 7). Also, it is clear that X = X* in the case of (R, 7*(.4,)) (Example 
2.6). To see the importance of X = X* consider the ideal of meager sets in (X, 7). 
Recall that a space (X,7) is said to a Baire space if the intersection of every 
countable family of open dense sets in (_X, 7) is dense. Now, it is not difficult to see 
that ( X, 7) is a Baire space iff X = X*(¥4,). 

In order to prove the promised result that (X,7) and (X,7*) have the same 
semiregularizations if X = X*, we need the following folklore lemma whose proof 
we omit. 


LEMMA 6.3. Let t and o be topologies on X and + Co. If cl (V) =cl ,(V) for 
every V € o, then 7, = 0,. & 


THEOREM 6.4. Let (X, 7) be a space with an ideal ¥ on X such that X = X*. Then 


1, = (1), 
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Proof. Let V € r*. From remarks following Theorem 6.1 it is clear that, V c V* 
and hence cl(V) C V™. This gives cl(V) ¢ cl*(V) and the result follows. & 


COROLLARY 6.5. Let (X, 7) be a space with an ideal ¥ on X such that X = X*. If 
(X,7*) is semiregular, then t = 7*. 


Proof. Observe that r* = (7*),=7,C7C 7%. & 


Let (X, 7) be a space with an ideal ¥ on X and A C X. Then ¥,= {ANTI € 
F } is obviously an ideal on A. The following easily proved lemma will be useful in 
generalizing Theorem 6.4. 


LEMMA 6.6. Let (X,7) be a space with an ideal ¥% on X and A C X. Then 
(1/A)*( 44) = T*(F)/A. 


THEOREM 6.7. Let (X,7) be a space with an ideal ¥ on X and A C X. If A C A®*, 
then (1/A), = (1*/A),. 


Proof. We will show that (7/A) 0 4%, = {}. Suppose that U € r/A ON ¥,. Then 
U € ¥ and there is a V © t such that U = V £2 A. This implies (VM A)* = @ and 
by Theorem 2.3, (g), VM A* = o. Since A C A*, U= 9. Hence 7/A NY, = {6} 
and the result follows by Theorem 6.1, Lemma 6.6, and Theorem 6.4. & 


The following easily proved corollaries also give sufficient conditions for the 
restrictions of rt and r* on a subset to have the same semiregularizations. 


COROLLARY 6.8. Let (X,7) be a space with an ideal ¥ on X and AC X. If 
A* = A**, then (1/A*), = (1*/A*),. & 


COROLLARY 6.9. Let (X, 7) be a space with an ideal ¥ on X such that X — X* € #. 
Then (1/X*), = (t*/X*),. & 


It was observed in [5] that some important topological properties are shared by 
(X, 7) and (X,7,). Some of these properties, so called semiregular properties, are: 
Hausdorffness, property of a space being Urysohn, connectedness, and extremal 
disconnectedness [5]. The list of semiregular properties is more extensive and it is 
known that H-closedness, light compactness, pseudocompactness and some other 
properties are semiregular. The following corollary now follows immediately from 
Theorem 6.4. 


COROLLARY 6.10. Semiregular properties are shared by (X,7) and (X,7*) if 
X = X*. 5 


The concept of @-continuous function [8] is closely related to that of semiregular- 
ization. A function f:(X, 7) — (Y, a) is said to be 6-continuous at a point x € X if 
for every V € W,(f(x)) there exists a U € W.(x) such that f(cl(U)) € ch(V). A 
function f: (X,7) — (Y,o) is @-continuous if f is 6-continuous at x for every 
x € X. The concept of 6-continuity is weaker than that of continuity as the identiy 
function i: (X,7,) > (X,7) for a nonsemiregular space (X,7) shows. But if the 
range of a 6-continuous function is regular, the function is continuous. It is not 
difficult to show that f: (X,7) — Y is 8-continuous iff f: (X, 7,) ~ Y is @-continu- 
ous. Hence, the following results. 


THEOREM 6.11. If X = X*, then f: (X,7) > Y is 6-continuous iff f: (X,7*) > Y 
is @-continuous. & 
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COROLLARY 6.12 [26]. Jf X = X* and Y is regular, then f: (X,7)— Y is 
continuous iff f: (X,7*) — Y is continuous. @ 


7. Some Applications. 

A. Spaces in which Compact Sets are Finite 

The class of spaces in which compact sets are finite has been a subject of some 
interest in recent years. Clearly, “The general problem is to move as far away from 
discreteness (as possible) and yet to retain the property that only finite sets be 
compact” [28]. Open problems in this area can be found in [1]. 

Spaces in which compact sets are finite have been called cf-spaces in [19], 
pseudofinite in [32], and recently, anticompact in [1]. As pointed out in [1], an 
important source of anticompact examples are the MI spaces of Hewitt [13]. A 
space is said to be MI if it is dense-in-itself, Hausdorff, and submaximal in the sense 
of Bourbaki [5], i.e., every dense subset is open. In [15] it was shown that MI spaces 
are anticompact and this result was improved in [28] where it was shown that 
dense-in-themselves Hausdorff spaces with the property that sets with dense interi- 
ors are open are anticompact. If we rephrase the last result we have that dense- 
in-themselves Hausdorff spaces with nowhere dense sets closed are anticompact. 

We will show how to obtain an anticompact space by starting with a Hausdorff 
space and generating a finer topology via an appropriate ideal. Besides %, and ¥%, 
the ideal of relatively compact sets %, = {A C X:cl(A) is compact in (X, 7)} will 
play an important role. Note that a Hausdorff space (X, 7) is anticompact iff + = 
7*(¥#,). Our next result provides a rich source of anticompact spaces. 


THEOREM 7.1. Let (X,7) be a Hausdorff space. Then (X,7*(%.0 4% 1 4,)) is 
anticompact. 


Proof. Let A be an infinite subset of X. Suppose that A is compact in (X, 7”). 
Since (X, 7) is Hausdorff, there exists a countable subset B of A which is discrete in 
(X, 7). It is clear that B € % 1 %. Since A is compact in (X,7*), A is compact in 
(X, 7), and hence closed in (X, 7). So, B € ¥, and consequently, BE ZAAN 4. 
By Lemma 2.7, B is closed and discrete in (X, 7*). This contradicts the assumption. 
Therefore (X, 7*) possesses no infinite compact set. @ 


We point out that 7*(4.N 40 4,) = 7*(%) 0 7*(4) 10 7*(4,) (Corollary 
3.5, (c)). Therefore these three topologies are anticompact. Clearly, for 7*(.%,) it is 
not necessary to assume that (X, 7) is Hausdorff. The fact that +*(4%,) is anticom- 
pact implies that a space (X, 7*) is anticompact if ¥ is a o-ideal on X and {x} €# 
for each x © X [20], as well as that 7, P-spaces are anticompact [1]. 

We remark that first countable 7, anticompact spaces are discrete and that 
locally compact Hausdorff anticompact spaces are also discrete. More generally, 
Hausdorff anticompact k-spaces are discrete [1]. 

We now use Theorem 7.1 to offer a general result. 


THEOREM 7.2. Let (X,7) be a Hausdorff space with an ideal ¥ on X. If 1 ~ # 
and {x} © ¥ for each x © X, then (X, t*(.F)) is anticompact. 


Proof. By Theorem 5.4 and Lemma 2.7, % = 4%(7*), where 4%(7*) denotes the 
ideal of scattered sets in (X, 7*). Since r** = 7* (Corollary 3.4), the result follows 
by Theorem 7.1. @ 
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Now, we return to the MI spaces. Observe that in a dense-in-itself 7, space 
scattered sets are nowhere dense [17]. As a consequence of Theorem 7.1 we have the 
following. 


COROLLARY 7.3. Let (X, 7) be a dense-in-itself Hausdorff space. Then (X, t*(F, 
1%, £,)) is anticompact. & 


It is left to the reader to establish the previously mentioned result that “dense- 
in-themselves Hausdorff spaces with nowhere dense sets closed are anticompact’’ is 
logically equivalent to the following corollary. 


COROLLARY 7.4. Let (X, 7) be a dense-in-itself Hausdorff space. Then (X, T*(¥%,)) 
is anticompact. @ 


B. Continuity with respect to t™. 

If f: (X, 7) — Y 1s acontinuous function and ¥ is an ideal on X, it is clear that 
f: (X,7*) > Y is also continuous. The converse is not true as the following 
example shows [14]. 


Example 7.6. Let (R, 7) be the real line with the usual topology, o, Smirnov’s 
topology (Example 2.9), and i: (R,7)— (R,o) the identity function. Then i: 
(R, 7*(4,)) > (R,@) is continuous since A = {1/n:in € N} © ¥, while i: (R, 7) 
— (R, a) is not. This example also shows that the assumption of regularity of Y in 
Corollary 7.8 is essential. 


THEOREM 7.7. Let f: (X,7) — Y be a function and ¥ an ideal on X. If f: 
(X,7*) — Y is 6-continuous, then f/X*: (X*, 71/X*) — Y is 0-continuous. 


Proof. First note that by Theorem 2.3, (g), UN X* = UN (UN X)* = UN U* 
for every U € tr. Now let x © X* and VE W(f(x)). Since f: (X,7*) > Y is 
6-continuous, there exists a basic open set U — J in (X,7*) containing x, where 
Uer and J €¥, such that f(cl*(U-—J)) C cl(V). This gives f((U — J) U 
(U —I)*) ¢ ch(V) and consequently f(U*) Cc cl(V). Since clye(UM X*) = 
c(UN X*) =ch(UN U*) C U*, f(clyx(UN X*)) C chV). Therefore f/X* is 
é-continuous. 


From the remarks following Corollary 6.10 the following corollary (Lemma, [14]) 
is immediate. 


COROLLARY 7.8. Let f: (X, 7) — Y be a function, with ¥ an ideal on X, and Y 
regular. If f: (X,7*) — Y is continuous, then f/X*: (X*, 71/X*) — Y is continuous. 
a 


Our next result is in [14] for o-ideals. 


THEOREM 7.9. Let f: (X,7) > (Y,0) be a function, with ¥% an ideal on X, 
X — X* € ¥ and Y regular. The following are equivalent: 

(a) f: (X, 7*) > Y is continuous, 

(b) f/X*: (X*, r/X*) — Y is continuous, and 

(c) f- '(V) = U—I with U € + and I © F for every V € o (i.e., f: (X, B(Y, T)) 
— (Y, a) is “continuous’’). 
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Proof. (a) > (b) by Corollary 7.8. It also follows by Corollary 6.9. To show 
(b) — (c) notice that 


f(F) = (fF) 9 X*) (fF) 9 (X — X*)). 


If F is closed in (Y, 0), then f-'(F) N X* = (f/X*)7\(P) is closed in (X*, 7/X*) 
and consequently in (X,7), and since f-'’(F)N(X-— X*)E 4%, f '(F)=PUTI 
with P closed in (X, 7) and I © ¥. This is clearly equivalent to (c). Since (c) > (a) 
is obvious, the proof is complete. 


Since the compatibility of r and ¥% implies X — X* €.%, Theorem 7.9 is valid if 
“xX — X* © ¥” is replaced by “7 ~ %.” This then implies one of the main results in 
[14]. 


COROLLARY 7.10. Let f: (X, 7) > (Y, 0) be a function. with Y regular and ¥ a 
o-ideal of X. If either of the following holds: 

I. (X, 7) is hereditarily Lindelof, or 

II. #% is the o-ideal of meager sets in ( X, 7). 

Then the following are equivalent: 

(a) f/Xpo is continuous for some closed X, © X with X — X) € F, 

(b) f-\(V) = U-I with U E71 andI © F for every V Ea. 


Proof. Note that in both cases, tT ~ ¥. 


REFERENCES 


1. P. Bankston, The total negation of a topological property, I/linois J. of Math., 23 (1979) 241-252. 
2. H.D. Block and B. Cargal, Arbitrary Mappings, Proc. Amer. Math. Soc., 3 (1952) 937-941. 
3. H. Blumberg, New properties of all real functions, Trans. Amer. Math. Soc., 24 (1922) 113-128. 
4. , Arbitrary point transformations, Duke Math. J., 11 (1944) 671-685. 
5. N. Bourbaki, General Topology, Addison-Wesley, Mass., 1966. 
6. B. Cargal, Generalization of continuity, Proc. Iowa Acad. Sci., 60 (1953) 477-481. 
7. §S.G. Crossley and S. K. Hildebrand, Semi-topological properties, Fund. Math., 74 (1972) 233-254. 
8. §S. V. Fomin, Extensions of topological spaces, Ann. of Math., (2) 44 (1943) 471-480. 
9. G. Freud, Ein beitrag zu dem sdtze von Cantor and Bendixson, Acta Math. Hung., 9 (1958) 
333-336. 
10. H. Hashimoto, On some local properties on spaces, Math. Japonica, If (1952) 127-134. 
11. , On the “topology and its application, Fund. Math., 91 (1976) 5-10. 
12. E. Hayashi, Topologies defined by local properties, Math. Ann., 156 (1964) 205-215. 
13. E. Hewitt, A problem in set theoretic topology, Duke Math. J., 10 (1943) 309-333. 
14. J. Kaniewski and Z. Piotrowski, Concerning continuity apart from a meager set, Proc. Amer. Math. 
Soc., 98 (1986) 324-328. 
15. M.R. Kirch, A class of space in which compact sets are finite, Amer. Math. Monthly, 76 (1969) 42. 
16. K. Kuratowski, Topologie I, Warszawa, 1933. 
17. , Topology, Vol. I, Academic Press, New York, 1966. 
18. K. Kuratowski and W. Sierpinski, Le théoréme de Borel-Lebesgue dans la théorie des Ensembles 
Abstraits, Fund. Math., 2 (1921) 172-178. 
19. N. Levine, On the equivalence of compactness and finiteness in topology, Amer. Math. Monthly, 75 
(1968) 178-180. 
20. N. F. G. Martin, Generalized condensation points, Duke Math. J., 28 (1961) 507-514. 
21. O. Njastad, On some classes of nearly open sets, Pacific J. Math., 15 (1965) 961-970. 
22. , Remarks on topologies defined by local properties, Avh. Norske Vid.-Akad. Oslo I( N.S), 
8 (1966) 1-16. 
23. J. C. Oxtoby, Measure and Category, Springer-Verlag, 1980. 
24, , The kernel operation on subsets of a 7,-space, Fund. Math., 90 (1976) 205-224. 


310 DRAGAN JANKOVIC AND T. R. HAMLETT 


25. I. L. Reilly and M. K. Vamanamurthy, On hereditarily Lindeléf spaces, Bull. Austral. Math. Soc., 
21 (1980) 357-362. 

26. P. Samuels, A topology formed from a given topology and ideal, J. London Math. Soc., (2) 10 
(1975) 409-416. 

27. §. Scheinberg, Topologies which generate a complete measure algebra, Advances in Math., 7 (1971) 
231-239. 

28. P. L. Sharma, A class of spaces in which compact sets are finite, Canad. Math. Bull., 24 (3) (1981) 
373-375. 

29. L.A. Steen and J. A. Seebach, Jr., Counterexamples in Topology, Springer-Verlag, New York, 1978. 

30. R. Vaidyanathaswamy, Set Topology, Chelsea Publishing Company, 1960. 

31. , The localization theory in set-topology, Proc. Indian Acad. Sci., 20 (1945) 51-61. 

32. A. Wilansky, Topology for Analysis, Ginn, Mass., 1970. 


LETTERS TO THE EDITOR 


Editor: 


I have read the letter by Peter F. Zoll in Vol 96, number 8, p. 709, of this 
MONTHLY. I would like give a partial answer to Mr Zoll’s questions. Regarding the 
alleged burning of Valmes in 1486 by Tomas de Torquemada because he had solved 
the general quartic equation, I have found no reference concerning either Valmes or 
his alleged punishment. However, this fact seems contradictory to known historical 
documentation, namely, that modern Algebra (Italian and German schools) was 
introduced into Spain circa 1552 by the German scholar Marc Aurel, [4], [7] p. 97. 
On the other hand, Zoll’s letter gives the impression that there were no Spanish 
mathematicians to speak of in that period. It is this point I want to address in this 
letter. 

Concerning the development of Algebra in Spain during the XVI century, one 
can come up with two names associated with highly original contributions, namely: 
Fray Juan de Ortega, whose approximation methods for square and cube roots have 
been commented on in [1], [4] and mainly by Rey Pastor in [7], who really shows a 
grasp and understanding of Ortega’s method. The Portuguese mathematician Pedro 
Numnez’s celebrated solution to the problem of the loxodromic curve made him 
famous in a world concerned with scientific navigation. Nunez was a professor at 
Salamanca, Spain, and after Aurel, was the author of a celebrated treatise on 
Algebra, written in Spanish, that contained many original views on the modern 
Algebra of the time. Other algebraists of the time that were mentioned in [4], [6], 
and [7] are J. Pérez de Moya and A. Rocha. 

It is, however, in the field of infinitesimal Arithmetic and Mechanics that the 
Iberian mathematicians excelled during the XVI century. The two names that seem 
to be at the forefront of the Iberian school of Paris-Salamanca-Alcala de Henares 
are Alvaro Thomas and Fray Domingo de Soto. A discussion of the achievements of 
this school can be consulted in [3], but are mainly in [8] and [9]. 

According to Pierre Duhem, [3] pp. 263-562, “Dominique de Soto et la escolas- 
tique parisienne,” the science of the so-called Spanish school of Paris was the step 
that allowed the early Mechanics of the masters of Merton College (Oxford, circa 
1350), Thomas Brawardine, Suiseth, etc., to mature into the views of Galileo and 
Galileo’s Science. With reference to this particular point, see [2] chap. 11, [8] and [9]. 

Domingo de Soto, Dominican priest, Spanish mathematician, physicist and 
philosopher, virtuoso “calculator,” educated in Paris and later professor at Sala- 
manca, was the first scientist to describe the law of the falling bodies as a uniformly 
accelerated motion. His treatise “Super octo libros physicorum questiones,” Sala- 
manca 1555, earlier editions: 1551 and 1545, see Wallace [8] and [9], also M. Clagett 
[2] and Duhem [3], contains this result commonly attributed to Galileo. Pierre 
Duhem, [3], was the first to rectify this mistake. Domingo de Soto has been 
described as the central figure among the Parisian “calculatores” of the XVI century 
and the key link in the school whose output ultimately led to the Mechanics 
developed by Galileo, [3]. Concerning the diffusion of the works of the Spanish 
masters of Paris-Salamanca, Wallace points out in [8] p. 232 “More important, 
perhaps the works of the two Spanish Jesuits, Francisco Toledo and Benito Pereyro, 
the first of whom surely studied under Soto, were already widely diffused through- 
out the Italian peninsula. Spain’s pre-eminence as world power during the sixteenth 
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century assured the diffusion of her books, as well as the exportation of her 
scholars. Galileo in his student notebooks mentions the works of Soto, Toledo and 
Pereyro, so these must have been known at least to his professors...” 

Regarding the school of the Iberian mathematicians of Paris and Salamanca one 
can safely say: “They lay down the axiomatic and the formalization of the Calculus 
brought up by Brawardine and the masters of Merton College to study problems of 
motion.” The infinitesimal character and essentially modern scope of this Calculus 
has been pointed out by many scholars, among them, H. Wieleitner, [10], and Julio 
Rey Pastor [7]. The names associated with this school are: Martinez Siliceo, Pedro 
Ciruelo, Alvaro Thomas, Juan de Celaya, Luis Nufiez Coronel and Diego Diest, to 
mention a few that achieved great fame. Perhaps the Portuguese Alvaro Thomas was 
the most conspicuous figure of this school. His academic life seems to have 
developed in Paris, where he wrote his treatise “Liber de triplici motu proportion- 
ibus annexis Magistri Alvari Thome Ulixbonensis philosophicas Suiseth calcula- 
tiones ex parte declarans,” Paris 1509. Among the novelties introduced by Alvaro 
Thomas we find a theory of series, which had very much the character of what we 
call today power series. The problems of convergence and divergence were clearly 
stated in the typical cases studied by Thomas. H. Wieleitner has shown in [10] 
Alvaro’s mathematical originality and virtuosity in his approach to the theory of 
numerical series. Thomas was perhaps one of the first scholars of this school to state 
a general mean value theory in the context of theory of motion, (for the case of a 
motion more general than the uniformly accelerated one) [3], [10]. Dr. Julio Rey 
Pastor has stated in [7] that Thomas deserves together with Oresme a position of 
honor in the history of Calculus. 

I like to close this letter by commenting on D. Alonso de la Veracruz (the father 
of Mexican philosophy) who wrote “Physica speculatio,” in 1557 in Mexico, 
intended to be used at the University of Mexico (founded in 1553). This treatise was 
very much inspired by the work of Soto and constitutes an early Hispanic American 
contribution to the development of Calculus and modern Mechanics, see [8] for 
further details. 
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Editor: 


I am writing in response to the article “The effect of prior calculus experience on 
introductory college calculus” by M. B. Burton which appeared in the April issue of 
the American Mathematical Monthly. In that article students in the introductory 
calculus course at the University of New Hampshire were surveyed concerning their 
high school calculus experience by appending an informational question to their 
final college calculus exam. 

It is a well-established principle in statistical science that randomized experi- 
ments should be used in determining cause and effect relationships. Otherwise, 
underlying confounding factors may be the cause of the perceived effect. In this 
survey it may well have been that students who chose to study calculus in high 
school had a propensity to do better in college calculus with or without their high 
school calculus experience. This may have been due to a variety of reasons including 
the possibility that better students come from high schools which offer calculus 
courses or that better students choose to take calculus in high school. 

It is mentioned in the article that the students with high school calculus 
experience and the students without high school calculus experience had roughly the 
same quantitative SAT scores. This observation does not justify the conclusion that 
high school calculus experience has a positive effect on grades in college calculus. 
Even if quantitative SAT scores are a good predictor of performance in college 
calculus the remaining variation amongst students with equal SAT scores could 
again be due to some confounding factor such as those described above. 

In summary, although it is possible that high school calculus experience has a 
positive effect on grades in college calculus, one cannot make such a conclusion 
based on a survey of this type. 

Sincerely, 


Tim Swartz 
Dept of Math and Stats, SFU 
Burnaby, B.C. Canada V5A1S6 


Editor: 


I have been wondering if you and the membership are prepared for the coming 
change. 

The change I am referring to is a change in name. Our Association will soon 
change its name to Math Association of America. Some of us will soon be members 
of the American Math Society, some to the Society for Industrial and Applied 
Math. That’s Math, not Mathematics. 

Do you think I exaggerate? Recently an article appeared in the Boston Globe 
about mathematics which did not use the word “mathematics” once, just “math.” A 
book publisher’s advertisement in this MONTHLY recently referred to the American 
Math Society. Look carefully at a recent issue of UME Trends. Listen to your 
students. Probably you can all supply many examples. 

Do you find this disturbing? Alarming? Do you care? 

Anyone interested in joining MATHEMATICIANS AGAINST MATH? 


R. E. Worth 
Georgia State University 
Atlanta, Georgia 30303 


UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N IN4. 


A Guessing Game of Bill Sands, 
and Bernardo Recaman’s Barranca 


RICHARD K. Guy 


Bill Sands devised a neat little problem which was used in the Alberta High 
School Mathematics Scholarship Examination, 87-02-10 [4]. 

I think of a number from 1 to 12, and you are allowed four guesses, to each of 
which I shall give you the correct one of the following three responses: 

(a) you’re exactly right, 

(b) you’re near (meaning “within 3’’), 

(c) youw’re far (otherwise). 

The last guess must give the right answer: it’s not sufficient to be able to say, “Yes, 
now I know the answer.” 

At first glance this looks easy: with three outcomes for each of four guesses, the 
number of responses, 3*, is many times more than the number of answers, 12. But 
the responses are very interdependent, and only a few can occur. We'll wait till the 
end before giving an answer, in case you want to try it out for yourself. 

It’s reminiscent of the famous counterfeit coin problem (12 pennies with one dud, 
see [1, 3], for example) but here you can’t find an “unconditional” solution: later 
guesses have to depend on feedback from earlier ones. However, for the pennies it’s 
possible to state ahead of time just which coins will be placed in each pan of the 
balance at each weighing. 

The reason for mentioning the problem here is to ask for the general result: 


How many guesses are needed to determine a number chosen from 1,2,..., N if 
you are told that each guess is either correct, or within k, or not? 


If k > N — 1, then N guesses may be needed if your guesses are constrained to 
the numbers 1,2,..., NV, but if not, or in any case if (N — 1)/2 < k < N — 2, then 
some economy is possible, e.g., the responses to an initial guess of 1 partition the 
population into the sets {1}, {2,...,k + 1}, {k + 2,..., NM} and it might seem that 
max{k, N — k — 1} further guesses could be needed. But if this max is k, i.e., if 
k > (N — 1)/2, further economy follows from the wrong guess of N: the response 
indicates which of the two sets {2,..., N—k-—1} and {N—k,...,k +1} con- 
tains the number. When the cardinality of the surviving candidates is small 
compared with k (as it must eventually become) it may still pay to “waste” a guess 
in order to halve the remaining possibilities. The problem is to quantify this. 
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If this problem isn’t sufficiently general, then answer the problem with four or 
more possible responses, e.g., with “within k,, or within k,” in place of “within k.” 
And after Ivan Niven’s delightful article [2] we can ask the variants: 


What if the responder lies exactly once? At most once? 


Bernardo Recaman, while visiting schools in Colombia, has been asked more 
than once for a game which could stimulate children to practice their multiplication 
tables, so he invented the game of Barranca. The name is short for Barrancabermeja, 
a town on the Magdalena River in central Columbia; it also means “deep hollow,” 
“ravine,” or “precipice.” Barranca is played with sixteen cards, numbered 
1,2,...,16. Two players alternately choose a card, until each has eight. The winner 
is the one who has a (sub)set of numbers whose product is 220, or, if neither player 
has that product, then the winner is the one who can display a product nearest to 
220. It is possible for ties to occur, but it seems likely that, with best play, the first 
player has a winning strategy. Is there a “strategy stealing” argument which proves 
this? Recaman doesn’t know of any simple strategy. 


Can the first player win? If so, how? What happens if there are three (or more) 
players? Are there better numbers to use in place of 16 and 220? 


Here’s a solution to the Bill Sands problem: make the first two guesses 6 and 8. 
After the responses “near, near,” “near, far,” “far, near,” and “far, far,” ‘guess 
respectively 5 (or 9), 3 (or 4), 10 (or 11), and 1 (or 2). 
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Polynomials With Large Numbers of Prime Values 


BETTY GARRISON 
Department of Mathematical Sciences, San Diego State University, San Diego, CA 92182 


Let f(x) denote a polynomial with integer coefficients, and let G(f) be the 
largest integer G such that G divides |f(x)| for all integers x. Bouniakowsky 
conjectured [1] in 1857 that if f is nonconstant and G(f) = 1, then there exist 
infinitely many integers x such that |f(x)| is prime. The case where f has degree 
one is known today as Dirichlet’s theorem on primes in arithmetic progressions. At 
present it remains unknown whether there exists even a single polynomial of degree 
greater than one which is prime for infinitely many integer values x, but evidence is 
accumulating in the literature to indicate that this is very likely to be the case. 
Sierpinski [4] has shown that for every natural number m there exists a natural 
number n such that x* + n is prime for more than m distinct natural numbers x. 
Iwaniec [3] proved that if g(x) = ax? + bx + c is an irreducible polynomial with 
integer coefficients such that a > 0 and c = 1 (mod 2) then there exist infinitely 
many integers x such that g(x) is either prime or a product of two primes. Others 
have obtained results similar to Iwaniec’s (but a little weaker) for polynomials of 
any degree > 2. 

In this note we first extend Sierpinski’s result to polynomials of degree k > 2 and 
then to polynomials x* + a”, where a is a natural number. 


THEOREM 1. Let k > 2 be an integer and let M be an arbitrarily large real number. 
Then there exist positive integers c, d such that x® + c, likewise x* — d, is prime for 
more than M positive integers x. 


Proof. Let [x] denote the largest integer < x, and let 7(x) denote the number of 
primes in the interval [1, x]. From the prime number theorem, a(x) ~ x/In x, and 
it follows that for every ¢ in the interval (0,1) there exists y(e) such that 7(x*) > 
(1 — e)x*/k In x for every x* > y(e). The proof of the theorem will depend on 
showing that for each pair k, M there exists an interval [1, J] containing more than 
MJ representations p — n’ for distinct primes p and associated positive integers 
n,. This will imply that there is at least one integer c in [1, J] equal to more than M 
of these representations. In order to accomplish this, choose a fixed ¢ in (0,1) and a 
positive integer N such that (1) N¥* > y(e), (2) (1 — &)N/k(k + 1)In N > M, and 
(3) N > 2* — k. Then for every prime p in [1, N“] there exists a positive integer 
n, =[p’/*] < N such that n* < p < (n, + 1)*. Hence 


k ~1 k-2} | k k 
p—nk<(n,+1)°—nk <knk! +4 nk (5) +(S) +43] 


< knh-* + nk-7(2k — k) < (k + 1)N**. 


Thus all the integers p — n* are in the interval [1,(k + 1)N*~*], and since there are 
a(N*) such values, there exists an integer c in the interval [1,(k + 1)N*~*] such 
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that p — n* =c for at least (1 — e)N*/k In N)/(k + 1)N*~* > M primes p. The 
conclusion is reached for x* — d by repeating the argument above with p — nf 
replaced by (n, + 1)* — p. 


THEOREM 2. Let M be an arbitrarily large real number. Then there exists a positive 
integer c such that x” + c? is prime for more than M positive integers x. 


Proof. Let a(x; 4,1) denote the number of primes p = 1 (mod 4) in the interval 
[1, x], and recall that the prime number theorem for primes in arithmetic progres- 
sions gives 7(x;4,1) ~ x/2In x. Thus for a fixed e in the interval (0,1) we can 
choose a positive integer N sufficiently large so that 7(N?; 4,1) > (1 — e)N7/4In N, 
and (1 — e)N/21n N > M. It is well known that for each prime p = 1 (mod 4) there 
exists a unique pair a,, b, of positive integers such that a, is even and a’, + b> = p. 
(See [2], pp. 264-265, for example.) Since there are at most N/2 even integer 
squares in [1, N], it follows that there exists an even integer c such that a » — ¢ for 
at least «(N?;4,1)/(N/2) > M primes p. For these primes p we have c? + b* 
prime. 
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On Inequivalent Group Extensions 


F. B. CANNONITO 
Department of Mathematics, University of California, Irvine, CA 92717 


The purpose of this note is to explicitly construct inequivalent group extensions 
which are not isomorphisms of extensions. We review the basics and terminology. 


ad € 
Let N >» E ~ G be an extension of groups with N abelian, so that G = E/N. If 
—# ot _ 
N » E > G is another extension, then a morphism of extensions is a triple (a, B, v) 
of group homomorphisms such that following diagram (1) commutes 


N» E>G 


Joa 


ad € 

N> E> G., (1) 
The triple (0, 0,0) will be called the trivial morphism. A morphism of extensions of 
the form (1, 8,1) between N > E > G and N » E > G is termed an equivalence. 
A more general type of morphism is an isomorphism; this is a morphism of the form 
(1, B, v) (resp., (a, B,1)) from N> E> G toN»E->G (resp... N » E > G to 
N >» E > G) where a or p is an isomorphism of groups. It follows from the five 
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lemmas that B is also an isomorphism of groups. Equivalence and isomorphism of 
extensions are equivalence relations. Denoting by Ext(G, N) the set of equivalence 
classes of abelian extensions of N by G, it is well known that Ext(G, N) can be 
made into an abelian group. (See, for example, Rotman [1 Chap. 10].) 

As an example of this development, the student is led to a study of Ext(Z,, Z,), 
p prime. Indeed, Ext(Z,, Z,) = Z, and so Ext(Z,, Z,) has p elements. But if 
Z, >> E > Z, is exact, then E has order p? so that E = Za or E = Z, ® Z,. This 
shows for p odd that there exist inequivalent extensions of N by G with isomorphic 
middle terms. Specializing to p = 3 it can be easily seen that given any morphism 
(a, 8, v) of inequivalent extensions of Z,; by Z3, where B is an isomorphism, one 
must have a =1 or » =1 but, of course, not both; so any morphism here is 
necessarily an isomorphism. 

We will now exhibit a morphism of inequivalent extensions of Z, by Z, for 
p > 3, which is not an isomorphism. First, we construct inequivalent extensions. 

Consider the following extensions 


Ey: Z,>Z,2 > Z, 
p > 2, (2) 
|e 


Ey: Z, >Z,2 » Z, 


where p is x > px and eis y> y + Z, and B is any group isomorphism. 
LEMMA 1. E, is not equivalent to E,. 


Proof. Suppose the diagram (2) could be completed so that it commutes wa 
respect to the morphism (1, 8,1). Then 1 + Z, = —B(1) + Z, so that (a) B(1) = 
(mod p). Furthermore, p = pB(1) (mod p”) ‘and thus (b) B(1) = = 1 (mod p). Hence 
2 = 0 (mod p) which is impossible for p > 2. 

We see from Lemma 1 that the three elements in Ext( Z3, Z,) are the equivalence 
Classes of the extension Z, ® Z, and the two inequivalent extensions of the form 
Zo. 


LEMMA 2. If p > 3 then the map B: x —> 2x is an isomorphism Z,2 > Z,2 such 
that there exists a morphism (a, B, v) of diagram (2) with neither a = 1 nor v = 1. 


Proof. Using commutativity on the right we see that if » #1 then, (writing 
v(x + Z,) as vx + Z,) (a) » #1 (mod p) and vx = —2x (mod p) and so for 
x = 1 y= —2 (mod p). Using commutativity on the left, we have pa(1) = pB(1) 
(mod p*) or pa = pB (mod p”) or a = B (mod p). Since a # 1, a #1 (mod p). 
But B is x ~ 2x, so for x = 1 we get the system (7) a = 2 (mod p) and a #1 
(mod p). Taking p > 3 we see both B(1) = 1 (mod p) and the system (7) have 
solutions for a and B and clearly for such solutions x ~ ax and y+ By are 
isomorphisms which make the diagram commute. Thus, (a, 8, v) is a morphism with 
a #1 and vp #1. 

We note for p = 3 there is no solution of » # 1 (mod 3) and »v = —2 (mod 3) and 
so each nontrivial morphism of inequivalent extensions of Z,; by Z, must be an 
isomorphism. 
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Hence we have proved the 


THEOREM. If p is a prime greater than 3 there exist nontrivial morphisms of 
inequivalent extensions of Z, by Z, which are not isomorphisms. 


The author wishes to thank W. Messing for discussions which led to this note. 
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A Rearrangement Inequality and the Permutahedron 


A. VINCE 
Department of Mathematics, University of Florida, Gainesville, FL 32611 


One chapter of the classic book “Jnequalities”’ by Hardy, Littlewood, and Polya 
[3] is dedicated to inequalities involving sequences with terms rearranged. The main 
example in that chapter is the following. Let a, < a, < --: <a, and b, <b, < 

<b, be sequences of real numbers and aw any permutation of the set 
{1,2,...,”}. Then 


n n n 
DL 4jb, 141 < LL aid, < Le a,b;. (1) 
i=1 i=1 i=l 

Hardy, Littlewood, and Polya interpret the a, as fixed distances along a rod and the 
b, as weights to be suspended at these distances: To get the maximum moment with 
respect to an end of the rod hang the heaviest weights furthest from that end; to get 
the minimum moment hang the heaviest weights closest. 

Many variations and generalizations of this rearrangement inequality exist. Three 
appear below, more at the end of this note, and Marshall and Olkin [5] contains a 
relatively recent survey. In this note all sequences {a;} and {b,;} are increasing in 
the sense a, <a, < -:: <a,andb, <b, < -:: < b,, and all sums and products 
are from i=1 to i=n unless otherwise stated. Also N = {1,2,...,n} and S, 
denotes the set of all permutations of N. 


Example 1. |'7| For sequences with positive terms and for all 7 € S, 
[1 (4, + b,_i41) 2 TI (4; + b,;) > T1(4; + b;). 
Example 2. [8] For p > 1 and all w € S, 
Lila; — b |? < Lila; — byi|? < len iat — 5,|?. 
Example 3. [4] One generalization of the Hardy-Littlewood-Polya inequality (1) 


is as follows. If the sequences have positive terms and f is an increasing convex 
function then for all 7 € S, 


Lf (iby 141) < Lif (a,b, ;) < Lf (4;b;). 


Recall that f is increasing if f(x) > f(y) whenever x > y, and f is convex if 
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f(ax + (1 -— a)y) < af(x) + 1 — a)f(y) for all 0 < a < 1. In the differentiable 
case, of course, convex is equivalent to f’’(x) > 0. 


The purpose of this note is to show how the permutahedron leads to a very 
simple generalization of the Hardy-Littlewood-Polya inequality, from which the 
inequalities above and many other rearrangement inequalities immediately follow. 
In [5] rearrangement inequalities are derived using majorization, which is a partial 
order on the set of vectors in n-dimensional Euclidean space. This elegant method 
goes back at least to Schur [10] and is a unifying principal for many types of 
inequalities. The intention in this note is to use a much simpler partial order on S,, 
to obtain the rearrangement inequalities. 

Let # denote the identity permutation %(i) =i of N and 4% the reverse 
permutation 4(i) =n —i+t+1,i=1,...,n. An inversion of a permutation 7 of N 
is a pair (aj, 7k) such that j < k and aj > wk. For example, (5, 3) is one of the 
four inversions of 7 = 25134. Now consider the directed graph P, whose vertex 
set is S,, and there is an edge (0, 77) directed from o to 7 whenever vertex o is ob- 
tained from vertex 7 by interchanging the elements of an inversion of the form 
(7j,7(j + 1)). P, is sometimes called a permutahedron [1] and an example is 
shown in Figure 1. The transitive closure of P,, induces a partial order on the set S,. 
Recall that the transitive closure is the “smallest” directed graph with the same 
vertex set as P,, and with the property that if (7,7) and (7,0) are directed edges 
then so is (7,0); the partial order on S, is defined by o > z if there is an edge 
directed from o to z in the transitive closure of P,. A real valued function g: 
S, — R is called order preserving if g(o) > g(a) whenever o > 7. This is all that is 
needed to prove the following theorem. 


1234 


1432 


8 2341 
4132 


4321 


Fic. 1. Permutahedron P,. 
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THEOREM. Let g,,..., g, be real valued functions defined on an interval I. Then 
Le 8i(By—-i+1) < L8i(ni) < 28;(b;) (2) 
for all sequences b, < b < ++: <b, inI and all 7 © S, if and only if 
2.41 — 8, isincreasingonI, 1<i<n. (3) 


Remark. If the functions g, are differentiable, then it is clear that 
Bi(x) < g3(x) <--> <g,(x) forall x EJ (3’) 
is equivalent to condition (3). 


Proof. (=) Fix a sequence b, < b, < --- <b, and let g: S, > R be defined by 
g(7) = )\2,(b,,). To show that g is order preserving it suffices to show that 
g(o) > g() whenever (0, 7) is an edge of the permutahedron: 


g(o) — g(r) = | g,(b,;) + Bj41( bac j+1))| — | 3; (0,,) + Bj 41( Baja) | 
= (8) 41 — g;)( b,j) 7 (Sj41 7 &;)( be i+1y) 2 0. 


The last inequality follows from b,, < b,,;41) and the assumption that g;.,; — g; is 
increasing. Since 4, <7 < ¥ for any permutation 7, also g(.4) < g(7) < g(4), 
which is precisely inequality (2). 

(=) By way of contradiction assume that g,,., — g,, 1S not increasing for some 
m. Then there exists x > y such that (g,,41 — 8,X) < (2m417 &m)(v). Now 
choose any sequence b, <b, < -:: < b,in J] withb, =x and b,,, =y. Let a be 
the transposition (mm + 1). Then )'¢,(b,,) = 


y g(b:) + Sn(X) + Bmaily) 


ixXé#m,m+1 
> DY g(b;) + &mly) + Smale) = Lg, (8), 
i#m,m+1 
contradicting inequality (2). 0 
The inequalities of Examples 1,2, and 3, as well as those below, result by simply 
substituting the appropriate g,; in the theorem. For example, choosing g,(x) = a;x 
yields the classic Hardy-Littlewood-Polya inequality (1). Choosing g,(x) = 
—log(a,; + x) yields Example 1; and choosing g,(x) = f(a,x) gives Example 3. In 
each case it is an exercise to show that the g, satisfy condition (3) or (3’) in the 
theorem. For the Hardy-Littlewood-Polya inequality condition (3) is immediate. For 
Example 3 the verification of condition (3) is a little tricky, but still elementary. 


Example 4. Let g,(x) = f(a; — x). If f is convex then for all 7 € S, 
Lf a; —b;)< Lf (a; — bi) < Lf (4n—i41 — b,). 
With f(x) = |x|? this is Example 2. 


Example 5. [2] Take g,(x) = g(a,, x) where g is a real valued function of two 
variables defined on a domain D = [a, b] X [c, d]. If 


8(%1, 1) — 80%, ¥2) — B(%2, 1) + 8(X2, 2) = 0 (4) 
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for all x, > x5, y, > y> in the domain then 


g(a, b,-i+1) < Vis(a;, bri) < vga; b,) (5) 


for all 7 © S,. Note that for a function with continuous second derivatives condi- 
tion (4) can be replaced by 


d°g(x, y) 
—_—— >0 forall(x, y) €D. 4’ 

ay (x, y) (4 

Example 6. Many inequalities can be generalized to more than two sequences. 
Let {aj}, {a7},...,{a}, 1 <i<n, be not necessarily increasing real sequences 
and let a), Q(y),---, 4) denote the sequence aj, a,,..., a, im increasing order. 
Suppose g(x,,..., X,,) satisfies condition (4) or (4’) for every pair of variables. Then 


Vig(al,a?,...,a”")< Vga, a2),...,47%). (6) 
This inequality follows directly by induction using the right inequality of (5) as the 
first step. Choosing g(x,,...,X,) =X, xX, °°: xX, and g(x,...,%X,) = 
—log(x, + --: +x,,), respectively, in (6) results, for sequences of positive terms, in 
inequalities analogous to (1) and Example 2 [9]: 
yi aja? eo a;" < Qn 7 ali) 


and 
TI(a + a? sa +a‘) > [1 (a1, + aii +ose: tam). 


Example 7. Choose g(x,,..., X,,) = f[min(x,,..., x,,)| in (6). If f is an increas- 
ing function then 


Ys{ min aj) < Ls{ min aj, ). 


j 
Taking f(x) = x and f(x) = log(x), respectively, leads to inequalities of Minc [6]: 


mina! < 2 minaj, 
j j 
and 
[| mina/ < [] min aj;,. 
J J 
Similar inequalities hold for the max function. 
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Musings on the Prime Divisors of Arithmetic Sequences 


PATRICK MORTON 
Department of Mathematics, Wellesley College, Wellesley, MA 02181 


Of the early proofs one usually sees in a number theory course, the most 
beautiful is the proof, due to Euclid, that there are infinitely many primes. This 
theorem may be formulated as follows. 


If { a, }°°_, is any sequence of integers, and p is a prime for which p|a,, for some 
n, p is called a prime divisor of the sequence {a,,}°°_,. (See [2].) Euclid’s theorem 
says that the sequence {n}°°_, has an infinite set of prime divisors. What other 
sequences have this property? 

For example, consider the sequences whose terms are defined by the following 
formulas: 

1) a, = f(n), where f € Z[x] is a nonconstant polynomial; 

2) b, =[n*], where brackets denote the greatest integer; 

3) ¢, = [an?? — [an] 2 | + [2]; 

4) d,=2" +1. 
Which of these sequences has an infinite number of prime divisors? 

The answer is, of course, that they all do. The fact that {a,}°°_, does was first 
proved in an elementary way by Schur [8], and is usually stated as follows. (See [2] 
and [3] for other proofs and more on the prime divisors of polynomials.) 


THEOREM 1. Let f(x) © Z[x] be nonconstant. Then the congruence 


f(x) = 0(mod p) 
has a solution x © Z for infinitely many primes p. In other words, infinitely many 
primes divide the terms of the sequence { f(n)}?-_. 


The purpose of this note is to give a surprising proof of this result using a 
well-chosen infinite series, and then to see where the proof leads. It will turn out 
that the proof can be generalized to show that the sequences {b,}%_, and {c, }°_, 
both have infinite sets of prime divisors, but that the same proof cannot decide this 
question for the simpler sequence {d,, }°°_,! The proof will also be a surprise in that 
it hides an algebraic structure. 


1. A proof of Schur’s theorem. Assume theorem 1 is false for some non-constant 
f(x) € Z[x], and let m = deg f. Then for n € Z, f(n) = 0 or (by the fundamental 
theorem of arithmetic) 


f(n) = tppp? +--+ pe, 
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for a fixed set { p,, p>,.-., Pp, } of primes and exponents a, > 0 in Z. Furthermore, 
since f is a polynomial, f can take on a given value at most m times. Choose N 
large enough so that f(n) # 0 for n > N and form the infinite series 


0° 1 
s= Y ——_~. 
1/m 
n=n | f(n)| 
By the above remarks | f(n)|'/7" = p@/"p$2/™ --- pae/™, for some da), 2>,..., Ay; 


and this holds, for a given set of integers a,, a5,..., a,, for at most 2m values of n. 
Hence we have the estimate 


(1) 


2m 


=2m|]| dX p, o7™ (2) 


so S is a convergent series. But this is impossible, since 
1 
f(a) | 0" < an, 
for some constant a > 0, so that S diverges like the harmonic series. This contradic- 


tion proves theorem 1. 


Let us now see what paths this proof can lead us down. Without any changes the 
same proof yields the following stronger theorem. 


THEOREM 2. Let f: N — Z be any arithmetic function which satisfies 

(a) |f(n)|< an™, for sufficiently large n and some a, m > 0; 

(b) the number of solutions of f(n) = k is bounded, for all k © Z. 
Then the sequence { f(n)}°_, has an infinite number of prime divisors. 


As a simple (nonpolynomial) example of a function satisfying the hypotheses of 
theorem 2 take 


f(n) = [6n*], 
where @ and a are positive real numbers and a > 1. Property (a) is clear, and 
property (b) is not hard to show. For example, if f(”) = k, then 


k<O0n%<k+1, 


k \\/4 k+1)\'74 
— <n< 
a) <"<(] 


The number N(k) of integers k, for which the last inequality holds, satisfies 


wie < (S24) — (2) a 


1 /c\G/e-1 
— + 1, 
G) 


a 


Or 
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for some c with k <c<k-+1, by the mean value theorem applied to (x/0)'/*. 
Since a > 1 and c > k, this obviously implies N(k) is bounded, and proves (b). 
For example, each of the sequences 


{[7n7]},-, and (2 |} 


=1 


has an infinite set of prime divisors. 


2. The proof hides algebraic structure. But now consider the conditions in 
theorem 2. If f, g both satisfy (a) and (b), and g: N > N, then the composition 
sequence { f(g(n))}°_, satisfies (a) and (b) also! 

This is quite easy to see. To prove (a): if | f(n)| < an™ and |g(n)| < bn™, then 


|f(g(n))| < a(g(n))” < ab™n™™. 


For the proof of (b): if f(”) = k has at most K solutions for any k and g(m) =/ 
has at most J solutions for any j, then f(g(n)) = k has at most KJ solutions for 
any k. This gives us 


THEOREM 3. Let % be the set of sequences f: N — N which satisfy the hypotheses of 
theorem 2. Then & forms a semi-group under composition, whose identity is the 
sequence e, where e(n) =n. Every sequence in > has an infinite number of prime 
divisors. 


By theorem 1 and these remarks, if f € Z[x] and 6 > 0, a > 1, then 
{g(")} a1 = { f([@n*]) } aa 


has an infinite number of prime divisors. The same is true of the sequence 
{h(n)}%_,, where 


h(n) = [| f(n)|"]. 


Along more bizarre lines, the sequence { t(n)}°°_, defined by 


t(n) = | 6, [ n°] "| 


lies in 2 if 6,, 6, > 0 and a,, a, > 1, as does the sequence 


([a[nn2]‘]} _ Or {s(n)} a1, 
where 
s(n) = [a[a?[2? --- [a*n°]]]]. 
Exercise (for the skeptical reader): Forget everything you have read in this paper 
and show that the sequence {s(n)}°°_, has an infinite number of prime divisors. 


We mention one more obvious property of 2: it is closed under translation. If 
f © 2 and k EN, then the sequence f, defined by 


f,(n) =f(n) +k 


lies in 2 also. More generally, we have 
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THEOREM 4. If a(n) #0 and b(n) are bounded sequences of integers, and if f 
satisfies the conditions of theorem 2, then so does g, where 
g(n) =a(n)f(n) + b(n). 
The proof is an easy exercise. 
COROLLARY. If f © 2 and k © Z, then the congruence 
f(n) = k(mod p) 
has a solution n for infinitely many primes p. 


3. Extensions to imaginary quadratic fields. But can we extend the original 
proof? Considering the step at equation (2) we find we can. 


THEOREM 5. Let f: N > Z be a function satisfying 

(a’) |f(n)| < an™ for sufficiently large n and some a, m > 0; 

(b’) f(n) =k has at most ck‘ solutions for every integer k > 1, where c is a 
constant and 0 <e«<1/m. 

Then infinitely many primes divide the terms of { f(n)}°_y. 


Proof. Form the series S as in (1). Then our new hypothesis (b’) implies that 


a,/m 
Qy5.+-, an.= Pre 
k —1 
=c[[(l-pfv") <0, 
r=1 
if the only primes dividing f(n) for n > 1 are p,,..., p,. However, S is a divergent 


series by (a’), as before. Thus { f(n)}*_, must have an infinite number of prime 
divisors. 


Theorem 5 has the following consequence. 


THEOREM 6. If f and g satisfy the hypotheses of theorem 2 and a, b and c are 
integers for which a # 0 and b* — 4ac <0, then the sequence {h(n)}°_, given by 


h(n) = af*(n) + bf(n)g(n) + cg?(n) 


has an infinite set of prime divisors. 


This fact depends on the arithmetic in the imaginary quadratic field Q(V— d), 
where —d = b? — 4ac. We shall derive it from the following more general result, 
whose statement and proof use several notions from elementary algebraic number 
theory, namely, algebraic integers, ideals, and the norm function 


Norm(x + y¥— d) = x* + dy’? 


from the field Q(V/- d) to Q. Recall that an algebraic integer is an algebraic 
number which satisfies a monic polynomial equation with integral coefficients, and 
that the set of algebraic integers in the field Q(V— d) forms a ring R,. In the 
following proof we make essential use of the fact that R, has a finite number of 
units, for any d > 0. (See [7].) 


THEOREM 7. Let f: N > R, be any function satisfying: 
(a,) Norm f(n) < an™ for large n and a,m > 0; 
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(b,) f(n) =A has a bounded number of solutions, for all X € Rj. 
Then the sequence { f(n)}°°_, has an infinite number of prime ideal divisors in the ring 
Rj- 


Proof. We define g(n) = Normf(n) and assume that the sequence { f(n)}?7-_, 
has only a finite number of prime ideal divisors p,, 1 < i < k, in Rj. It then follows 
from the theory of ideals in quadratic fields [3] that for every n, the principal ideal 
(f(n)) is a product of powers of the ideals p;: 


(f(n)) = pp ++ pe, (3) 


g(n) = Normf(n) = pf'™ --- pfk*, for alln > 1, (4) 


and so 


for some integers a, and integral primes p, (not necessarily distinct) defined by 
Norm p, = p/', f, = 1 or 2. 
We obtain a contradiction by showing that g satisfies the hypotheses of theorem 
5. Condition (a’) is equivalent to (a,). To prove (b’) we show that for every « > 0 
the number of solutions of 
g(n) =A (5) 
is < cA‘, for some constant c. Factoring A into a product of prime powers and 
using (4) shows that equation (5) is equivalent to 


ppm eee pf = A = gq? eee gq’, (6) 


X fiaa,=b, for l<j<r, (7) 


i 
P44; 


Or 


since several of the primes p; can equal qg,. Without loss of generality r < k; 
otherwise (5) has no solutions. By elementary results in the theory of quadratic 
fields, either only one prime p, equals q,, or two primes p; and p/ equal q,, in 
which case f, = f,’ = 1. Hence (7) becomes 


— — ,/ — 


In any case (7) has at most b; + 1 < 2b, positive integral solutions in the variables 
a;, for each j. It follows that (6) has at most 


b,+--- +b)’ 
2’b, +: b. < 2"| ——— 
r 
solutions, by the arithmetic-geometric mean inequality. Finally, 
b, +--+ +b, <b, log,g, + -:: + blog, q, = log, A, 


and so 


2'b, «++ b <2 


log, A\" 
. < i 22 | < 2*(log, A)" < cAs, 
r 


for any fixed « > 0 and a positive constant c depending on e. This shows that there 
are at most cA‘ choices for the a, in (3). Since the ring R, has a finite number of 
units (at most 6), f(”) can take on at most 6cA‘ values in R,, for any given A. By 
assumption, therefore, there are at most c,A‘ values of n for which (5) holds. This 
proves (b’), as long as e€ is chosen less than 1/m. Theorem 5 now implies that 


328 PATRICK MORTON [April 


{ g(1)}°°_, has an infinite number of prime divisors, contradicting (4). This proves 
the theorem. 

To deduce theorem 6 from theorem 7, we write a - h(n) as the norm of a suitable 
element in Q(y¥ — d), where —d = b* — 4ac: 


ah(n) = (af(n) — ag(n)6)(af(n) — ag(n)6@’) 
= Norm(af(n) — ag(n)@), 
where @ and @’ are the roots of ax* + bx +c =0. Since a@ satisfies a monic 
equation, a@ © R, and theorem 7 applies to the function af(n) — ag(n)@. The 


hypotheses of theorem 7 follow easily from (a) and (b) of theorem 2 since 1 and @ 
are linearly independent over Q. 


As an example take f(n) =[an7], g(n) = 2 |, and a=1, b= —-1, c=1. 
This gives the sequence {c, }°_, considered at the beginning of the paper. 


4. Conclusion and open questions. Our proof does not apply to exponential 
sequences like 


{2"+1}7., or {[10’r]},_,, (8) 


since the corresponding series in (1) is convergent for any m. 

The first sequence in (8) can be seen to have an infinite set of prime divisors 
using the theory of Lucas sequences [1], or more advanced results from algebraic 
number theory (see [4], [6]). It can also be deduced from a striking theorem of H. 
Kobayashi [5]. This theorem states that if the sequence { f(m)}°_, has a finite 
number of prime divisors, then the translated sequence { f(n) + a}°_, has an 
infinite set of prime divisors, for any nonzero integer a! Taking f(n) = 2” and 
a = 1 gives the first sequence in (8). Can Kobayashi’s theorem be proved in an 
elementary way? I do not know. 

It is not known whether the second sequence in (8) has an infinite set of prime 
divisors; this sequence consists, of course, of integers which are finite strings of 
digits from the decimal expansion of =. 

In this paper we have only proved positive results, and so we ask a final question 
(from Paul Bateman) in the opposite direction: is there an elementary proof of the 
fact that infinitely many primes do not divide { f(n)}°_,, for any irreducible 
polynomial f © Z[x] of degree > 1? 
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1. Introduction. In his famous Lectures on the Icosahedron, Felix Klein [6] 
presented a simple and beautiful derivation of the finite subgroups of the group of 
rotations of the sphere. His proof, which has been repeated often, is a blend of 
algebra and geometry ideally suited for a first course in the theory of groups. 
Indeed, H. S. M. Coxeter received a spontaneous ovation from a largely nonmathe- 
matical audience when he presented this proof during his lecture [3] at the Shaping 
Space Conference at Smith College in 1984! The purpose of this note is to present a 
very simple way of extending the list of rotation groups to include all finite 
subgroups of symmetries of the sphere, up to conjugation in its full group. In 
contrast to the standard argument (see section 3), it is a direct continuation of 
Klein’s method, it is intuitively clear, and the list can easily be completed by 
undergraduate students. 


2. The sphere and its symmetries. The sphere in question is the ordinary sphere 
in three-dimensional Euclidean space E?, also known as the 2-sphere because it is a 
two-dimensional manifold. The set of all symmetries of the sphere constitutes its 
symmetry group, the orthogonal group O(3); the subgroup of orientation-preserving 
symmetries is the special orthogonal group SO(3). The finite subgroups of O(3) are 
of interest because they are the symmetry groups of polyhedra and all other “finite 
figures.” 

First let us review the finite subgroups of O(2), the symmetry group of the circle. 
There are two families of groups: cyclic groups n generated by n-fold rotation about 
the circle’s center, and dihedral groups nm generated by reflection in two lines 
forming an angle of 180° /n. This result appears to be very old; Hermann Wey] [8] 
notes that Leonardo da Vinci, 


engaged in systematically determining the possible symmetries of a central 
building and how to attach chapels and niches without destroying the symme- 
tries of the nucleus. 


(An examination of the Codex Ashburnham suggests, however, that Weyl may have 
read too much into Leonardo’s pretty drawings.) 


The finite subgroups of O(3) may be said to have been implicitly studied by the 
ancient Greeks, since they include the symmetry groups of the regular polyhedra, 
but the first systematic attempts to enumerate all of them seem to have been made 
by nineteenth-century crystallographers in their efforts to classify crystal forms. 
They were, however, interested only in those groups that describe ideal crystal 
forms, polyhedra whose rotational symmetries have orders 1, 2, 3, 4, or 6. These 
crystallographers used more or less ad hoc symmetry arguments, and sometimes 
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overlooked the composite isometry, rotary reflection. Klein brought elegance, rigor, 
and abstraction to the subject, and made it possible to produce a complete list of the 
groups. 

The symmetries of the sphere are the isometries of E*? that map the sphere onto 
itself. Isometries are distance-preserving maps; a fixed-point argument shows that in 
E°, there are six types: rotation about an axis, reflection in a plane, translation along 
a vector, rotary reflection (rotation followed by reflection in a plane orthogonal to 
the rotation axis, or vice versa), screw rotation (rotation followed by translation in 
the direction of the rotation axis, or vice versa), and glide reflection (reflection 
followed by translation in the plane of reflection, or vice versa). Since the symme- 
tries of the sphere fix its center, neither simple nor composite translation is among 
them. Thus the symmetries of the sphere are rotation, reflection, and rotary 
reflection. Note that when the angle of rotation is 180°, rotary reflection is 
equivalent to central inversion, and when it is 0°, rotary reflection reduces to 
reflection. 


3. Klein’s method. Since rotations are the only orientation-preserving symmetries 
of the sphere, a finite subgroup H of SO(3) consists entirely of rotations. The 
rotation axes intersect in a point that is fixed by all the elements of H. 

We will briefly review Klein’s enumeration; for more details consult [3], [4], or 
[8]. Let H be a finite subgroup of SO(3) of order N. To each element of H (other 
than the identity) there corresponds an axis that intersects the sphere in two points. 
These points are called the poles of the rotation. The stabilizer subgroup of any pole 
is cyclic, being generated by rotation about a single axis. Klein counts, in two 
different ways, the pairs (h, p), where h € H, p is a point on the sphere, and 
hp = p (e., p is a pole of h). 

First, since there are two poles for each of the N — 1 nontrivial rotations, the 
number of pairs is 2(N — 1). The second way of counting is a little more involved. 
The poles fall into & orbits under the action of H; the ith orbit has n, poles, whose 
stabilizer subgroups have order s,. (For each orbit, the stabilizers of the poles are 
conjugate, and hence all have the same order.) Thus for the ith orbit there are 
n,(s, — 1) pairs (h, p). Now sum over all orbits and equate the two calculations. 
Using the well-known and easily established fact that, since H is finite, s,n,; = N, we 
obtain the equation 


Yil/s,=k-—24+2/N. 


Because each s, is at least 2, k is less than 4; since N is at least 2, k is greater than 
1. Thus H has either 2 or 3 orbits of poles. The few solutions are easily computed; 
they are shown in Table 1. In that table, the first three columns list the numbers of 
poles in each of the orbits of H, while the symbol in the fourth column lists the 
symbols for the cyclic stabilizer subgroups of the rotations corresponding to these 
orbits. Since H is characterized by the orders of these rotations, the symbol can be 
used to denote the group itself. In the fifth column we give the order of H, and in 
the sixth the name of a polyhedron whose rotational symmetry group is H. 

For each group in Table 1, the data force the relative positions of the poles on the 
sphere. Consider, for example, the third line of the table. There are three orbits of 
poles. The first two consist of four poles each, while the third orbit has six poles. 
Column four shows that the orders of the corresponding rotation axes are 3, 3, and 
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polyhedron 


n-gonal pyramid 
n-gonal prism or antiprism 


regular tetrahedron 
cube 
regular icosahedron 


TABLE 1. The finite subgroups of SO(3). Each subgroup H is characterized by the 
orbits of the poles of its rotation axes. For each subgroup there are two or three 
orbits; the sizes of the orbits are given in the first three columns. In the fourth 
column H is denoted by a symbol which juxtaposes the symbols for the stabilizer 
subgroups of the poles of each orbit. In the fifth column we give the size of H, and 
in the sixth an example of a polyhedron whose rotation group is H. 


2, respectively. Now, a rotation of order 3 fixing a pole of the first orbit must map 
the second orbit onto itself; this is possible only if three of its four poles lie 
equispaced on a circle orthogonal to the rotation axis, and the fourth pole is the 
other pole of that rotation. It follows that the poles of the first and second orbits 
must be located at the vertices and face centers, respectively, of a regular spherical 
tetrahedron, and the poles of the third orbit must be located at the midpoints of the 
tetrahedron’s edges (FIGURE 1). Arguing in this way, we find that the finite rotation 
groups of the sphere are the infinite families of groups of rotations of the n-gonal 
pyramid and the n-gonal prism (and antiprism), and the groups of rotations of the 
regular tetrahedron, the cube, and the regular icosahedron. 


Fic. 1. The poles of a rotation group which belong to 3 orbits of sizes 4 (white triangles), 
4 (dark triangles), and 6 (X’s) with corresponding stabilizer subgroups of orders 3, 3, and 2 must be 
located at the face centers, vertices, and midedges of a regular spherical tetrahedron. Dotted lines 
indicate that the figure lies on the hemisphere below the plane of the paper. 


This completes Klein’s enumeration of the finite subgroups of SO(3). 


4. The finite subgroups of O(3). We now turn to the main problem of this paper, 
finding all finite subgroups G of O(3). 

If G does not consist of rotations alone, then its rotation subgroup G MN SO(3), 
which we will call H, must be of index 2 in G, because the composition of two 
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orientation-reversing symmetries (of the sphere) is a rotation. Thus G = A U gH, 
where the coset representative q is not a rotation. 

The argument presented by Polya and Meyer in [7] seems to have become 
standard (see, e.g., [1, 2, 8] and, in more abstract form, [5]). The group action 
argument is abandoned. Instead, two cases are distinguished, according as we may 
or may not choose q to be inversion through the center of the sphere. In the first 
case, G is the direct product of H and a cyclic group of order 2. In the second case, 
gq can be taken to be a rotary inversion (see section 5) whose rotational component 
is not an element of H, but of a rotation group which contains H as a subgroup of 
index 2. 

The method works: every finite subgroup of O(3) is either a rotation group or a 
member of one of these two classes. But, as Johann Kepler remarked apropos 
Plato’s association of the regular solids with the elements, “it has no force of 
necessity; indeed it admits of other interpretations.” There is a very real sense in 
which distinguishing these two classes is unsatisfactory. Rotary inversion is rather 
complex geometrically and it is difficult to visualize the construction of the groups 
when they are derived in this way. Moreover, one would like the symmetry groups 
of the regular polyhedra to be classed together. But the full groups of symmetries of 
the cube and the icosahedron contain central inversion, so they belong to the first 
class, while the full group of symmetries of the tetrahedron does not, so it belongs to 
the second class. (When central inversion is added to the group of rotations of the 
tetrahedron, we obtain the symmetry group of the “pyritohedron,” a nonregular 
pentagonal dodecahedron which is an ideal form of the pyrite crystal.) 

There is, however, another interpretation. We can derive the finite subgroups of 
O(3) in a way that complements Klein’s derivation of their rotation subgroups. 
Using this approach, which can be described as a group-theoretic justification of the 
intuitive method of derivation that crystallographers and other scientists have been 
using for a long time, students can easily visualize the construction of the groups G 
and deduce the complete list. 

The main idea is the simple observation that, since H is a subgroup of index 2 in 
G, the isometry g which we add to it to obtain G must permute the poles of A. 
Either q preserves the ith H-orbit, or it merges it with another orbit of the same 
size. The latter option is of course available only if a second orbit of the same size 
exists. 

If g preserves an orbit of H, then the order of the stabilizer subgroup of the orbit 
is doubled in G, since the size of an orbit times the order of its stabilizer subgroup 
equals the order of the group, and |G| = 2|H|. Since the stabilizer subgroup is now 
dihedral, g can be taken to be a reflection in a plane containing a pole of the ith 
orbit. When k = 3, there is always at least one orbit that must be preserved by g, 
because it is the only one of its size. Thus g can always be chosen to be a reflection 
in a Suitable plane containing a pole of such an orbit. Note that this choice of q 
completely determines the extension of the cubic and icosahedral rotation groups: 
the three orbits of poles of these groups are of different sizes, so g must fix each of 
them (see FIGURE 2). For the groups n22 and 332, there are two choices for g, 
according as gq fixes or permutes the two orbits of the same size (see FIGURE 3). In 
the first case, we obtain the symmetry groups of the prisms and of the regular 
tetrahedron; in the second, the symmetry groups of the antiprisms and of the 
pyritohedron. 
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(a) (b) 


(c) 


Fic. 2. The group of rotations of the cube (a) can be extended to (c) the full group of cubic symmetries 
by adding reflection in a plane (b) which maps each of the three orbits of poles onto themselves. 


(a) (b) 


Fic. 3. Here H is the tetrahedral rotation group 332. (a) gq can be chosen to be reflection in a plane 
which maps the orbits of poles of order 3 onto one another. (b) g can also be chosen so that all orbits are 
mapped to themselves. 


When k = 2, H is cyclic. Again, g must fix or interchange the two poles. When g 
fixes them, the group G is dihedral. When g interchanges them, it is necessary to 
distinguish the cases when gq is a reflection or a rotary reflection q. 

The finite subgroups of O(3) (which are not in Table 1) are listed in Table 2. 


5. A word about notation. There are (at least) two sources of confusion that one 
encounters in reading in the literature: the distinction between rotary reflection and 
rotary inversion, and the seemingly endless proliferation of notation for the groups 
we have been discussing. 
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polyhedron 


nm nm n-gonal pyramid 

n/m — 

In rhomic disphenoid, when n = 2 
nm 2m 2m n-gonal prism 


nm 2 2 n-gonal antiprism 
3m 3m 2m regular tetrahedron 
33 2m pyritohedron 

4m 3m 2m cube 

5m 3m 2m regular icosahedron 


TABLE 2. The finite subgroups of O(3). We extend the list of 
groups H in Table 1 to include reflections and rotary reflections. 
H is listed in column 1. The symbol in the second column 
indicates the G-stabilizer subgroups of the poles of H; the symbol 
completely characterizes G. (The notation is explained in the text.) 
G is the symmetry group of the polyhedron in the third column. 


Recall that n-fold rotary reflection, usually denoted by #, is a composite motion, 
rotation through 360°/n followed (or preceded) by reflection in a plane orthogonal 
to the rotation axis. This motion generates a cyclic group n of order n or 2n, 
according as n is even or odd. When n is even, the group has orbits consisting of n 
points arranged alternately above and below the plane of reflection. However, when 
n is odd, it has orbits of 2” points arranged in mirror-image pairs. The latter group 
can also be generated by a rotation of order n and reflection in a plane orthogonal 
to the rotation axis. This is the type of group that crystallographers denote by n/m, 
Thus we will use the symbol 7 only for even values of n. 

Rotary inversion, on the other hand, is the composition of n-fold rotation about 
an axis with central inversion through a point on that axis; it is usually denoted by 
n. Rotary inversion is even more confusing than rotary reflection: its properties vary 
according as n = 4k, n= 4k + 2, or n = 4k + 1. Thus 2 =f when n is divisible 
by 4, n = 2n when n is odd, and n = (n/2)/m when n/2 is odd. Despite this, 
rotary inversion seems to be more commonly used. 

There is no commonly accepted notation for the finite subgroups of O(3), and 
there are no commonly accepted names for most of them. The notation used in this 
paper was inspired by crystallographic usage, which, when k = 3, denotes the 
rotation groups by the orders of rotation corresponding to the three orbits of poles. 
However, I have departed from the crystallographic symbols in order to keep the 
notation consistent with the derivation of the groups (see Table 3). As explained 
above, in Table 1 my symbol for each group is simply the juxtaposition of the 
symbols for the cyclic stabilizer subgroups of poles of the corresponding orbits. This 
symbolism is continued in Table 2: the symbol for the rotation subgroup is simply 
augmented, when appropriate, to denote the G-stabilizer subgroup of the pole. For 
example, the two extensions of the tetrahedral rotation group 332 are denoted 
3m3m2m and 332m; the first indicates that all three stabilizers are now dihedral, 
while the second indicates that only the third stabilizer is. However, when k = 2 
and q interchanges the two poles, both stabilizers remain cyclic. In this case g must 
be a rotary reflection and we have to distinguish the cases when the angle of 
rotation is 0 or greater than 0. In the first case G is a group of type n/m, and in the 
second of type 2n. 
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Coxeter Remark 


n 
n22 
23 
432 
nmm H3|C; 
n/m C C;'* n even 
Qn C2"\Cc3 n odd 
(n/2) C;'* n even, n/2 odd 
A C2"1Cr n even, n/2 even 
nm 2m 2m n/mmm H,'* n even 
Inm2 H}?"| H? nodd,n>1 


nm 2 2 2n2m H3"| H? n even 
nm nodd,n>1 
3m 3m 2m 43m 
332m m3 
4m 3m 2m m3m2/m 
5m 3m 2m — 


TABLE 3. Comparison of notations for the finite subgroups of O(3). In the first column we list the 
notation used in this paper. In the second, we give the symbol used by the International Union of 
Crystallography (IUCr); these symbols are defined only for the “crystallographic” groups, whose 
rotational symmetries have orders 1, 2, 3, 4, or 6. The third column (C & M) lists the symbols used by 
Coxeter and Moser [4]. The symbols in columns four and five are those used in [2] and [5] by Coxeter 
and by Grove and Benson (G & B), respectively. 


Some of the many notations that can be found in the literature are shown in 
Table 3. The rationales, advantages, and disadvantages of these different notations 
could serve as the basis for an interesting and instructive class discussion! 
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A Note on Defining Conditional Probability 


ARTHUR RAMER 
School of Electrical Engineering and Computer Science, University of Oklahoma, Norman, OK 73019 


The notion of conditional probability is usually introduced on the basis of 
frequencies of events belonging to a specified subset (of the set all admissible 
events) events. 

Given P(x;) =p; i= 1,...,n, not identically 0 on the set of events {x;}, 
i= 1,...,m, m <n, let us suppose that we wish to define an associated conditional 
distribution Q(x,;) = q;, i= 1,..., m, on that subset. It is then defined as 


P; 


» Pr 
k=1 


qi — 


9 


Information theory offers an alternative method of defining a conditional distri- 
bution, through the notion of information distance which is interpreted as the 
difference of the information content between the distributions. This approach 
suggests that the conditional distribution should be defined as the one for which the 
distance from the original distribution is minimal. 

Such distance, known as /-divergence [2, 3] is defined for discrete distributions as 


q; 
(Q, P) = qi log7 


i 


given two distributions p = (p,), ¢ = (q;), both corresponding to the same set of 
events {x,}. This distance is defined only if g; = 0 whenever p, = 0; however, it is 
sufficient for our discussion. 

To apply the distance formula we must first expand the sought-for distribution 
Ji>--+>4m to the same domain as the original one p,,..., p,. An obvious natural 
extension is q),..-,4,,,0,...,0. Now, under the convention q; log q;/p; = 0 for 
q,; = 0, the problem becomes one of minimizing | 


m qj 
H(q,---54m) = Dg) log 


i=l H 


subject to 


2(G)s---5 4m) = Ga 1 
i=l 


and 


q, = 9, i=1,...,m. 


Application of Lagrange multipliers gives 


6H 6z q; 
—+.A—=0 or log—+1+A=0, 
4, dq 


i i 
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thus suggesting that 


Pi 
qi ~ —m 
do Px 
k=1 
is the required answer. To verify it we note that (putting p = >) p,) 
k=1 
yr q; m m P; _ 
Lu glog— = LY gjlogg,- Lg, log — log p. 
i=] i i=1 i=] 


Because the last term is constant, Shannon’s inequality [1] shows that the difference 
of the first two terms is nonnegative and equals 0 only when 
q; = Pi i=1,...,m. 
q; 

Occasionally, a dual expression J( P,Q) = p, log p;/q;, or a Symmetric sum 
I(Q, P) + I(P,Q), is used as the measure of distance [2]. Again, Shannon’s 
inequality shows immediately that its minimum is reached for the same values of gq,. 

Therefore, the defining formula for a conditional probability distribution can be 
motivated by the observation that it is the closest one to the original distribution 
and concentrated exclusively on the specified subset of the space of events. 
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Continuous Functions and Connected Graphs 


C. E. BURGESS 
Department of Mathematics, University of Utah, Salt Lake City, Utah 84112 


There seems to be a tendency for students in elementary calculus, and also in 
some more advanced courses, to picture continuous real functions, with the real line 
R as their domain, to be those functions over R that have connected graphs in the 
plane R’. A standard counterexample, of course, is the function that is zero for 
x = 0 and sin1/x for x # 0. I contend that this tendency by students to equate the 
concepts of “continuity” and “connected graphs” is close to a correct equivalence 
and that the incorrect impression is very likely based on their lack of a clear 
understanding of such concepts as “limit points,” “closed sets,” and “connected 
sets.” It is correct to conclude that continuous real functions over R are those 
functions over R whose graphs, in the plane R’, are both closed and connected. 
While this follows from the characterization of a topological line as a locally 
compact metric space that is separated into two connected sets by each of its points 
[4, 5], it is my main purpose in this note to present an elementary proof. Also, I will 
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include elementary proofs that for a function f: R > R with a closed graph in R’, 
“local boundedness” and the “intermediate value property” furnish two other 
equivalences to the continuity of f. I have not noticed these equivalences in 
undergraduate textbooks. The proofs presented here are also valid where the 
domain of f is a closed interval or a closed ray in R. 


THEOREM. Jf f: R —R _ has a closed graph in R’, then the following four 
conditions are equivalent: 

(a) f is continuous. 

(b) f is locally bounded. 

(c) f has the intermediate value property. 

(d) The graph of f in R? is connected. 


Our procedure will be to show, with three lemmas, that each of (b) and (c) 
implies (a) and that (d) implies (c). There are well-known easy proofs that (a) 
implies (b), (c), and (d), so we do not include them here. 


Lemma 1. Jf f: R — R is locally bounded and the graph of f is closed in R*, then f 
is continuous. 


Proof. Suppose that f is not continuous at b € R. Then there is a sequence { 5, } 
converging to b such that the sequence { f(b,)} converges to a number g # f(b). 
Thus the sequence of points {(b,, f(b,)} converges to the point (b,q) in R’, 
contradicting the hypothesis that the graph of f is closed. 


LEMMA 2. If f: R — R has the intermediate value property and the graph of f is 
closed in R*, then f is continuous. 


Proof. Suppose that f is not continuous at b € R. Then for some e > 0, there is 
a sequence {b,} converging to b such that, for each n, |f(b,) — f(b)| > «. For 
convenience, we assume that each f(b,) > f(b) + «. The intermediate value prop- 
erty implies that, for each n, there is a number c, between b, and b such that 
f(c,) = f(6) + e. Thus the sequence of points {(c,, f(c,.))} converges to the point 
(b, f(b) + €), contradicting the hypothesis that the graph of f is closed. 


LEMMA 3. If the function f: R > R has a connected graph in R’, then f has the 
intermediate value property. 


Proof. It is easy to see that f would have the intermediate value property if, for 
a < b, the set {(x, f(x))|a < x < b} is connected. So suppose that this set is the 
union of two mutually separated sets H and K; that is, neither of the sets H and K 
intersects the closure of the other. There are two cases to consider, and each of them 
contradicts the hypothesis that the graph of f is connected. 

Case 1. The points (a, f(a)) and (b, f(b)) are in H and K, respectively. This 
would mean that the graph of f is the union of the two mutually separated sets 
H U {(x, f(x))|x < a} and K U {(x, f(x))|x = b}. 

Case 2. Both of the points (a, f(a)) and (5, f(b)) are in H. This would mean 
that the graph of f is the union of the two mutually separated sets K and 
HU {(x, f(x))|x € [a, bp. 


Remarks. It is an easy exercise to show that the continuity of f, in the above 
theorem, implies that the graph of f is closed [3]. Thus there is a redundancy in the 
part of the theorem where condition (a) implies (b), (c), and (d). However, without 
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the requirement that the graph of f be closed, the only other valid implication 
among conditions (a), (b), (c) and (d) is that (d) implies (c), as shown in Lemma 3. 
We can use the two functions 


1 1 
fi(x) = —sin-, x #0; f,(0) = 0, 
x x 
and 
f,(x) = 1, x > 0; f,(x) = 0, x < 0, 


to see that there is no valid implication between (b) and (c) and none between (b) 
and (d) unless “closed” is included in the hypothesis. The most interesting question 
in this respect, raised by the referee, is whether (c) implies (d). We furnish a negative 
answer to this question with a slight change in a function described by F. B. Jones 
[2]. In Theorems 2 and 5 of his paper, Jones used a Hamel basis [1] for R to describe 
a function g: R — R such that the graph of g intersects every horizontal open 
interval in R*. So this function has the intermediate value property. (Jones described 
some other interesting properties of this function that are not needed for our 
purposes here.) The graph of g, as described by Jones, is connected. However, we 
can change g slightly as follows to obtain a function f of the desired type that is not 
connected. Let C denote the unit circle with center at the origin. For each x such 
that (x, g(x)) © GC, let f(x) = g(x) + 3; otherwise, let f(x) = g(x). Clearly we 
still have the property that the graph of f intersects every horizontal open interval, 
so f satisfies the intermediate value property. To see that the graph of f is not 
connected, notice that it does not intersect C yet it intersects both components of 
the complement of C. 
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The /, Matrix Norm and the Spectral Radius 


ROBERT A. LESLIE 
Department of Mathematics, Agnes Scott College, Decatur, GA 30030 


A useful theorem in the theoretical development of iterative techniques to solve 
systems of linear equations in an undergraduate numerical analysis course is [1, 
p. 419] and [2, p. 21]. 


THEOREM. /f A is a real, n X n matrix then [p(A‘A)]'” = ||Al|,, where p(T) is 
the spectral radius of T and || - ||, denotes the |, matrix norm. 


In [1], the proof of this theorem is omitted as too advanced for undergraduates, 
whereas the proof given in [2], and referred to in [1], involves several other results 
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which most undergraduates will not have seen by the time they take a numerical 
analysis course. 

One idea that almost all these students will be familiar with is that of Lagrange 
multipliers. Here is an approach to this theorem involving Lagrange multipliers and 
some elementary linear algebra. In addition, several other results follow quickly as 
corollaries. All norms used are /, norms. 

Consider the function F: R” — R defined by F(x) = x'Ax, where A is a real, 
n Xn matrix and x is a vector in R”. We wish to find the critical values of F 
subject to the constraint that ||x|| = 1, ie, x'- x = 1. Since F is very smooth, then 
on the unit sphere in R” there clearly is a solution to this maximum/minimum 
problem. We define G: R” > R by G(x) = x‘: x —1 and apply the method of 
Lagrange multipliers to this constrained maximum /minimum problem for F subject 
to the condition that G=0. Since VG = 2x #0 for ||x||= 1, the method is 
applicable, and we set VF = c-: VG, where c is the Lagrange multiplier. 


LEMMA 1. VF(x) = (A‘ + A)x. 
Proof. Let A = (a,;). Then 
OF ) ax! 


= x. Al] + (0,...,1,...,0) - (Ax), where each 1 is in position i 
0 
= De ax; + Le a,x; = y (a, + a;;)x,; = [( A? +A) x]_,. oO 
j=l j=l j=l 


Hence, we have the equation that (A‘ + A)x = 2cx. Let } = 2c and we have 
that A is an eigenvalue of A’ + A and x is the corresponding eigenvector. Further- 
more, for any such critical vector x, we have 


A'x + Ax = Xx 
x'A'x + x'Ax = x'Ax =AxX =A), 
since x'x = 1. Therefore, 
(xtdx)' + x'Ax =X. 
But since x‘Ax is a real number, (x‘'A4x)' = x‘Ax, so that 
x'Ax = 5X. 
Summarizing these results, we have 


THEOREM 1. If A is a real, n X n matrix, then 
1. max (x'Ax) = 5p(A‘ + A) 


I|x]=1 
2. min (x‘Ax) = } minimum eigenvalue of A‘ + A. 
|x|]=1 


COROLLARY 1. ||Aj| = [e(4‘A)]'”. 
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Proof. 


|| Al]? = max (|| Ax||?) = max (Ax)‘(Ax) = max (x‘A'Ax) 
Ix=1 |x|]= 1 |xI=1 


= 1p| A'4 + (4'4)'] = 40[2- 44] = [444]. 0 
Other results follow quickly. The first may not be known while the second is a 


classical result in linear algebra. Implicit in the arguments below is the elementary 
fact that all eigenvalues of a real, symmetric m X n matrix are real. 


COROLLARY 2. A is positive definite if and only if A' + A has all positive eigenval- 
ues. 


Proof. From the theorem, min, ,,(x‘Ax) = + minimum eigenvalue of A‘ + A. O 


COROLLARY 3. If A is a real, symmetric, positive definite matrix then A has all 
positive eigenvalues. 


Proof. Here A‘ = A so, 5 minimum eigenvalue of A’ + A = minimum eigenvalue 
of A. O 
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Does 
20 dy —t 
y «TI log? | 
k=1 j=l 


converge? 
E3382. Proposed by Geoffrey R. Robinson, UMIST, Manchester, England. 


Suppose that R is a commutative ring with identity and that A is an n by n 
matrix with entries in R. If det A is a zero divisor in R, show that there is an n by n 
matrix B with entries in R such that B is not the zero matrix O but AB = BA = O. 
More specifically, show that B may be chosen as a polynomial in A, ie., as a finite 
linear combination of I, A, A*,... with coefficients from R. 


E3383. Proposed by H. G. Diamond, University of Illinois at Urbana-Champaign. 


Suppose a is a given number in (0,1]. Show that there exists a unique sequence of 
positive numbers dy < a, < a, < --: such that a,/n > 1 as n— oo and 


Gna, 1 + cost 1 
[——a = (n =0,1,2,...). 
a, t n+aQ 


SOLUTIONS OF ELEMENTARY PROBLEMS 


An Implicitly Defined Decreasing Function 


E 3287 [1988, 763]. Proposed by David M. Bloom, Brooklyn College, Brooklyn, 
NY. 


It is easy to see that for each x in (1, e) there is a unique y = g(x) in (e, oo) such 
that x” = y*, or equivalently (log y)/y = (log x)/x. For example, g(2) = 4. Prove 
that if h(x) = x®, then h is a decreasing function on (1, e). 


Solution I by O. P. Lossers, University of Technology, Eindhoven, The Nether- 
lands. Showing that h(x) is decreasing is equivalent to showing that loglogh is a 
decreasing function of log x. Writing u = log x and v = log g, the problem amounts 
to showing that v + log uw is a decreasing function of u for u € (0,1). 

Upon taking logs twice, the defining relationship x2) = g(x)* becomes v + 
log u = u + log v, which yields dv/du = (uv — v)/(uv — u) under implicit differ- 
entiation. We seek to show that 

d | uv — | 

__ + — 

au” 084) u(v — 1) 
is negative for u & (0,1). Since g(x) > e, we have v = log g > 1, so the denomina- 
tor is positive and it suffices to show wv < 1 for u € (0,1). 
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For this, we use the parametric representation (u, v) = (ft, te')/(e' — 1),0<t< 
oo, which can be verified by substitution. This yields 


t ? f 
ee ne 1. 
uv | ot/2 — =| | 2sinh( 1/2) | * 


Solution II by Zachary Franco, University of California, Berkeley, CA. Let 
y = g(x). We parametrize the curve (x, y) by x =s’@7) and y=s'/07) for 
s > land x = y =e for s = 1. Note that s > 1 implies y > x, so that x” = y* and 
s > 1 imply 0 < x <e. Since y = xs, we have 


dy ds 
ax =S+ x. 
From x°~! = s, we obtain 
ds s(s — 1) 
dx x(1 — slog x)’ 
SO 


dy s—slogx 
dx 1 — slogx 


Now consider h(x); we seek to show h’(x) < 0. Since h(x) > 0, it suffices to 
show 


h'(x) dlogh 
h(x) — dx 
We have log h = y log x, so it suffices to show that 


is negative. Since x° = y > e, the denominator of the last expression is negative, and 
it suffices to show s(log x)? = s(log s)*/(s — 1)? < 1 for s > 1. This is equivalent 
to logs < s'/* — s~'!/*, which follows from the integral /*(2t)~'(¢'/4 — 171/74)? at 
> 0. 


Editorial comment. Michael Vowe remarked that the parametrization in the last 
solution was due to D. Bernoulli. Several readers mentioned the related problem #1 
on the 1961 Putnam exam and its solution in Gleason, Greenwood, and Kelly, The 
William Lowell Putnam Mathematical Competition (1980). 


Solved also by the proposer and 26 other readers. 


Greatest Common Divisors, Revisited 


E 3288 [1988, 763]. Proposed by John J. Cade, Pikeville College, Pikeville, KY. 
It is well known that if a, m, and n are positive integers with a > 1, then 


(a"™—1,a"-1) =a” —- 1, 
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where (m, n) is the greatest common divisor of m and n. Determine (a™ — 1, a” + 1) 
and (a" + 1, a" + 1). 


Solution by Kee-Wai Lau, Hong Kong. Let 2' and 2/ be the highest powers of 2 
dividing m and n, respectively. Then the desired values d and d’ are 


am™na+ yo i=j 
d=(a"+1,a"+1)= 1, i#j and aeven, 
2, i#j anda odd 
a™MO+4+), i>; 
d’=(a"-1l,a"+1)= 1, i<j and aeven. 
2, i<j andaodd 


Since the techniques for obtaining d and d’ are similar, we prove only the result for 
d (see editorial comment below). Let b = a&””, r= m/(m, n), and s = n/(m, n), 
so that d= (b’ + 1, b° + 1) and (r,s) = 1. Hence there exist positive integers e 
and f such that jer — fs| = 1. 

If i = j, then r and s are odd. By symmetry we may assume that er — fs = 1, e 
is odd, and f is even. From b’ = b* = —1 mod d we have (b’)° = ud — 1 and 
(b’)/ = vd +1, where u and v are integers. Thus ud — 1 = (vd + 1)b, which 
implies d\(b + 1). Since r and s are odd, (b + 1)\(b” + 1) and (6 + 1)((b* + 1), 
which implies that d = b + 1. 

If i #7, then r and s have different parity. By symmetry we may assume that r 
is even and s is odd. Then (b”)* = yd — 1 and (b*)’ = zd + 1, where y and z are 
integers. Hence yd — 1 = zd + 1, which implies d|2. Since 2|d only if a is odd, the 
result follows. 


Editorial comment. Allan Pedersen and O. P. Lossers noted that 
(a"™—1,a"+1)=(a"+a", a" +1) =(al™""41,a"+4+1), 


so that the formula for d’ can be obtained from the formula for d. 

As David Callan pointed out, the computation of d was problem E 2378 [1972, 
906], proposed by D. E. Penney. The solution of E 2378 by Alan Stein [1973, 1059] 
can easily be extended, as Callan did, to compute d’ also. A related problem was 
mentioned by E. T. H. Wang: If (a,b) = 1, then (a”™ — b”, a” — b”) = a* — b* 
(proposed by Tom Apostol, Mathematics Magazine 53(1980) 49 and 54(1981)86-87). 
This problem, as well as E 3288, was generalized in the solution of E 3288 by 
Donald Knuth. Knuth’s generalization replaces 1 by powers of b at appropriate 
places. Let d’ = (a™ — b”™,a" + b”) and d=(a™+ b”, a” + b”). Knuth proves 
that d’ = a” + 5&” if a higher power of 2 divides m than divides n and 
d' = (a+ b,2) otherwise, and he proves that d= a” + b(™™” if the highest 
power of 2 dividing m and n is the same and d = (a + 5,2) otherwise. 


Solved also by R. Breusch, D. Callan, J. Duemmel, R. B. Eggleton (Brunei), J. Ferrer (Spain), J. 
Herzog (West Germany), W. Janous (Austria), H.-S. Ki (South Korea), D. E. Knuth, L. Kuipers 
(Switzerland), J. R. Kuttler, O. P. Lossers (The Netherlands), R. Martin (West Germany), A Nijenhuis, 
A. Pedersen (Denmark), M. Polakovic (Czechoslovakia), H. Schmidt, Jr., J. H. Steelman, Stanford 
Ordinal Group, and the proposer. 
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Two Very Special Mappings 


E 3289 [1988, 872]. Proposed by C. Alsina, Universitat Politecnica de Cataluna, 
Barcelona. 


Let J = [0,1]. Prove that there are exactly two mappings T from I X J to I that 
have the following five properties for all x, y, z in I: 

(1) T(T(x, y), 2) = T(x, T(y, z)), 

(2) T(x, y) = T(y, x). 

(3) T(x, z) < T(y, z) if x <y, 

(4) T(x, 1) = x, 

(5) T(zx, zy) = z*T(x, y) for some fixed positive k. 

Solution by Robert C. Williams, Alfred University, Alfred, NY. It is easy to 
confirm that the mappings T(x, y) = min(x, y) and T(x, y) = xy satisfy the given 
conditions. We now prove they are the only such mappings. 

Using conditions (2), (5), and (4), we have 7(x,0) = T(0, x) = x*T(0,1) = 
x*0 = 0. Together with T(x, 1) = x, this determines T on the boundary of I x I. 
Now suppose 0 < x < y < 1 and apply the same three properties: 


T(x, y) = T(y, x) = y*T(1, x/y) = y*(x/y) = xy*. 
In other words, T(u, v) = min(u, v)[max(u, v)]*~! for all (u, v) in the interior of 
the square. Since this also holds on the boundary, it is valid on all of J X J. 
Finally, it suffices to show that the parameter k must be 1 or 2. Choose any y in 
[1 /2,1] and any x such that 0 < x < (1/2) min{1, y*~1,2*y}. From (1) we have 


Peab)ratal)  e 


Since x < 1/2 < y, we have T(x,1/2) = x(1/2)*~' and T(1/2, y) = y*—~'/2. In- 
serting these into (*) and using x < 2*~'y on the left and x < y*~+/2 on the right, 
the equality becomes 


1 k-1 1 
«(5 yk} = x(y*1/2)"" 


This reduces to k — 1 = (k — 1)?, which implies k is 1 or 2. 
Note that condition (3) was not used in the argument, which proves that it is 
superfluous. 


Solved also by the proposer and 78 other readers. 


An Inequality for Intertwined Exponentials 
E 3291 [1988, 872]. Proposed by Jerrold R. Griggs, University of South Carolina, 
Columbia SC. 
Suppose y > x > 0. Is it always true that y*” > x”? 


Solution by Byeong Soo Kim (student), Yonsei University, Seoul, Korea. Yes. It 
suffices to show that x’In y — y*Inx > 0. If x” > y*, then y>1 and hence 
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x*In y — y*Inx > y*In y — y*Inx > 0. If x” < y* and 0 < x < 1, then 
x?In y — y*Inx > x”Inx — y*Inx = (x” — y*)In x > 0. 


Hence we may assume x” < y* and x > 1. Since ylnx < xIn y, we have x”In y 
— y*In x > (x’y — xy*)(In x)/x, and it suffices to show that x”y — xy* > 0. For 
this it suffices to show In(x’y) > In(xy*), so let 


f(y) = In(x’y) — In(xy*) = yinx + In y — Inx — xIn y. 


Since f is 0 at x, it suffices to show f’(y) > 0 for y > x. But 


f(y) =Inx+(1-x)/Jy>iInx+(1 — x)/x =x f Intdt > 0. 
1 


Solved also by J. Anglesio (France), G. Baloglou, R. K. Bose, D. Callan, J. Dalbec, Z. Fiiredi 
(Hungary) and the proposer, S$. K. Ghosh (India), R. A. Groeneveld, K.-W. Lau (Hong Kong), J. 
McHugh, T. S. Norfolk, R. Schramm (Israel), R. Stong, N. S. Thornber, and M. Vowe (Switzerland). 
Partially solved by A. Lenskold and by I. Szalkai (Hungary). One incorrect solution was submitted. 


Open Subsets of Closed Sets 


FE 3294 [1988, 873]. Proposed by Daniel Goffinet, Saint Etienne, France. 


Characterize the topological spaces in which the following assertion is true: For 
each open subset G of a closed subset F there exists a set A with G as its interior 
and F as its closure. 


Solution by Jesus Ferrer, Universidad Complutense, Madrid, Spain. We show that 
a topological space X satisfies the required property if and only if X has a dense 
subset with empty interior. For necessity, simply take F = X and G = @, and the 
stated property ensures the existence of such a subset. 

For sufficiency, let D be an interior-empty dense subset of X. For each open 
subset G of a closed subset F, let A = GU (F — int(F)) U (DN F — G*), where 
int(Y) and Y* denote the interior and closure of Y. We obtain A* = G* U 
(F — int(f’)) U (DO F — G*)*. For every x © F such that x © int(F) — G* 
and every open neighborhood V of x, it follows that VN (DO F-— G*)>D 
(VO int(F) — G*)A D# ©. Thus x © (DN F — G*)*, and A* = F. 

Finally, for every x € int(A), we have x © GU(DO F-— G*). If x EDN F, 
let V be an open neighborhood of x contained in int( A). It follows that 


V=VNA=(VONG)U(VO (DN F-G?*)) 


and VG # @, since otherwise V Cc D. Thus x € G*, and int(A) N(DN F — G*) 
= @, so int(A) = G. 


Editorial comment. The stated condition is a natural one, since Euclidean spaces 
satisfy it. S. F. Barger and Z. Piotrowski point out that there is a literature on the 
further analysis of this condition, going back to E. Hewitt, Duke Math. J. 10(1943) 
309-333. K. Kunen, A. Szymanski, and F. Tall relate it to measurable cardinals in 
Ann. Math. Silesianae (2) 14(1986) 98-107. 


Solved also by S. F. Barger & Z. Piotrowski, S. Beslin & V. Holmes, D. Callan, O. P. Lossers (The 
Netherlands), C. H. Scott, J. H. Steelman, R. Stong, and the proposer. One incorrect solution was 
received. 
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A Binary Summation 


E 3303 [1989, 54]. Proposed by Donald E. Knuth, Stanford University, Stanford, 
CA. Suppose m and n are positive integers. Find a closed-form expression for 
2mn_ 4 
k™\ (7 4) 2%) 
ER, 


k=0 


where v(k) is the number of 1’s in the binary representation for k. 


Solution I by José Heber Nieto, Maracaibo, Venezuela. The value of the sum is 
(nm)! 
_4\tm S74 aynm(nm-1)/2 
(-1) 7 2 , 

Let S(n,r) = Le 0k"(-1)’". We first prove that S(n,r) = 0 for n>r> 0. 
This is obvious for r = 0. Suppose it is true for 0 < r < ¢ and consider n > t¢. Let 
a = 2"~! for clarity, so S(n — 1, r) = L8=h(-1)?°k"”. Then 

2a—1 a—1 
S(n,t)=S(n-1,t)+ ¥ k(-1)° = S(n-1,t) -— ¥ (k + a)'(-1)™ 
k=a k=0 
-1 


= S(n-1,1)- S(n-1, 0) — ¥ p(k)\(-1)™, 


where p(k) is a polynomial in k of degree less than ¢. By the induction hypothesis, 
this evaluates to 0. 

We next find a recursive formula for S(n, n). Breaking this sum into even and 
odd parts, we have 


a 


1 a—1 
S(n,n) = ¥ (2k)"(-1)° 4 OY (2k +:1)"(- 1)? 
k=0 k=0 
a—1 ; 
= 2"S(n —1,n) — 2"S(n-1,n) — Y (n2"-1k""} + g(k))(-1)"™, 
k=0 


where g(k) is a polynomial in k of degree less than n — 1. The result of this 
computation is S(n,n) = —n2""'S(n — 1,n — 1). 

From this we obtain S(n, n) = (—1)"n!2""—0/, Since (£") = k™"/n! + r(k), 
where r(k) is a polynomial in k of degree less than nm, we conclude that 


am S(nm, nm) (nm)! 
k —1)\%A) 2 Dk yy" nm(nm-—1)/2. 
) 1 = = 1 2 
k=0 \ M n! (1) n! 


Solution II by the proposer. The solution is stated above and is a special case of a 


formula in L. C. Hsu, Fibonacci Quarterly, 11(1973) 480-484. Hsu proved that if 
f(x) = ayx™ + +++ +a ,x + ag is any polynomial of degree at most N, then 


y (-1)9° "Flex, + oe teyxy) = (-1)*Nlayx,... Xp. 


Setting N = nm, f(x) = (2"), and x, = 2*~! yields the desired result. The formula 
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of Hsu given above is easily proved by induction on N. (We apply the formula for 
N — 1 to the polynomial f(x) — f(x + xy) = —ayNxyxN7 1+ ---.) 


Editorial comment. Eugene Levine and Adnah G. Kostenbauder found a general- 
ization of Knuth’s formula to base b. Let w be a primitive bth root of unity and let 
v,(k) be the sum of the digits of the base b representation of k. Then 


LD (ko _ (mn) bmnmn+D/2 
k=0 +" n\(w@ —1)™ 


This follows from the corresponding generalization of Hsu’s formula: 


N 
> wit" FevF(ex, +++ +eyxy) = | NiayX, °*° Xy. 


O<ag,..., Ey <b-1 wo-] 


David Callan noted that the case n =1 of the original problem was Putnam 
problem B-5 in 1984. 


Solved also by D. Callan, M. Davidson, M. N. Kolountzakis (Greece), A. G. Kostenbauder, V. S. 
Miller, E. Levine, P. Severa & O. Such, and N. Strauss (Brasil). 


Maximizing Symmetric Functions of Side Differences in Triangles 


E 3305 [1989, 55]. Proposed by M. S. Klamkin, University of Alberta, Edmonton. 


If a,b,c are the sides of a triangle with given semiperimeter s, determine the 
maximum values of 
(i) (b — c)? +(c — a)* + (a — b)?’, 
(ii) (b — c)(c — a)| + (ce — aya — b)| + (a — bY(b — |, and 
(iii) (b — c)*(c — a)*(a — b)?. 


Solution by David Callan, University of Bridgeport, Bridegport, CT. The maxi- 
mum values are 25°, s*, s°/108, respectively. Let a, b, c denote the sides in decreas- 
ing order. Note that 5/2 < b < s. Form a new (degenerate) triangle with the same 
perimeter and middle side, the new sides in decreasing order being s, b, s — b. In 
switching to the new triangle, no difference between side lengths decreases; in fact, 
each difference increases if the original triangle is non-degenerate. Hence each part 
of the problem is the maximization of a polynomial in b over [s/2, 5]. The 
polynomials are: 

(i) (2b — s)? + b* + (s — by’, 

(ii) b(2b — s) + b(s — b) + (s — b)\(2b — s), and 

(iii) (2b — s)*b7(s — b)?. 

These maximizations can be done by elementary calculus, giving maxima as listed 
above, occurring at b = s,s, (3 + 3!/*)s/6, respectively. 


Editorial comment. Most solvers treated the problem as a constrained maximiza- 
tion problem in several variables. 


Sotved also by J. Anglesio (France), E. Arnold. M. Barr (Canada), J. Fukuta (Japan), H. Guggen- 
heimer, W. Janous (Austria), V. Konecny, L. Kuipers, H. Lipman, O. P. Lossers (The Netherlands), A. 
Pedersen (Denmark), I. A. Sakmar (Turkey), J. H. Steelman, M. Vowe (Switzerland), Western Maryland 
College Problem Group, and the proposer. Partially solved by V. Schindler (E. Germany) and S. 
Arslanagié (Yugoslavia). Comments were received from two other readers. 
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ADVANCED PROBLEMS 


6628. Proposed by Robert A. Melter, Long Island University, Southampton, NY. 


Call a triangle a Heron Triangle if it has integer sides and integer area. Fermat 
showed that there does not exist a Heron right triangle whose area is a perfect 
square. However, the triangle with sides 9, 10, 17 has area 36. 

Prove that there are infinitely many Heron triangles whose sides have no 
common factor and whose area is a perfect square. 


6629. Proposed by W. E. Briggs, University of Colorado, Boulder, CO. 
Let o, be the ith elementary symmetric function of n positive numbers. 
(1) Prove that 
2 3 2 .2 
3(6;_19;42 + 6,0;,1)° > 4(6;_10;,, + O;_ 10/4. + 0;0;42) 

forl<i<n-l. 

(ii)* Is it true that 

0; 19/42 + 0/0;,,3 + O41 > F)41( 0) 19543 + 26;6;42) 

fori<i<n-1? 

(ili)* Is | it true that 

2( 2 2 2 
0; (9; — 0;-19)41) > a7 (0.1 — 0:0;42) 


forl <i<n- 1)? 


6630. Proposed by David Pagus, Portland, OR. 


Suppose that a,, a5, a3,... are positive real numbers such that 


co 
); a, log(1 + 1/a,) < oo. 
n=1 

(i) If {b,}°°, is an arbitrary sequence of real numbers, prove that the series 


oa) 


» a,,|x ~~ b,|~* 


n=1 


converges for almost all real x. 


(ii) If k is a fixed positive integer and {b, }vc_, 1s an arbitrary sequence of vectors 
in R*, prove that the series 


oe) 

— T —k 
DL a,llx — Bl 
n=1 


converges for almost all vectors x in R“. Here || || is the usual Euclidean norm on 
vectors in R*. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Odd Integers with Relatively Large Divisor Sum 


6555 [1987, 800]. Proposed by Moshe Laub, Jerusalem, Israel. 


Let B be the set of natural numbers n such that o(2n + 1) > o(2n), where o(k) 
is the sum of the positive integral divisiors of k. Let B(x) be the number of 
elements of B not exceeding x. Prove that there exist constants A and pw with 
0 <A <p <1 such that 


Ax < B(x) < px 
for all large positive x. 


Solution by L. E. Mattics, University of South Alabama, Mobile. We show that 
x/3000 < B(x) < 25x/28 for all sufficiently large x. We use the following lemma. 


LemMMA. If h and k are given coprime positive integers and x > 2, then 


o(n) «mx 1 
» -Finli- + O(log x). 
n=h(modk),n<x | 6 K pik p? 
Proof. If x is a character modulo k, then 
dL x(n)o(n)/n = Yi x(n) di 1/d 
nsx nsx d\n 
=)l/d Db x (7) 
d<x n<x, n=O(mod d) 
=) x(d)/d » x(m). 
d<x m<x/d 


If x is non-principal, this gives 
Y x(n)o(n)/n = o| > 1/4] = O(log x). 
n<x d<x 


If x is the principal character modulo k and if a dash indicates that a summation is 
restricted to integers relatively prime to k, we have 


Extdolyn= DGD 
Pr ware) 
- oe { EF + 0| - + O(log x) 
7 2) T(t ~ =| + O(log x) 


Multiplying these results by x(h) and summing over all the residue characters x 
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modulo k, we obtain 


y a(n) 1 at ) 


wx(h) Le x(n) 


n<x, n=h(mod k) n ~ $(k) & n<x 
oT 
=x—J]]1-—] +00 
atl ° tog») 


Thus the lemma is proved. 


To get an upper bound for B(x) we take k = 2 in the lemma and obtain 


o(2i +1) _ a(n) 


or 
=~ x + Ollog x). 
241 g * + O(log x) 


n 


O<i<x n=1(mod 2), n<2x+1 


Since 7” < 10, this implies that there is an N such that 


o(2i+1) 5 
———— <-x-2 f ; 
> aa 4* orx >N 


l<i<x 
Since o(2i + 1) < o(2i) for 2 <i < 6, we note that if i © B and i # 1, then 
o(2i+1) o(2i) 2i 314 7 
—_—_—— > > 
2i+1 2i 2i+1 #215 5 


Hence 


7 o(2i+1) 5 
p(B) < 2 yy sae 


so that 
B(x) < 25x/28 forx > N. 


We get a lower bound for B(x) by considering the intersection of B with the 
arithmetic progression n = 7(mod 30). Let a(x) be the number of natural numbers 
n such that n < x, n © B, and n = 7(mod 30), and suppose an asterisk indicates 
that a summation is restricted to the integers congruent to 7 modulo 30. The lemma 
gives 


oi) 3 5, ali) 
l<i<x 2i 2 l<i<x i 
3 x mr 1 1 
-sh6 alt ale 5 + O(log x) 
An? x 
— 3539 t O(log x) 
Since 7” < 99/10, this implies that there is an M such that 
o(2i) 198 x 
> *ps30 7 forx > M. 


l<i<x 
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Now if i = 7(mod 30), then 27 + 1 is divisible by 3 and 5 and so, if i is not in B, 
o(2i) of2i+1) 46 8 


2i 2+1 °35 5° 
Thus if x > M, we have 
8 x o(2i) 198 x 
~( > -2- < * ——<-—.—-4 
5 | 30 a(x)] ue 2i 125 30 
and hence 
8 x 198 x 
a —— 9 < Tz 
5 {30 ~ (*)) < a35 36 


so that B(x) = a(x) > x/3000. 


Editorial comment. Adolf Hildebrand remarks that, using the methods of proba- 
bilistic number theory, one can show that B(x)/x has a limit as x — oo. 


Solved also by P. Erdés and C. A. Spiro jointly. 


Expansion by an Iterated Mapping 


6566 [1988, 150]. Proposed by Jonathan E. Spingarn, Georgia Institute of Technol- 
ogy, Atlanta. 


Let r: R* > R* bea rotation about the origin by an irrational multiple of 7. Let 
f: R* > R? be defined by 
f(x, y)=(%,y) ifxsl, f(x, y)=(2--x,y) ifx>1. 
Does there exist (Xo, ¥y) © R* such that 


ref)" (x9, Yo) > 1 


for all positive integers k? (Here (re f)* denotes the kth iterate of ro f and || || 
denotes the ordinary Euclidean norm.) 


Solution by Nicholas Passell, University of Wisconsin, Eau Claire. Yes, there are 
many such points. In particular, let C be a circle of radius R such that 1 < R < 2 
and R is a number not constructible with straightedge and compass from the unit 
length and ||r(1,0) — (1,0)||. An open dense subset S of C is eventually carried to 
the interior of the unit disc, and the endpoints of the arcs of S are carried to the 
circle. However, the complement T of S in C is homeomorphic to the Cantor set 
and its non-endpoints never reach the disc. For the proof, define 


F, =r *o fork. 
Then 
(ref) =r" lo FoF 10 se of. 


Here F, is the folding toward the disc along the tangent to r4(1, 0). Since r”*! is an 
isometry, we may as well work with the sequence of compositions of planar foldings 
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G, where 
G,= F,° F,_,° ++: oF). 

Let B denote the interior of the unit disc, S, that part of C taken into B by G,,, and 
T,, the rest of C. Set 

S=US,CC 
and 

T=CO0S’; 
thus T is the complement of S in C. Clearly S is open in the relative topology of C. 
We need to show that S is dense in C and that T has no isolated points. The image 


under G, of C is a piecewise circular curve. If p, p, is a maximal arc in T,, then for 
some k > n we have 


r~"(1,0) € G,(pi)G,(p2) on Bd(B). 


Hence p,p, is not contained in T,. Furthermore, r~*(1,0) is neither G,( p,) nor 
G,( p,) since otherwise R would be constructible. Consequently p, and p, are 
endpoints of arcs of positive length in 7; for all j = n, and for all n the set S, is a 
finite union of open arcs which have no common endpoints. Thus, S' is dense in C 
and T has no isolated points. Since any compact, perfect, totally disconnected set in 
R (or C) is homeomorphic to the Cantor set, we are done. 


Editorial comment. This was solved also by the proposer, who indicated that 
certain generalizations to sequences of foldings in spaces of arbitrarily high dimen- 
sion are forthcoming in a paper by J. Lawrence and J. Spingarn, “On iterates of 
nonexpansive mappings and foldings.” 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


The Turing Omnibus: 61 Excursions in Computer Science. By A. K. Dewdney, 
Computer Science Press, W. H. Freeman and Company, New York, 1989. 
xiv + 415 pp. 


MARK BRIDGER 
Mathematics Department, Northeastern University, Boston, MA 02115 


It is not clear exactly when people began referring to computer science as a 
“field” in the sense that mathematics, biology, and physics are fields. In fact, the 
question of whether it is a separate field at all is of some academic interest, since 
computer science professors have been housed in the engineering, mathematics, and 
physics departments of quite a few institutions. Many people—both within and 
without computer science—would say, when asked for a definition: “I don’t know 
exactly what computer science is, but I know it when I see it (usually).” 

A. K. Dewdney, the editor of the “Computer Recreations” column in Scientific 
American, has now provided us with a useful survey of some of the topics which he 
considers computer science, or at least those which computer scientists should know. 
The Turing Omnibus 1s aptly titled; it is a sight-seeing trip through computer 
country, including significant side excursions into some of the regions of com- 
putability theory analyzed by A. Turing (and others). In all, Dewdney has written 61 
short essays, the average being 5-6 pages. These are divided into 11 groups: 
Analysis of Algorithms, Artificial Intelligence, Complexity Theory, Data Structures, 
Logic Design (hardware basics), Coding and Cryptography, Computer Graphics, 
Automata and Languages, Theory of Computation and Applications (CAT scans, 
linear programming, etc.). There is also a chapter of Miscellany which includes 
material on neural nets, perceptons, number systems, recursion, analog computers, 
and self-replicating machines. 

It is impossible in this space to summarize each essay, but descriptions of some of 
the major groups will give the flavor. 

In one excursion, Dewdney describes the four basic classes of automata (theoreti- 
cal computing machines): finite, pushdown, linear bounded and Turing machines. 
He shows how their specifications impose structural conditions on their languages, 
leading to the four language types: regular, context free, context sensitive and 
recursively enumerable. He, thereafter, concentrates on the most powerful automata, 
the Turing machines, and uses them to look at various aspects of theoretical 
computability: deterministic polynomial time algorithms, NP and NP-complete 
problems, noncomputable functions (e.g., “busy beaver”) and nonsolvable problems 
(e.g., halting). 

In another byway we look at divide-and-conquer algorithms: mergesort, heapsort, 
fast multiplication, and fast Fourier transforms. There is also an essay on parallel 
processing. The algorithms are analyzed mathematically; for example, Dewdney 
shows that both mergesort and heapsort have speed O(n log(n)), while parallel 
processing can reduce matrix multiplication from O(n’) to O(log(n)). 
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The tour of Logic Design begins with a description of the basic “gates” —the 
simplest building blocks of computer circuitry. Later essays show how these are 
combined to form multiplexers, encoders and, ultimately, a “SCRAM”: Simplified 
Complete Random Access Computer. The section ends with a quick tour of VLSI 
chip technology. 

Computer graphics is somewhat underrepresented by essays on spline curves, 
Quad trees, and Sierpinski curves (this last is more a discussion of recursion than 
graphics). It would have been nice to see something here on ray-tracing, fractal 
imaging or hidden surface algorithms. 

One interesting feature of the book is that the 11 tours are not presented as 
separate sections of the book, but each is arranged sequentially into subtours, which 
are then “shuffled” together. As the author puts it: ““The mixing is deliberate. A 
chapter preparing in some important way for another will always precede it; but 
between the two, a sequential reader will discover a number of other chapters on 
quite different topics... . These show some of the crosslinks that actually exist in 
the subject.”” This idea must be termed a partial success. On one hand, the forced 
variety made the book easy to read—if one essay didn’t strike my mood or interest, 
the next one often did. Sometimes it was hard to put the book down, since the effect 
was a kind of enforced browsing, made guilt-free since, after all, it was the author’s 
intention. 

On the other hand, Dewdney is not a very consistent writer. Some essays are 
written in an intuitive, informal style, while others are quite professorial and formal. 
For example, there are several essays centered around Nondeterministic Polynomial 
time (NP) problems. The first, dealing with the satisfiability problem, is quite 
understandable. However, the discussion of NP vs. NP-complete problems is 
confusing, and the reasons point up some problems with the author’s style. First, the 
role of a nondeterministic machine in this context is simply to provide a guess which 
we assume to be correct (best possible case); thus, solving a problem means 
verifying a correct guess. Instead of just saying this, Dewdney writes: “If for all yes 
instances of the problem (and only for these) there is a guess which will cause the 
Turing machine to answer yes, then the program is said to solve the decision 
problem. It does so in polynomial time if there is some fixed polynomial that 
bounds the number of steps in at least one yes computation for each possible yes 
instance the Turing machine might encounter. In such a case, the problem is said to 
lie in NP.” While this is technically true, it is somewhat obscure, and certainly 
difficult to read. The problem is compounded by the author’s tendency to use terms 
before they have been defined; in fact, the chapter on NP-completeness does not 
contain a definition for the term, though hints are given. It is only in later essays 
that we may find out that problems are NP-complete if the deterministic solution of 
any one of them in polynomial time implies that every problem in NP is (determin- 
istically) solvable in polynomial time. The fact that these later essays are scattered 
throughout the remainder of the book just makes putting the pieces together more 
difficult. 

The reader should also note that there are some confusing typos; nothing serious, 
just don’t believe that every circuit is exactly correct as drawn, or that every checker 
move is legal (!). Dewdney also seems to miss the role that second derivatives play 
in splines, and that solving simultaneously for all spline coefficients is, in fact, O(n) 
(where n is the number of points to be fitted). Also, the essay on CAT scanning is 
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just a summary of the abbreviated account in Pavlidis’ book Algorithms for Graphics 
and Image Processing. Nevertheless, there are many more vistas—such as the 
excellent 6-page condensation of Gédel’s proof of the Incompleteness Theorem— 
than (permit me) billboards. 

At one time Dewdney intended this book to be accessible to the “educated 
layperson”; he even included a section on algorithms-as-recipes. Nevertheless, 
things turned out differently, and a reader without some training in theoretical 
science would have a difficult time indeed with most of these essays. However, 
technically oriented people, especially mathematicians, computer science students 
and professionals, will have a challenging and rewarding time surveying the land- 
scape presented, and trying some of the inspirational exercises. Travel broadens, it is 
said, and nearly anyone will learn something from The Turing Omnibus. 

Upon finishing the book it occurred to me that 25 years ago many of the topics 
Dewdney has included would have been considered exclusively in the province of 
mathematics or philosophy. These include Boolean algebra, graph theory, splines (as 
part of curve fitting), Fourier transforms (fast and otherwise) and the predicate 
calculus. In fact, the whole methodology of modern abstract computer science has 
been borrowed from the abstractions of formal logic and, more generally, theoretical 
mathematics. However, computer science is no more just mathematics than, say, 
physics is. Reading papers in modern theoretical physics is a lot like reading 
mathematics, but that doesn’t make physics a branch of mathematics. Physics starts 
with an attempt to model something given: the real world; this informs and shapes 
the mathematics used and created by the physicist. Similarly, computer science is 
ultimately bound to the questions: What problems can a computing machine solve, 
and how can the machine be designed to solve them efficiently? The flavor of the 
mathematics that is used to answer these questions is quite a bit different from that 
constructed by the “pure” mathematician or even the theoretical physicist. Finite- 
ness, computability, and efficiency are the criteria which govern the direction of 
computer science research, and developments in this new field have profoundly 
influenced some of the directions taken in its more mature relatives. 


A Course in Constructive Algebra. By R. Mines, F. Richman, and W. Ruitenberg, 
Springer-Verlag Universitext, New York, 1988. xi + 344 pp. 


MICHAEL J. BEESON 


Department of Mathematics and Computer Science, San Jose State University, 
San Jose, CA 95192 


What is Constructive Algebra?. Elementary algebra is full of algorithms: factor- 
ing polynomials, solving equations, inverting matrices. Advanced algebra is full of 
non-constructive arguments, for example “constructing” the algebraic closure of a 
field by Zorn’s lemma. The question arises: to what extent can algebra be made 
constructive? If we are careful, can we eliminate things like Zorn’s lemma that don’t 
correspond to algorithmic constructions, or are they essential to the mathematical 
theory? The book under review contains the answers. 

The word “constructive” is not always used in the same way by different 
mathematicians. I shall therefore begin by explaining what I mean by it in this 
review. First, by an algorithm I understand a procedure that could be programmed 
for a computer, but without regard to questions of efficiency. For example, 
Kronecker gave algorithms in 1881 for factoring polynomials in several variables 
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over the integers. These algorithms are hopelessly inefficient, and much better 
algorithms are known today; but they do nevertheless count as algorithms. Second, 
by a “correctness proof” of an algorithm, I mean a proof that if the input data 
satisfy certain conditions, then the algorithm terminates (produces output) and the 
output data satisfy certain conditions. For example, a factoring algorithm accepts as 
input data a polynomial with integer coefficients, and terminates producing a list of 
polynomials, satisfying the condition that their product is the input polynomial. 

With the concept of an algorithm understood, we may call a piece of mathemat- 
ics constructive if it consists of 

(1) definitions of certain data structures (e.g., polynomials over the integers) 

(2) definitions of certain algorithms over those data structures 

(3) correctness proofs of those algorithms. 

The above cannot be taken as a definition of “constructive”, because it would 
then be circular; the correctness proofs in (3) have to be constructive themselves. 
Rather, it is intended as an aid to the intuition. The concept of “constructive proof” 
is both as simple and as difficult to define as the usual concept of “correct proof.” It 
is not usual to begin teaching mathematics by defining the notion of “proof”; nor 
can that really be done, as we have no more primitive concept in terms of which to 
couch the definition. The same goes for “constructive proof.” About such funda- 
mental philosophical matters, we shall say no more in this review. 


Constructive algebra versus ordinary algebra. Until the early twentieth century, 
the predominant view of most mathematicians was that mathematics consists of the 
three steps outlined above as “constructive mathematics.” The work of Hilbert 
broke new ground by making extensive use of existence proofs by contradiction, 
which nobody knew how to fill in by algorithms. Similarly, Zermelo’s non-construc- 
tive proof of the existence of a well-ordering of the reals created a stir in 1906. The 
widespread acceptance of existence proofs not accompanied by algorithms did not 
really begin until about 1930; so we only have a half-century of non-constructive 
mathematics. Nevertheless, it has been a very productive half-century, so the effort 
to consolidate its gains from the constructive viewpoint must be considerable. 

This consolidation was begun by Errett Bishop in his 1967 book, Foundations of 
Constructive Analysis. He deliberately did not attempt to constructivize modern 
algebra. Since 1967, then, algebra has been crying out for a systematic constructive 
treatment like that given to its sister. The book under review is a sister book to 
Bishop’s, doing for algebra what Bishop did for analysis. It differs from Bishop’s 
book in one way: there has been a deliberate attempt to make it usable as a 
textbook. 

The authors are to be congratulated for their success in providing an algebraic 
companion to Bishop’s volume. No doubt Bishop would be pleased with their work 
and very much appreciate the dedication to him. 


Constructive algebra versus algorithmic algebra. Today nearly every mathemati- 
cian has at least heard of computer programs that carry out symbolic computation, 
such as MACSYMA, Mathematica, MAPLE, CAYLEY, and SCRATCHPAD. 
These programs embody steps (1) and (2) above for certain predefined data 
structures and algorithms, and to some extent (particularly in SCRATCHPAD) they 
allow the user to define new data structures and algorithms. The development and 
use of such programs has awakened new interest in constructive mathematics. This 


1990] REVIEWS 359 


turn in the way mathematics is viewed is not really a new direction, but a return to 
tradition, as discussed above, although on the time scale of one human life it may 
appear to be a new direction. 

By “algorithmic mathematics,” I mean something similar to constructive mathe- 
matics, but with a difference: in algorithmic mathematics, we are concerned with the 
actual implementation of the algorithms. Thus Kronecker’s factorization algorithms 
are constructive, but not of interest for algorithmic mathematics; for actual imple- 
mentation, we need the modern factorization algorithms (see Knuth vol. 2 for an 
introduction). The number of people who are actively using computerized symbolic 
computation is in the thousands; and several hundreds are involved in research. By 
contrast, the community of people who have read and understood the work of Errett 
Bishop probably numbers not more than a few hundred, while about ten are 
involved in research. 

I think it is safe to say, moreover, that the intersection of these two groups is very 
small. The number of people who could give you a coherent explanation of Bishop’s 
definition of real numbers and also describe what Buchberger did about Grdbner 
bases! (this is fundamental work in algorithmic algebra) is vanishingly small. (Of 
course it’s impossible to know the exact number, but I could not name ten such 
people.) This is, in my view, an unfortunate situation. 

There would be benefits all around from more communication. The followers of 
Bishop ought to read the literature in symbolic computation. The algorithmic 
algebraists could benefit considerably from reading both Bishop and the volume 
under review.* The classical algebraists should read both, and play with the 
symbolic manipulation programs. Everybody should listen more and argue less. 

The book under review appears to have been intended, not for the mathemati- 
cians involved with algorithmic algebra, but for the ordinary mathematician, per- 
haps while teaching a course in algebra. The authors hope that such mathematicians 
will consult the book while preparing their lectures, or even use it as a textbook. It 
seems to be well-written for such a purpose. 

On the other hand, in 1988 twenty-one years had passed since Bishop’s book, and 
during those twenty-one years the “computerization of mathematics” has begun. 
Bishop himself was interested in the actual computerization of mathematics. I do 
not think it an unfair criticism to say that in 1988, it would have been appropriate 
to take the efficiency of algorithms into account while writing a volume on 
constructive algebra. If the algorithmic algebraists do read the volume under review, 
they will not find a systematic development of the mathematics that underlies their 
algorithms, but instead a systematic development of a constructive-in-principle-but- 


"A Grobner basis for a polynomial ideal is a finite set of polynomials g, such that the rules f > 
f mod g, form a terminating, confluent set of rewrite rules, i.e., if we apply these rules in any order 
repeatedly to a polynomial f, we always arrive at a unique stopping point. What Buchberger did was give 
an algorithm for the construction of a Grébner basis. Once we know how to construct a Grdbner basis, 
many other questions about polynomial ideals have easy algorithmic solutions. See Buchberger and Loos 
(1982), p. 37; and trace the bibliographical references in that volume for more information on algorithmic 
algebra. 

71 understand there is a forthcoming book by J. Davenport on the subject of algorithmic algebra; I do 
not know its title or projected date of publication, but it should be a good companion to the book under 
review. 
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not-in-practice kind of algebra. I maintain that this would still be a good and useful 
thing for them to read; whether they will perceive the matter in this way, I doubt. 


The flavor of the subject. Now let me turn to a more detailed consideration of 
the contents of the book. My aim is not to outline the book, but rather to provide a 
few examples of the kind of considerations that arise when one does algebra 
constructively. 

The first distinction to be made is that between discrete structures and not 
necessarily discrete structures. A set is called discrete if we have an algorithm for 
deciding equality on the set. This definition presumes the constructive viewpoint in 
which one manipulates data structures representing elements of an abstract set. For 
example, real numbers are given by programs for generating arbitrarily close 
rational approximations. The reals are an example of a non-discrete structure: there 
is no algorithm for deciding if a program represents the real number zero or not.? So 
far, in algorithmic algebra only discrete structures have been used; and for good 
reason, as the constructive theory of non-discrete algebraic structures is fraught with 
difficulties. In order not to obscure the overall picture, the authors have chosen to 
treat primarily the case of discrete structures. This was a wise choice: the subtleties 
of the non-discrete theory would have exhausted the patience of most readers.* 
Among the discrete structures are all finite structures, the rationals, polynomial 
rings over discrete coefficient rings, and the algebraic numbers. Among the non-dis- 
crete structures are the reals and the complex numbers. 

In addition, the systematic theory must be prepared to cope with a structure like 
the field R(ia), obtained from the reals by the adjunction of a purely imaginary 
number ia. The difficulty here is that we have no algorithm to determine (as a 
function of the real number a) whether this field is equal to R or to C, since we 
can’t tell if a is zero or not. Such fields do not arise in algorithmic mathematics, 
which has not yet considered algorithms whose input and output are algebraic 
structures. All the algorithms so far implemented deal only with fixed groups, rings, 
and fields, and manipulate their elements only.° 

Now, for example, the Euclidean algorithm for the greatest common divisor 
(perhaps the most fundamental algorithm in algorithmic algebra) finds its proper 
context in Theorem 5.6 of the book: it works in a discrete commutative ring with 
“recognizable units.” This means that there is an algorithm for testing whether an 
element is a unit or not. 

The next most fundamental algorithm is factorization. On p. 109 of the book we 
find the fundamental definition (here rephrased): an element p is irreducible if we 
have an algorithm to determine (given input a and b) that one of a or b 1s a unit, 
and which one. A UFD is then defined as a discrete domain R with a factorization 
algorithm, i.e., an algorithm for determining of a nonzero element r either that r 1s 


*Computer implementations of “real numbers” are usually either of rational numbers with a fixed 
bound on the denominator (“floating point’) or of rational numbers without a fixed bound on the 
denominator (“‘bigfloats”). Computer mathematics has yet to catch up to Weierstrass. 

“An example of the complications: In the non-discrete case we want to consider, e.g., groups with an 
apartness relation. Under what circumstances can we carry over an apartness relation to a factor group? 
For example, there is no natural apartness on R/Q, the reals mod the rationals. 

“Algorithmic mathematics has yet to catch up to Noether and Hilbert, as well as Weierstrass. 
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a unit, or for producing a factorization of r into irreducibles.° The famous theorems 
of Kronecker (1881) find their statements as follows: Let R be an infinite UFD with 
finitely many units. Then R[x] is a UFD. Moreover, if R[x] is a UFD, so is 
R[x, y]. That is, factorization of elements implies factorization of polynomials, even 
in several variables. In algorithmic algebra, we would go on to look for efficient 
factorization algorithms. In constructive algebra, we go on instead to look at the 
dependence on the data types: can we remove the hypothesis that the ring R be 
infinite? After all, in the case of a finite ring, there are only finitely many divisors of 
a given polynomial, so in principle we could examine them all; hence in case R is 
finite we are done. But we don’t have an algorithm to determine from the 
presentation of a ring (e.g., as a factor ring) whether it is finite or not; so we can’t 
make the case distinction between the search algorithm (in the finite case) and 
Kronecker’s algorithm (in the infinite case). (See p. 126 of the book.) 

In Chapter V of the book, there is a nice treatment of the decomposition theorem 
for modules over a PID, with applications to Jordan canonical form and later, in 
Chapter VI, to Galois theory. In Chapter VI, there is a construction of a splitting 
field for a nonzero polynomial over a countable discrete field. The construction of 
splitting fields is a tricky business, since we may not be able to factor polynomials. 
If f is a polynomial, then k[X]/(f) is only a ring; it is a field only if f 1s 
irreducible, and the classical technique of working with an irreducible factor of f is 
not constructive, since we may not have an algorithm for finding such a factor. 
Instead, we consider k[X]/M, where M is a maximal ideal containing (/), 
constructed using an enumeration of the base field k (bottom of page 151 in the 
book). Without the assumption that k is countable (and the enumeration algorithm 
implicit in that assumption) we can’t construct a field in which f has a root. This 
construction does not support a proof of uniqueness of the splitting field, and 
indeed Exercise 4 (p. 153) shows that uniqueness is not constructively true. 

Similarly, the construction of an algebraic closure of a discrete field k relies on 
countability. The classical existence proof of an algebraic closure of a field k 1s 
highly non-constructive, appealing as it does to Zorn’s lemma or a transfinite form 
of the axiom of choice. There are interesting cases still open, between the theorem 
just mentioned and the classical but non-constructive proof. For example, what 
about the field C(t) of rational functions with complex coefficients? Can you 
construct its algebraic closure? (If so, write the reviewer.) 

Questions about the zeroes of polynomials have a central place in algebra, 
starting with the fundamental theorem of algebra. A number of constructive proofs 
of the fundamental theorem of algebra are known (see the Historical Appendix of 
Beeson (1985)). A purely algebraic proof, using Galois theory and the existence of 
2-Sylow subgroups of finite groups, is given in the book. 


°I have phrased these definitions in terms of algorithms to avoid the usual squabbling about the - 
meaning of “there exists.” The authors, like Bishop and his predecessors back to Brouwer, use “there 
exists” as a shorthand for assertions about algorithms which are implicit in the discussion. Other people 
seem unable to accept this redefinition of the phrase “there exists,” even for the moment by “willing 
suspension of disbelief,” and instead of considering the mathematics, argue about the phrase. Worse, the 
constructivists argue back! claiming, for example, that it isn’t a redefinition but a definition, the phrase 
never having been properly defined. In this review I shall avoid the whole linguistic problem by speaking 
directly of the algorithms instead. 
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An interesting result about factorization is this: Call a field k fully factorial if we 
have a polynomial factorization algorithm for every finite extension k(a,,..., a,,). 
(Here the a, are inputs to the algorithm.) Can we characterize the fully factorial 
fields? It turns out that we can: the right condition is “Seidenberg’s condition P”’: 
there must be an algorithm for deciding whether a finite set of linear equations over 
k is solvable in k? (the set of pth powers in k), and finding the solution if it exists. 

Condition P is closely connected to Wedderburn’s theorem as well, as we find 
out several chapters further on (p. 244): a discrete field satisfies condition P if and 
only if each finite-dimensional k-algebra has a finite-dimensional radical. That is, if 
we have an algorithm for passing from a basis for the k-algebra to a basis for the 
radical. The book goes on to give a constructive version of Wedderburn’s structure 
theorem for semi-simple algebras. The next chapter gives a constructive treatment of 
Ulam’s theorem on the structure of Abelian groups. 

All the chapters from Jordan canonical form onwards have this feature: the main 
difficulties arise because of the dependence of algorithms on the (presentation of 
the) ring or field, rather than on the elements. The algorithms involved, to the extent 
that they go beyond the usual undergraduate curriculum, are unusual in that they 
make use of the hypothesized presentation of the field, as in the constructions of 
splitting fields or algebraic closures described above. In the last chapter, on 
valuation theory, the algorithmic algebraist will feel at home again. Here we meet 
our old friend, Hensel’s lemma, and the wonderful corollaries of the algorithm 
embodied in Hensel’s lemma. As it happens, the efficient modern polynomial 
factorization algorithms depend on Hensel’s lemma and on factorization in finite 
fields, using the GCD algorithm in the finite field. No mention is made in the book 
of this amazing connection between the last chapter and the early chapters (for 
which see Knuth vol. 2), but the systematic constructive theory of valued fields is 
nicely developed. 
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Berstein, Eugene Tomer. Chelsea, 1989, x + 257 
pp, $23.95. [ISBN: 0-8284-0331-7] Almost peri- 
odic functions are (as defined here) uniform limits 
of trigonometric polynomials; hence they admit suit- 
able versions of harmonic analysis, differential equa- 
tions methods, etc. This slightly revised edition of 
the first English edition, published in 1968, covers 
elementary theory and applications (TR, June-July 
1969). Style is leisurely, readable, and essentially 
self-contained, assuming only beginning graduate- 
level analysis. Includes a colossal bibliography, up- 
dated to around 1987. PZ 


Functional Analysis, T(16-17: 1), L. Anal- 
yse Fonctionnelle. M. Samuelides, L. Touzillier. 
Cepadues-Editions, 1989, 289 pp. [ISBN: 2-85428- 
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233-7] A concise and careful introduction intended 
for engineers. Includes exercises and a chapter on 
the history. JD-B 


Analysis, P. Orthogonal Polynomials and Their Ap- 
plications. Ed: Jaime Vinuesa. Lect. Notes in Pure 
& Appl. Math., V. 117. Marcel Dekker, 1989, x 
+ 207 pp, $99.75 (P). [ISBN: 0-8247-8161-9] Pro- 
ceedings of the International Congress on Orthogo- 
nal Polynomials held in September 1987 at Laredo, 
Spain. LC 


Analysis, T(17: 1), S. Point Set Theory. John 
C. Morgan II. Pure & Appl. Math., V. 131. Mar- 
cel Dekker, 1990, viii + 279 pp, $99.75. [ISBN: 0- 
8247-8178-3] A unified treatment of Baire theory, 
Lebesgue theory, and point set topology which re- 
quires a background in all three for full appreciation. 
Written in concise definition-theorem-proof-example 
style. Contains extensive bibliography, but no ex- 
ercises. Over 50% discount on price for classroom 
use. GG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1399: Arithmetic of Complez Manifolds. Eds: 
W.-P. Barth, H. Lange. Springer-Verlag, 1989, 170 
pp, $16.20 (P). [ISBN: 0-387-51729-4] A collection 
of twelve papers presented at a 1988 conference in 
Erlangen. SG 


Geometry, P. Recent Developments in Geometry. 
Eds: S.-Y. Cheng, H. Choi, Robert E. Greene. Con- 
temp. Math., V. 101. AMS, 1989, xiii + 338 pp, 
$39 (P). (ISBN: 0-8218-5107-1] Out-growth of the 
special session on geometry at the AMS meeting at 
UCLA in November 1987. Purpose is to provide ge- 
ometers with insight into recent developments in a 
variety of different specialties in geometry. Papers 
are surveys of relatively broad areas of geometry, not 
presentations of new results as such. JDEK 


Operations Research, S(15-16), C, P. Ezplor- 
ing Operations Research and Statistics in the Micro 
Lab. Erwin Kalvelagen, Henk C. Tijms. Prentice- 
Hall, 1990, ix + 192 pp. [ISBN: 0-13-297334-0] The 
textbook and accompanying software package (for an 
IBM PC or compatible) is designed to be used as a 
laboratory supplement to courses in operations re- 
search and statistics. Emphasizes model formulation 
and the interpretation of computer-generated solu- 
tions. Topic areas include linear and integer pro- 
gramming, dynamic programming, and probability 
and statistics (the law of large numbers, simulation, 
and the central limit theorem). AO 


Operations Research, P, L. Knowledge-Based 
Process Planning for Construction and Manufac- 
turing. Carlos Zozaya-Gorostiza, Chris Hendrick- 
son, Daniel R. Rehak. Academic Pr, 1989, xi + 
306 pp, $39.95. [ISBN: 0-12-781900-2] A mono- 
graph describing the knowledge-based process plan- 
ner PLANEX, a software system capable of aiding 
the planning of very diverse projects. While the au- 
thors focus on the construction of buildings, the sys- 
tem is flexible enough to handle bridges, highways 
and, as illustrated with an example, manufacturing 
operations for automobile electrical wire harnesses. 
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The book is intended for two audiences. First, devel- 
opers of practical process planning tools, and second, 
researchers of planning systems. SM 


Operations Research, T(15-16), S, L. Mathe- 
matical Modelling: A Case Study Approach. Dick 
Clements. Cambridge U Pr, 1989, vili + 166 pp, 
$44.50. (ISBN: 0-521-34340-2] A two-part textbook 
of mathematical modelling. Part 1 consists of a brief 
description of the modelling process, pitfalls to be 
aware of in applying mathematical methods to prac- 
tical problems, and a chapter on pedagogical con- 
cerns arising from a modelling course focused on case 
studies. Part 2 contains the seven case studies pre- 
sented in the form of internal corporate memos and 
blueprints. After each collection of memoranda, one 
possible solution to the problem is presented. SM 


Dynamical Systems, P. From Data to Model. Ed: 
Jan C. Willems. Springer-Verlag, 1989, 246 pp, $59. 
[ISBN: 0-387-51571-2] Five papers cover various as- 
pects of model selection and estimation procedures 
for time series data. RWJ 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-131: Control of Large Flezible 
Space Structures. S.M. Joshi. Springer-Verlag, 1989, 
ix + 196 pp, $30.30 (P). (ISBN: 0-387-51467-8] This 
research monograph discusses the unique challenges 
of controlling the very large and very fragile struc- 
tures which will be constructed in earth orbit in the 
reasonably near future. The light weight and expan- 
siveness of the structures will lead to extremely low 
frequency elastic modes. At the same time, highly 
accurate attitude controls are required. The book 
focuses primarily on the author’s research into these 
questions, with extensive references to the work of 
others. Indexed. SM 


Stochastic Processes, P. Asymptotic Methods 
in the Theory of Stochastic Differential Equa- 
tions. A.V. Skorokhod. Transl. of Math. Mono., 
V. 78. AMS, 1989, xvi + 339 pp, $119. [ISBN: 0- 
8218-4531-4] This book, translated from Russian, 
was the idea of the author and Iosif I]’ich Gikhman, 
who died before he could take part in the writing. It 
covers asymptotic problems for stochastic differential 
equations, including ergodic theory for Markov pro- 
cesses, solutions for systems of stochastic differential 
equations, stability theory, averaging, and linear sys- 
tems of equations. The coverage is very theoretical. 
Note the price. JJ 


Statistics, S(17), P, L. Rank Tests with Estimated 
Scores and Their Application. Konrad Behnen, 
Georg Neuhaus. BG Teubner Stuttgart, 1989, ix + 
416 pp, DM 54 (P). [ISBN: 3-519-02728-3] Devel- 
ops a new class of nonlinear rank tests to compare 
how two distributions differ. Includes location, scale, 
symmetric difference, and independence tests. RWJ 


Statistics, P. Group Invariance Applications in 
Statistics. Morris L. Eaton. Reg. Conf. Ser. in 
Prob. & Stat., V. 1. Institute of Mathematical Statis- 
tics (3401 Investment Blvd., No. 7, Hayward, CA 
94545), 1989, v + 133 pp, $25 (P). [ISBN: 0-940600- 
15-3] Lectures given at an NSF/CBMS Regional 
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Conference at the University of Michigan in June 
1987. RSK 


Statistics, S(17-18), P. Methoden zur Schdtzung 
der Ordnung bet autoregressiven Modellen. Hartmut 
Maier. Teubner-Texte zur Math., B. 113. BG Teub- 
ner Leipzig, 1989, 148 pp, DM 18,50 (P). (ISBN: 3- 
322-00737-5] 


Statistics, T(18: 1), S. Model Discrimination for 
Nonlinear Regression Models. Dale S. Borowiak. 
Statistics: Textbooks & Monographs, V. 101. Marcel 
Dekker, 1989, xi + 177 pp, $89.75. [ISBN: 0-8247- 
8053-1] Provides the practitioner with techniques 
and the statistician with basic theory in the area of 
model discrimination for nonlinear regression mod- 
els. Assumes a set of competing nonlinear regression 
models have been established for explaining empiri- 
cal data. Diagnostic statistics called discrimination 
functions are constructed and used on the empiri- 
cal data to help in selecting the best model. Pre- 
diction ability of the model and parameter stabil- 
ity are both considered. Models looked at from the 
view of social scientists and engineers. When initial 
model assumptions of normality are violated resam- 
pling techniques, such as jackknife, bootstrap, and 
simulation, are used. Basic knowledge of statistical 
theory (estimation, distributional theory, hypothesis 
testing) required. Applications can be used without 
understanding proofs. JJ 


Statistics, P. Handbook of Nonlinear Regression 
Models. David A. Ratkowsky. Stat.: Textbooks & 
Monographs, V. 107. Marcel Dekker, 1990, ix + 241 
pp, $69.75. [ISBN: 0-8247-8189-9] Divided into two 
parts. Part I gives background to and philosophy 
of nonlinear regression modelling. Part II presents 
the most commonly used nonlinear regression mod- 
els containing at most six parameters, together with 
the statistical properties of the estimators of the pa- 
rameters. Emphasis is on “close-to-linear” models, 
whose parameter estimators are close to being unbi- 
ased, normally distributed, minimum variance esti- 
mators. Note price. RSK 


Statistics, S(18), P. Bayesian Forecasting and Dy- 
namic Models. Mike West, Jeff Harrison. Ser. in 
Stat. Springer-Verlag, 1989, xxi + 704 pp, $56. 
[ISBN: 0-387-97025-8] Provides a broad introduc- 
tion to the modelling ideas and practices of Bayesian 
forecasting in dynamic model environments. Begins 
with the simplest and most widely used model—the 
first-order polynomial model. Goes on to cover re- 
gression models, dynamic linear models, and time 
series models, emphasizing sequential learning and 
forecasting in a Bayesian fashion. Also covers more 
advanced, recent developments in the field. Covers 
theory behind Bayesian forecasting and gives several 
applications. Extensive bibliography. Comments on 
computer programs FAB and BATS. JJ 


Statistics, T(18), S. Approzimate Distributions of 
Order Statistics With Applications to Nonparamet- 
ric Statistics. R.-D. Reiss. Springer-Verlag, 1989, 
xii + 355 pp, $59. [ISBN: 0-387-96851-2] Mathe- 
matically rigorous treatment of the asymptotic dis- 
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tribution theory of order statistics. Emphasizes accu- 
racy of limit theorems and higher order approxima- 
tions. Results formulated in terms of variational and 
Hellinger distance. Explores parametric approxima- 
tions of nonparametric models of joint distributions 
of order statistics, as well as nonparametric curve es- 
timation, the bootstrap method, and graphical meth- 
ods. Solid knowledge of probability and statistics re- 
quired. Knowledge of measure-theoretic probability 
necessary for much of book. Very theoretical. JJ 


Statistics, T(18: 1), S, P. Analysis of Binary 
Data, Second Edition. D.R. Cox, E.J. Snell. Mono. 
on Stat. & Appl. Prob., V. 32. Chapman & Hall, 
1989, xi + 236 pp, $39.95. [ISBN: 0-412-30620-4] 
Deals with the analysis of binary data (zero-one data 
in which an observation takes one of two possible 
forms—success or failure). Many methods used to 
model how the probability of success depends on ex- 
planatory variables are unified by considering models 
in which the logistic transform of the probability of 
success is a linear combination of unknown param- 
eters. This edition contains more details on regres- 
sion diagnostics and maximum likelihood fitting than 
the 1969 First Edition by D.R. Cox (TR, June-July 
1973). Good mix of theory, application, and exam- 
ples with real data. Assumes knowledge of statistics. 
Bibliographic notes in each chapter. JJ 


Programming, T(13-14: 1, 2). Pascal: Problem 
Solving and Program Design, Third Edition. Elliot 
B. Koffman. Addison-Wesley, 1989, xix + 837 pp, 
(P). [ISBN: 0-201-11834-3] Using smaller chapters, 
this new edition (First Edition, TR, June-July 1981; 
Second Edition, TR, January 1986) increases empha- 
sis on its earlier software engineering topics and adds 
new ones; uses data abstraction concepts through- 
out, presents additional theoretical aspects of com- 
puter science; adds exercises, examples, case stud- 
ies, programming projects. Suitable for CS1 and 
much of CS2 and for the programming course pre- 
requisite for ACM’s newly-proposed introductory CS 
sequence. DFA 


Programming, T(15-16: 1), S, P. Programming 
in ADA, Third Edition. J.G.P. Barnes. Intern. 
Comput. Sci. Ser. Addison-Wesley, 1989, xvi + 494 
pp, (P). (ISBN: 0-201-17566-5] An introduction to 
the entire language, by a member of the original Ada 
design team. Revised from the earlier editions (First 
Edition, TR, February 1983) to conform to the cur- 
rent interpretation of the language. Includes an en- 
larged language overview, some rearrangements and 
rewriting, some new sections, many additional exer- 
cises (with solutions), and examples. Tutorial style. 
Very readable. DFA 


Programming, S, P, L*. Smalltalk-80: The Lan- 
guage. Adele Goldberg, David Robson. Addison- 
Wesley, 1989, xvii + 585 pp, (P). [ISBN: 0-201-13688- 
0] A revised and updated version of Smalltalk-80: 
The Language and Its Implementation (TR, January 
1985). Most of the material on implementation of 
the language has been eliminated. AO 


Programming, S, P, L. Real World PostScript: 
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Techniques from PostScript Professionals. Ed: 
Stephen F. Roth. Addison-Wesley, 1988, xiv + 383 
pp, (P). [ISBN: 0-201-06663-7] A “forager’s book” 
of real opportunities of PostScript as a programming 
language, including applications to development of 
type fonts, half-toning and color, and four case stud- 
ies in major projects. Each chapter is individually 
authored; most include extensive chunks of working 
PostScript code as illustrations. Neither comprehen- 
sive nor systematic, but full of practical wisdom not 
available in standard reference manuals. LAS 


Programming, S. HP-28S Software Power Tools: 
Utilities. Solve and Integrate Corp. Grapevine Publ 
(460 SW Madison Ave., Suite 1, POB 118, Corvallis, 
OR 97339), 1989, 313 pp, $18 (P). [ISBN: 0-931011- 
27-2] A collection of 200 utility programs for the 
HP-28S symbolic and graphing calculator. Math- 
ematical examples: finding gcd and lcm; generat- 
ing primes and random numbers; converting formats 
of complex numbers and vectors; building, decom- 
posing, and editing arrays. Other utilities pertain 
to strings, lists, directories, output, and program- 
ming. LAS 

Programming, S, P, L. An Introduction to Object- 
Oriented Programming and Smalltalk. Lewis J. Pin- 
son, Richard S. Wiener. Addison-Wesley, 1988, xxvi 
+ 502 pp, (P). [ISBN: 0-201-19127] An introduc- 
tion to the principles of object-oriented program- 
ming, the Smalltalk programming environment, and 
the Smalltalk-80 programming language. Does not 
assume that the reader has any prior experience with 
Smalltalk or object-oriented programming. AO 


Programming, S, P, L. C++ Primer. Stanley 
B. Lippman. Addison-Wesley, 1989, xiv + 464 pp. 
[ISBN: 0-201-16487-6] A tutorial introduction to 
the programming language C++. Does not assume 
prior experience with C or C++. Describes Release 
2.0 of the language and the differences between Re- 
lease 1.2 and Release 2.0. AO 


Programming, T(13-14: 1), P. Fortran 82 Ez- 
plained, Revised Edition. Michael Metcalf, John 
Reid. Clarendon Pr, 1989, xv + 315 pp, $22.50 
(P). [ISBN: 0-19-853824-3] A presentation of the 
second draft, not yet adopted, of Fortran 8z, the 
proposed new standard containing Fortran 77 as a 
subset. Written in textbook style, as if Fortran 8x 
were in existence. Notes new features and proposed 
“obsolescent” features. FR 


Languages, P, L. Constraint Programming Lan- 
guages: Their Specification and Generation. Wm 
Leler. Addison-Wesley, 1988, vi + 202 pp, $10.95 
(P). [ISBN: 0-201-06243-7] Constraint languages 
are a relatively new programming paradigm. They 
are declarative in that they specify a desired goal 
rather than an algorithm to achieve that goal. 
This book describes a general-purpose specifica- 
tion language called Bertrand that can be used to 
produce constraint-satisfaction systems based on a 
new rewrite rule technique entitled augmented term 
rewriting. AO 


Languages, T(15-17: 1, 2), P, L. Elements of 
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Functional Programming. Chris Reade. Intern. 
Comput. Sci. Ser. Addison-Wesley, 1989, xv + 600 
pp. [ISBN: 0-201-12915-9] A comprehensive intro- 
duction to the concepts and techniques used in func- 
tional programming. Gives particular emphasis to 
the use of abstraction and strong polymorphic type 
systems. Standard ML is used in the examples. AO 
Algorithms, T(15-16: 1), L. Computer Algo- 
rithms: Introduction to Design and Analysis, Sec- 
ond Edition. Sara Baase. Addison-Wesley, 1988, xv 
+ 416 pp. [ISBN: 0-201-06035-3] Second Edition 
of this well-known textbook on algorithm analysis 
(First Edition, TR, February 1979). Additions in- 
clude a large number of exercises as well as material 
on adversary arguments and selection problems, dy- 
namic programming, and parallel algorithms. AO 


Computer Systems, S(15-17), P, L*. The De- 
sign and Implementation of the 4.3BSD UNIX Op- 
erating System. Samuel J. Leffler, et al. Addison- 
Wesley, 1989, xxii + 471 pp. [ISBN: 0-201-06196- 
1] Describes the internal structure (concepts, data 
structures, algorithms, etc.) of the widely used 
4.3BSD version of UNIX developed at the Univer- 
sity of California at Berkeley. Some exercises are 
provided to facilitate the use of this book as a sup- 


plement in a second course on operating systems de- 
sign. AO 

Computer Systems, P. Digital Typography: An 
Introduction to Type and Composition for Computer 
System Design. Richard Rubinstein. Addison- 
Wesley, 1988, xi + 340 pp. [ISBN: 0-201-17633-5] A 
broad introduction to the technical and aesthetic is- 
sues that arise in attempts to produce typeset-quality 
output with computer-driven equipment. The topics 
covered include a brief history of letterforms, the psy- 
chology of reading, characteristics of various output 
devices, principles of font design and page layout, 


and the relationship between screen and paper im- 
ages. AO 


Computer Science, T(18), P. Algebraic Specifi- 
cation. Eds: J.A. Bergstra, J. Heering, P. Klint. 
Addison-Wesley, 1989, xiii + 397 pp. [ISBN: 0-201- 
41635-2] There is a good deal of research going on 
in the area of the formal specification of the syn- 
tax and semantics of programming languages. This 
monograph describes the research activities in one 
area of this research—algebraic specifications. The 
authors describe a system called ASF which uses al- 
gebraic techniques to describe language. They then 
apply this technique to a number of sample languages 
to show how it can be used. GMS 


Applications, T(13-14: 1). How to Model It: 
Problem Solving for the Computer Age. Anthony 
M. Starfield, Karl A. Smith, Andrew L. Bleloch. 
McGraw-Hill, 1990, xii + 206 pp, $19.95 (P). [ISBN: 
0-07-005897-0] An informal non-mathematical ap- 
proach to modelling designed to stimulate active, 
thoughtful discussion-centered teamwork among stu- 
dents who may not be very comfortable with formal 
mathematics. Encourages use of spreadsheets as a 
standard tool. Sample modelling projects: estimat- 
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ing how long it takes to cool a warm beer in a re- 
frigerator; planning to cook a dinner in a limited 
amount of time; determining how many ping pong 
balls would fill a room. LAS 


Applications (Physics), P. Random Media and 
Composites. Eds: Robert V. Kohn, Graeme W. 
Milton. SIAM, 1989, 214 pp, $27.50 (P). [ISBN: 
0-89871-246-7] Compiled from papers delivered at 
the workshop on random media and composites held 
at the Xerox Training Center in Leesburg, Virginia in 
December 1988. Topics of principle concern include 
percolation and breakdown in random resistor net- 
works, bounds and mean fields theories for the effec- 
tive moduli of composites, seismic inverse problems, 
localization in random media, and superconductiv- 
ity. MU 


Applications (Physics), S(18), P. Lecture Notes 
in Physics-844: PDEs and Continuum Models of 
Phase Transitions. Eds: M. Rascle, D. Serre, M. 
Slemrod. Springer-Verlag, 1989, vii + 229 pp, 
$29.70. [ISBN: 0-387-51617-4] Proceedings of an 
NSF-CNRS Joint Seminar held in Nice, France, Jan- 
uary 1988. The first section, Continuum Mechanics, 
includes papers on liquid crystals, elastic crystals, 
fluids, and general theory. The second section, Math- 
ematical Analysis, concerns systems of conservation 
laws, and miscellaneous topics. MU 


Applications (Physics), P. Boundary Stabiliza- 
tion of Thin Plates. John E. Lagnese. Stud. in Appl. 
Math., V. 10. SIAM, 1989, viii + 176 pp, $36.50. 
[ISBN: 0-89871-237-8] A comprehensive treatment 
of asymptotic stability of a thin plate when stabiliz- 
ing feedback mechanisms are introduced along a part 
of the edge. Primary emphasis is placed on the use of 
multipliers to obtain explicit estimates of the decay 
rate of the energy of the plate. LCL 


Applications (Social Science), P, L. Nystuen- 
Dacey Nodal Analysis. Ed: Keith J. Tinkler. Mono. 
No. 7. Institute of Mathematical Geography (2790 
Briarcliff, Ann Arbor, MI 48105), 1988, iii + 115 pp, 
$15.95 (P). A monograph giving detailed instructions 
for the implementation of Tinkler’s method of nodal 
analysis. Plenty of examples in the appendices. Ap- 
pendices also include reprints of earlier work by Tin- 
kler, as well as work by John D. Nystuen and Michael 
F. Dacey, the founders of nodal analysis in geogra- 
phy. SM 


Reviewers 


DFA: David F. Appleyard, Carleton; SB: Steve Benson, 
St. Olaf; JNC: Judith N. Cederberg, St. Olaf; LC: Laura 
Chihara, St. Olaf; WE: William Etter, Macalester; SG: 
Steven Galovich, Carleton; GG: George Gilbert, St. Olaf; RH: 
Rhonda Hatcher, St. Olaf; RWJ: Roger W. Johnson, Car- 
leton; JJ: Jason Jones, St. Olaf; RSK: Richard S. Kleber, 
St. Olaf; JDEK: Joseph D.E. Konhauser, Macalester; LCL: 
Loren C. Larson, St. Olaf; SM: Steve McKelvey, St. Olaf; AO: 
Arnold Ostebee, St. Olaf; FR: Flauren Ricketts, Macalester; 
GMS: G. Michael Schneider, Macalester; JS: John Schue, 
Macalester; KES: Kay E. Smith, St. Olaf; LAS: Lynn Arthur 
Steen, St. Olaf; MU: Milton Ulmer, Carleton; PZ: Paul Zorn, 
St. Olaf. 
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CALCULUS 


like you’ve never seen it before! 


Ow you and your students 
can keep in step with 
today’s colorful new math 


movement through our innovative 
calculus texts. 


Calculus 

Ross L. Finney and George B. 
Thomas Jr., both of Massachusetts 
Institute of Technology 


This text emphasizes a new way of 
learning and teaching calculus. 
Finney and Thomas integrate 
calculators and computers to give 
students an overall problem-solving 
and graphing approach to learning. 


An extensive supplements package 
includes these computer 
supplements: 


¢ AWTest (Computerized 
Testing) 


Calculus Toolkit 2.0 (Apple 
and IBM) 


Computer Explorations in 
Calculus (Workbook to 
accompany the Calculus 
Toolkit)* 

Mathematical Modeling with 
MathCAD (Workbook/disk to 
accompany The Student 
Edition of MathCAD)* 

The Student Edition of 
MathCAD® (IBM)* 

Master Grapher and 3D 
Grapher for the IBM, Apple II 
series, and the Macintosh; in 

3 1/2" and 5 1/4" disk sizes.* 


Mathpro (IBM)* 


Calculus with Analytic 


Geometry, Third Edition 
John B. Fraleigh, University of 
Rhode Island 


Fraleigh illustrates topics through 
graphing and extensive calculator use 
to give students a clear understanding 
of calculus. Exponential and logarith- 
mic functions are introduced early, 
which means your students can apply 
these concepts throughout the 
semester. Plus, this edition includes: 


¢ A comprehensive dictionary of 
functions in the precalculus 
review chapter. 


¢ Exploring Calculus, an IBM 
tutorial software package. 


¢ AWTest Edit, a computerized 
testing system with editing 
capabilites. 


Calculus and Analytic 


Geometry, Seventh Edition 
George B. Thomas Jr., and Ross L. 
Finney, both of Massachusetts 
Institute of Technology 


Seventh edition of the classic, best- 
selling text, known for its excellent 
applications and mathematical 
precision. 


* These items accompany both Calculus 
and Calculus with Analytic Geometry, 
Third Edition. 


vv Addison-Wesley 
Publishing Company 
1 Jacob Way « Reading, MA 01867 
617-944-3700 


Riddles 
of the 


Sphinx 


mathematical 
puzzle tales. 


by Martin Gardner 

Volume 32 in the New Mathematical Library 
184 pp., Paper. ISBN-0-88285-632-8 

List: $14.50 | MAA Member: $12.50 


Martin Gardner has charmed readers for over fifty 
years with his delightful books and articles on sci- 
ence. He is best known for the popular column, 
“Mathematical Games,” which appeared in Scien- 
tific American for twenty-five years. Generations 
of scientists and mathematicians have been 
inspired by his writing and the MAA is proud to 
include his name in its list of authors. 


This book was drawn from Gardner’s column 
in Isaac Asimov’s Science Fiction Magazine. The 
riddles presented here incorporate the responses 
of his initial readers, along with additions sug- 
gested by the editors of the New Mathematical 
Library. Each chapter (riddle) poses a problem 
answered in the First Answers section. The solu- 
tion in turn raises another problem that is solved 
in the Second Answers section. This may suggest 
a third question and in several instances there is 
a fourth. Gardner draws us from questions to 
answers always presenting us with new riddles— 
some as yet unanswered. There are 125 different 
pieces altogether. 


Ridd 


Solving these riddles is not simply a matter of 
logic and calculation, although these play a role. 
Luck and inspiration are factors as well, so begin- 
ners and experts alike may profitably exercise their 
wits on Gardner’s problems, whose subjects range 
from geometry to word play to questions relating 
to physics and geology. 


We guarantee that you will solve some of the 
riddles, be stumped by others, and be amused by 
almost all of the stories and settings that Gardner 
has devised to raise these questions. 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 
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Surfaces. 
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In two or 
three 


dimensions. 
And tt's animated! 


From our mail... 


“Another Success from Loscaux Graphics!" 

ME. City College of N Y 
"Excellent Program!! ..An exceptional educe- 
tional aid." D.W.-Dow Chemical 


"Just what I've been looking for!" 
D.B. Olivet College 
"Keep up the valuable programs at decent prices." 


Dr. R. B. TNU 


"Without your program's powerful layering and 
animation capabilities, we would not have attempted 
such a task.” M. T. Univ. of Wisconsin 


“Amazing Software - write more." 


“A super product * 
Recreational & Educational Computing 


For PC or Macintosh 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


Writing Mathematics Well, by Leonard Gillman 


64 pp., 1987, ISBN-0- 88385-443-0 
Catalog Number - WMW 


Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
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choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 


gs Order from 
mechanics of putting your manuscript into , sas 
print. No book can by itself make you a Th 6 Mathematical Association 


better writer, but this one will alert you to 


? 1529 Eighteenth St. NW 
the opportunities for better and more 


Washington, D.C. 20036 


BE DISCRETE 


CONTENTS: Introductory Ideas and 
Examples. Series. Cards, Decks, and 
Hands. Applications of Generating Func- 
tions. Analytic and Asymptotic Methods. 
Each chapter includes exercises. Solu- 
tions. 


January 1990, 196 pages, $29.95 
ISBN: 0-12-751955-6 


generatingfunctionology 
Herbert S. Wilf 


Generating functions are a “bridge” 
between discrete mathematics on the one 
hand, and continuous analysis and com- 
plex variable theory on the other. This 
book is about generating functions and 
some of their uses in discrete mathemat- 
ics. It is designed for use at the advanced Order from your local bookseller or directly from 
undergraduate level for courses in discrete 
mathematics and it gives a good introduc- @ 
tion to most of the basic enumerative ap- 
plications of generating functions in a 
very readable form. Each chapter includes 
exercises with solutions. 


ACADEMIC PRESS 


Harcourt Brace Jovanovich, Publishers 
Book Marketing Dept. #12040 
1250 Sixth Ave., San Diego, CA 92101 


CALL TOLL FREE 


1-800-321-5068 


Quote this reference number for free postage and handling 
on your prepaid order = 1 
Prices subject to change without notice. ©1990 by Academic Press, Inc 
All Rights Reserved. TC/MJD —12040 


FROM THE NEW MATHEMATICAL LIBRARY 


INVITATION TO 
NUMBER THEORY 


Invitation to Number Theory, 


by Oystein Ore 
129 pp., 1967, Paper, ISBN-0-88385-620-4 
List: $9.90 MAA Member: $7.90 


This outstanding book gives the reader some of the history of number theory, 
touching on triangular and pentagonal numbers, magic squares and Pythagorean 
triples, and numeration systems. It covers the primes and prime factorization 
(including the fundamental theorem of arithmetic), congruences (modular arith- 
metic) and their applications (including methods of checking numerical calcula- 
tions), tests for primality, scheduling tournaments, and ways of determining the 
week day of a given date. 

Ore writes of his book, ‘‘The purpose of this simple little guide will have been 
achieved if it should lead some of its readers to appreciate why the properties of 
numbers can be so fascinating. It would be better still if it would induce you to try 
to find some number relations of your own; new curiosities devised by young 
people turn up every year.’’ The enterprise of making such discoveries is very 
broad including the invention and study of the curious sequence, 1, 11, 21, 1211, 
111221, ... (to understand it read it aloud) by Cambridge Professor John 
Horton Conway to the discovery of new Mersenne primes by high school students 
Laura Nickel and Kurt Noll. Ore’s book is a good place for readers to learn of the 
fascination that numbers hold. 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 


Toward | 
52 Lean 
co and Lively). 
= ~=Calculus 


Report of the Conference/Workshop to Develop Curriculum 
and Teaching Methods for Calculus at the College Level, 
Ronald G. Douglas, Editor. 

MAA Notes #6 

249 pp., 1987, Paperbound, ISBN-0-88385-056-7 

Catalog Number - NTE-07 


Price: $12.50 


Should calculus be taught differently? Can it? Common wisdom says 
‘‘no’’——which topics are taught, and when, are dictated by the logic of the 
subject and by client department. The surprising answer from a four-day 
Sloan Foundation-sponsored conference on calculus instruction, chaired by 
Ronald Douglas, is that significant change is possible, desirable, and neces- 
sary. Meeting at Tulane University in New Orleans in January, 1986, a diverse 
and sometimes contentious group of twenty-five faculty, university and foun- 
dation administrators, and scientists from client departments, put aside their 
differences to call for a leaner, livelier, more contemporary course, more 
sharply focused on calculus’s central ideas and on its role as the language 
of science. 

This volume contains the results of that conference and the papers pre- 
sented to the conferees. These are certain to be the point of departure and 
basis for efforts to strengthen and reshape calculus in the next decade. 


Order from 
The Mathematical Association 
of America 
1529 Eighteenth St., NW 
Washington, DC 20036 


AA STUDIES IN 
MATHEMATICS 


Studies in Numerical Analysis 
MAA Studies in Mathematics #24 

Gene H. Golub, Editor 

415 pp. Hardbound. 

List: $46.50 MAA Member: $34.50 


This volume is a collection of papers describing the wide range of research ac- 
tivity in numerical analysis. The articles describe solutions to a variety of prob- 
lems using many different kinds of computational tools. Some of the compu- 
tations require nothing more than a hand held calculator: others require the 
most modern computer. While the papers do not cover all of the problems that 
arise in numerical analysis, they do offer an enticing and informative sample. 

Numerical analysis has a long tradition within mathematics and science, 
beginning with the work of the early astronomers who needed numerical pro- 
cedures to help them solve complex problems. The subject has grown and de- 
veloped many branches, but it has not become compartmentalized. Solving 
problems using numerical techniques often requires an understanding of sever- 
al of the branches. This fact is reflected in the papers in this collection. 

Computational devices have expanded tremendously over the years, and 
the papers in this volume present the different techniques needed for and made 
possible by several of these computational devices. 


Table of Contents 
The Perfidious Polynomial, James H. Wilkinson 
Newton’s Method, Jorge J. Moré and D. C. Sorensen 
Research Directions in Sparse Matrix Computations, Jain S. Duff 
Questions of Numerical Conditions Related to Polynomials, Walter Gautschi 
A Generalized Conjugate Gradient Method for the Numerical Solution of Elliptic 
Partial Differential Equations, Paul Concus, Gene H. Golub and Dianne 
P. O’Leary 
Solving Differential Equations on a Hand Held Programmable Calculator. 
J. Barkley Rosser 
Finite Difference Solution of Boundary Value Problems in Ordinary Differential 
Equations, V. Pereyra 
Multigrid Methods for Partial Differential Equations, Dennis C. Jespersen 
Fast Poisson Solvers, Paul N. Swarztrauber 


Poisson’s Equation in a Hypercube: Discrete Fourier Methods, Eigenfunction 
Expansions, Pade Approximation to Eigenvalues, Peter Henrici 


geticar 
xf. Order From: 
| sa & The Mathematical Association of America 
: £ 1529 Eighteenth Street, N.W. 


amen” Washington, D.C. 20036 


THE AMERICAN 
MATHEMATICAL MONTHLY 


olume 97, Number 5 May 1990 


Contents (ISSN 0002-9890) 
ARTICLES 
The Ergodic Character of Sequences of Pedal Triangles........ PETER D.LAX 377 
Stability Properties of Geometric Inequalities ................. H.GROEMER 382 
NOTES 
Separable Extensions and Diagonalizability ................ FRED RICHMAN 395 
A Birthday Present for Ramanujan .................... M.D. HIRSCHHORN 398 
A Condition for Flatness 
of Curvesin RAR” ...... 2. YAAKOV S. KUPITZ ann MICHA A. PERLES 401 
A Counterexample on Vector Norms 
and the Subordinate Matrix Norms ................ PIETER P.N.DE GROEN 406 
Ambiguous Numbers are Dense ..................... MARKO PETKOVSEK 408 
The Diophantine Equation x2 + y"™=z?", Addendum ......... DAVID W. BOYD 411 


THE TEACHING OF MATHEMATICS 


Singular Values and the Spectral Theorem ............... K. HOECHSMANN $413 
A Remark on the Shortest Confidence Interval 

ofaNormal Mean ................ ROBERT BARTOSZYNSKI anno WAI CHAN 415 
For Your Eyes Only ..........0 00.00. c eee eee eee GARY A. MARTIN 418 
Cauchy-Bunyakowski-Schwarz Inequality Revisited ....... FRANCOIS DUBEAU 419 
An Old Max-Min Problem Revisited ............... MARY EMBRY-WARDROP_ 421 


Functional Equations and L’H6pital’s Rule 
in an Exact Poisson Derivation ................ 0.0.00 000 eee J.ACZEL 423 


PROBLEMS AND SOLUTIONS 


Elementary Problems and Solutions................ 22220 Fee eee. 427 

Advanced Problems and Solutions................00 00002002 432 
REVIEWS 

The Mathematics of Plato’s Academy. A New Reconstruction. 

By D.H. Fowler ......... 20.00.0002. RICHARD K. GUY 440 


TELEGRAPHIC REVIEWS ...........0 000.0000 0000 ee 444 


NOTICE TO AUTHORS 


See statement of editorial policy (volume 89, p. 3). Follow the format in current issues. Put your full mailing 
address in a line at the head of the paper, following the title and the author’s name. Send three copies of the 
manuscript, legibly typewritten on only one side of the paper, double-spaced with wide margins, and keep one as 
protection against loss. Prepare illustrations carefully, on separate sheets of paper in black ink, the original without 
lettering and two copies with lettering added. Rough copies of the illustrations should be included in the manuscript 
at the appropriate places. Supply the full five-symbol Mathematics Subject Classification number, as described in 
Mathematical Reviews, 1985 and later. (This is necessary for indexing purposes.) 

All manuscripts whose length is more than 7 manuscnpt pages should be sent to HERBERT S. WILF, 
Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104. Shorter articles about the 
teaching of mathematics should be sent to either MELVIN HENRIKSEN (linear algebra, algebra, differential 
equations), Department of Mathematics, Harvey Mudd College, Claremont, CA 91711, or STAN WAGON 
(calculus, analysis, number theory, discrete mathematics, statistics), Department of Mathematics, Smith College, 
Northampton, MA 01063. Other shorter articles should be submitted as follows: in algebra, discrete mathematics, 
or probability, to RODICA SIMION, Department of Mathematics, George Washington University, Washington, 
DC 20052; in geometry or topology to DENNIS DETURCK, Department of Mathematics, University of 
Pennsylvania, Philadelphia, PA 19104; in analysis to DAVID J. HALLENBECK, Department of Mathematical 
Sciences, University of Delaware, Newark, DE 19716. Papers in other fields may be sent to any of the editors; 
however, do not send a paper to more than one editor. Proposed problems or solutions should be sent to PAUL T. 
BATEMAN, MONTHLY Problems, Department of Mathematics, University of Illinois, 1409 West Green Street, 
Urbana, IL 61801; unsolved problems to RICHARD GUY, Department of Mathematics and Statistics, University 
of Calgary, Alberta, Canada T2N 1N4. 


EDITOR 
HERBERT S. WILF 


ASSOCIATE EDITORS 


PAUL T. BATEMAN DAVID J. HALLENBECK RODICA SIMION 
DONNA BEERS PAUL R. HALMOS ANITA E. SOLOW 
DENNIS DETURCK MELVIN HENRIKSEN JOEL SPENCER 
HAROLD G. DIAMOND JOAN P. HUTCHINSON LYNN A. STEEN 
JOHN DIXON WILLIAM M. KANTOR KENNETH B. STOLARSKY 
J.H. EWING JOSEPH KONHAUSER STAN WAGON 
RICHARD GUY RICHARD LIBERA DOUGLAS B. WEST 
LEE A. RUBEL 


Reprint Permission: Marcia P. SwarbD, Executive Director, Mathematical Association of America, 1529 
Eighteenth Street, N.W., Washington, DC 20036. 

Advertising Correspondence: Ms. ELAINE PEDREIRA, Advertising Manager, Mathematical Association of Amer- 
ica, 1529 Eighteenth Street, N.W., Washington, DC 20036. 

Subscription correspondence, changes of address, and inquiries about nondelivered or defective issues: Member- 
ship /Subscriptions Department, Mathematical Association of America, 1529 Eighteenth Street, N.W., Washington, 
DC 20036. 

Microfilm Editions: University Microfilms International, Serial Bid Coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MAILHEMATICAL MONTHLY (ISSN 0002-9890) is published monthly except bimonthly June-July and 
August-September by the Mathematical Association of America at 1529 Eighteenth Street, N.W., Washington, DC 20036 
and Montpelier, VT. 

The annual subscription price of $30 for the AMERICAN MATHEMATICAL MONTHLY to an individual member of the 
Association is included as part of the annual dues. (Annual dues for regular members, exclusive of annual subscription 
prices for MAA journals. are $60. Student and unemployed members receive a 66% dues discount; emeritus members 
receive a 50% dues discount: and new members receive a 40% dues discount for the first two years of membership.) The 
nonmember/library subscription price is $120 per year. 

Copyright « by the Mathematical Association of America (Incorporated), 1990, including rights to this journal issue as 
a whole and, except where otherwise noted, rights to each individual contribution. General permission is granted to 
Institutional Members of the MAA for noncommercial reproduction in limited quantities of individual articles (in whole or 
in part) provided a complete reference is made to the source. 

Second class postage paid at Washington, DC, and additional mailing offices. 

Postmaster: Send address changes to the AMERICAN MATHEMATICAL MONTHLY, Membership/Subscriptions Depart- 
ment, Mathematical Association of America, 1529 Eighteenth Street, N.W., Washington, DC 20077-9564. 

PRINTED IN THE UNITED STAILS OF AMERICA 


The Ergodic Character of Sequences of Pedal Triangles 


PETER D. LAx 


PETER D. LAX received his Ph.D. from the Courant Institute of Mathematical 
Sciences of New York University in 1949 and has been on its faculty since 
1950, doing research in both pure and applied mathematics. The present 
article is an application of concepts from applied mathematics to a problem 
in pure mathematics. 


In an interesting article in this journal, Kingston and Synge discuss sequences of 
pedal triangles {7,,}, where T,,, is the pedal triangle of T,,, 1.e., its vertices are the 
feet of the altitudes of T,. The authors observe that there are pedal sequences which 
are periodic in shape, and construct all of them. In this note we demonstrate that 
the shape of triangles in a pedal sequence is ergodic. 

Denote by A, B,C the angles of a given triangle; the angles A’, B’, C’ of its pedal 
triangle are simply related to A, B,C: 

If all angles A, B,C are acute, 


A’ =a2-—2A, B’ =a — 2B, C’ =a —2C; (1) 
if A is obtuse, 
A’ =2A—1_, B’ = 2B, C’ = 2C, (2) 
with analogous formulas when B and C are obtuse. Let’s call the mapping 
A,B,C? A,B,C 
the pedal mapping, denoted by P. 


Kingston and Synge give an elegant and useful representation to the pedal 
mapping renormalized by dividing all angles by 7, 1e., setting 


A =aretc., A’ = a'r etc. 
For then 
at+t+b+c=1, a’t+b’+c’=1, 


so that (a, b, c) and (a’, b’, c’) can be regarded as barycentric coordinates of points 
of a triangle EF, with vertices P, Q, R. The pedal mapping can then be regarded as a 
mapping of EF into E. Divide E into four congruent triangles, each half the size of 
FE, see FIGURE 1. Denote the subtriangles adjacent to P,Q, R by E.,i = P,Q, R, 
and denote by £,, the one in the middle. 

A triangle ZT whose angles are acute corresponds to a point interior to Ey). 
Formulas (1) and (2) for the angles of the pedal triangle can be expressed as follows: 

P maps points of F,, by dilatation from the midpoint (1/3,1/3,1/3) of E by a 
scale factor — 2. 

P maps points of £,,i = P,Q, R by dilatation from P,Q or R by a scale fac- 
tor 2. 
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FIG. 1 


Note that each E, is mapped onto E; P is discontinuous across the boundary of 
Ey, except for the midpoints of its sides. 

The pedal mapping can be regarded as a two-dimensional analogue of the 
mapping x — f(x) of the unit interval into itself defined by 


2x O<x< , 
f(x) > 1 (3) 
2— 2x 5 <x<l 
The graph of f is roof-shaped: 
0 4 1 
FIG. 2 


In fact, the restriction of the pedal map to any of the medians of F is the roof 


mapping (3). 
The pedal mapping shares many of the properties of the roof mapping: 


THEOREM 1. The pedal mapping is area preserving, i.e., for any measurable subset 
G of E, 


m(G) = m(P-(G)), 
where m is Lebesgue measure in the plane. 


Proof. This is obvious, since the inverse image of G consists of four pieces G, 
located in the four subtriangles E,,i= P,Q, R, and M, each congruent to G 
diminished by a factor of 2. Therefore each G, has measure (1/4)m(G), and their 
union has measure m(G). 


THEOREM 2. The pedal mapping is ergodic. 
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This means: for any point qg in FE, any subset O of F and any natural number n 
denote by 
N(q, O,n) 


the number of points 


q,P(q),-..,P"-"(q) 
that lie in O. The ergodic property is that for almost all g and every open set O, 


 N(q,O,n) — areaof O 
im = ————— = ——_. 
n— 00 n area of E 


In words: for almost all g, the successive images of g under P are on the average 
equally distributed over E. 

According to G. D. Birkhoff’s pointwise ergodic theorem, a measure preserving 
map is ergodic iff it is metrically transitive. This means any measurable subset of E 
that is invariant under P has measure either 0 or m(E£). 

We show now that the pedal map P is metrically transitive. Let S C E bea set 
invariant under P; denote by c its relative measure 

m(S 
c= (S) , (4) 
m(E) 
Denote by S, the portion of S contained in E,, i = P,Q, R, M,i.e.,8,;= S00 £;. 

Since S is invariant under P, the images PS, are contained in S. Since the 
measure of PG is four times the measure of G for any set G contained in any of the 
subtriangles E,, it follows that 

4m(S,) < m(S), i= P,O,R, M. (5) 
Adding these inequalities and dividing by 4 we get 


»m(S;) < m(S). 


Using the additivity of measure, and that 


US = S, 
we get 
m(S) <m(S). 
It follows that the sign of equality must hold in (5) for all i: 


1 
m(S,) = qin(S). 


Combining this with the notation (4), and using the fact that 


we get that 


fori= P,Q, R, M. 
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We claim that (6) holds for every triangle E, obtained from E by repeated 
subdivision into four congruent parts. We have already shown this for the four 
triangles obtained by a single subdivision; to show it for triangles obtained by n 
subdivisions we argue by induction on n. Denote by { £/’} all the triangles obtained 
by n + 1 subdivisions, and denote by S,’ the intersection of E£,’ and S. P maps each 
E/ onto a triangle E y obtained by n subdivisions; therefore S’ is contained in 
S, = E,O S. Arguing as in (5) we conclude that 

4m(S/) < m(S,). 
By induction hypothesis, (6) holds for S; therefore, 
4m(S/) < cm(E,) = c4m(E/). (7) 


We sum over all S’; using additivity of measure and that 


we cbtain, after dividing by four, 
m(S) <cm(E). 
According to (4) the sign of equality holds above; therefore equality must hold in 
(7) for all i; this completes the induction. 
Next we claim that (6) holds for all rectangles R in E, 1.e., that for all R, 
m(ROS)=cm(R). (8) 


For every rectangle can be approximated arbitrarily closely from the inside and 
outside by unions of triangles E£’. 
Denote by R(h, p) the square of side length h centered at p. According to a 
classical theorem of Lebesgue, for any measurable set S, 
_ m(R(h, p) 1S) 
im 
h—0 m(R(h)) 
exists for almost all p, and the limit is 1 at almost all points of S, and zero at almost 


all points in the complement of S. In light of (8) this means that c is either 1 or 0. In 
the first case 


m(S) = m(T), 
in the second case 
m(S) = 0. 
This completes the proof that P has no nontrivial invariant sets, and thereby has the 
ergodic property. Q.E.D. 


We can simplify the last step in the proof by appealing to the much more 
elementary L' version of Lebesgue’s theorem, as follows. Denote by p the character- 
istic function of the unit square R around the origin, 1.e., 


fl forpinR 
e(P) (6 otherwise. 


Denote by p, the function 
p,(p) = h~“e( ph); 
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for small h, p, serves as an approximate 6-function. For any x in L! we denote by 
x,, its mollification by p, 


Xn — On *X; 


where * denotes convolution. Obviously, if x is continuous, x, tends uniformly to 
x as h tends to zero. Since convolution with p, is bounded in the L' norm 


Pn * Xl1 < [XI 


it follows that for any x in L', x, tends to x in the L' norm as A tends to zero. 
We apply this to the characteristic function x of S 


1 forpinS 
— 9 
x(P) ‘ otherwise. (°) 
It follows from (8) that for any h, 
Xn = C: 


Taking the limit as # tends to zero we conclude that x =c. In view of (9) we 
conclude that c is either 0 or 1, and S has measure either 0 or 1. Q.E.D. 
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Abstract. Geometric inequalities, like the isoperimetric inequality, have been a subject of intensive 
research for a long time. More recently, some properties of these inequalities, which may be called 
stability properties, have been investigated. Roughly speaking, these investigations concern the geometric 
implications if the inequalities are in a certain sense close to equalities. The present article is primarily an 
exposition of various stability results concerning inequalities for plane convex sets, including a discussion 
of the concepts and methods which are of importance in this area. Moreover, several new results and 
proofs are presented. 


1. Introduction. Let R” denote n-dimensional Euclidean space. A bounded con- 
vex subset of R” will be called an n-dimensional convex body if it is closed and has 
interior points. We let @” denote the class of all n-dimensional convex bodies. 
Two-dimensional convex bodies will be called convex domains. 

We consider primarily inequalities of the form 


®(K)>0, (1) 


where ® is real valued and (1) is supposed to hold for all K € @7. For example, if 
we let p(K) denote the perimeter and a(K) the area of K we may set ®(K) = 
p(K )* — 42a(K ). Then (1) is the well-known isoperimetric inequality. 

Let now @, denote those elements K € @* for which 


®(K) =0. 


For instance, in the case of the isoperimetric inequality @@ 1s known to consist of 
all (closed) circular discs. We are interested in the stability problem associated with 
(1). That means, we ask if K must be close to a member of @, whenever ®(K ) is 
close to 0. In the case of the previous example the question is whether K must be 
close to a circular disc if p(K)* — 47a(K) is small. In order to give a precise 
formulation of the stability problem let us assume that we are given a function that 
measures in some sense the deviation between two convex domains. This will always 
be a nonnegative function g(K, L) defined for all K, L © @? which has the 
property that g(K, L) =0 if and only if K = L. If ®, @ and g are given, the 
stability problem associated with the geometric inequality (1) can now be formu- 
lated as follows: 


Find positive constants c, a with the property that whenever 
O(K)<e (2) 
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(for some ¢ => 0), then there exists an L © @y such that 
g(K,L) < ce*. (3) 


(Instead of ce* one could admit more general functions but for the purpose of 
this article nothing would be gained.) For example, if again ®(K) = p(K)? — 
4cra(K) and if g(K, L) denotes the Hausdorff distance (as defined in the next 
section) it will be seen that (2) implies that there is a circular disc L with 
g(K, L) < (1/4y )e!” (cf. the remarks regarding (13)). 

It is of interest to note that a solution of the above stability problem implies the 
original inequality (1). Indeed, if we had ®(K) <0 we could take e = 0 which, 
because of (3) would imply g(K, L) = 0. But this is impossible since it would yield 
K = L and therefore ®(K ) = 0. 

If one chooses in (2) e = ®(K), then g(K, L) < c®(K )* and consequently 


®(K) >(g(K, L)/c)'”". (4) 


This inequality, which has to be interpreted in the sense that for every K there is a 
Le @, such that (4) holds, is obviously a strengthened version of (1). It is also 
clear that any inequality of the form (4) leads immediately to a solution of the above 
stability problem. In the case of the isoperimetric inequality the inequality corre- 
sponding to (4) and the previously mentioned stability result is 


p(K)° — 4ra(K) > l6mg(K, L)’, 


where g is again the Hausdorff distance and L is a suitable circular disc. 

Obviously, one can consider many variations and generalizations of these stabil- 
ity problems. For example, there is no need to restrict attention to the case n = 2 or 
to inequalities involving only one convex body. Some of these generalizations will be 
discussed later. 

It is the aim of the present article to survey a variety of stability results. At the 
same time this exposition provides us with the opportunity to introduce and discuss 
some of the concepts that proved to be useful in this context. Finally, several new 
results and proofs will be presented. These mostly concern stability results for plane 
convex sets that are stronger than those obtained by specializing known results for 
n-dimensional convex bodies to the case n = 2. 


2. The Support Function. Let K be a convex domain and assume that the origin 
o of R” is an interior point of K. If wis a unit vector in R* we let L(u) denote the 
line that is perpendicular to u and has the property that it meets K but not the 
interior of K, and that u points into the halfplane determined by L(u) that does not 
contain K. We call L(u) the support line of K associated with u. The distance 
between o and L(u), considered as a function of u, is called the support function of 
K. It is usually denoted by H(u). However, since u is uniquely determined by the 
angle, say @, between the positive x-axis and u we also write H(@) instead of 
H(u(@)). It is easy to see that H(u) = sup{u- x:x € K }, where u- x denotes the 
usual scalar product. This formula can also serve as a definition of H(#) in the case 
when o ¢ K. Obvious arguments show that H(@) is a continuous function of @. The 
support function is one of the most fundamental concepts in the theory of convex 
sets and 1s discussed in practically all textbooks on convexity (see for example [2], 
[4], [10], [20], [25}). 
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As before, let p(K ) denote the perimeter and a(K ) the area of K. We have 


p(K) = f'"H(8) ad (5) 


and, if H is sufficiently smooth and ’ denotes the derivative with respect to @, 


a(K) = Sf (uy — H’(8)°) dé. (6) 


(5) is an immediate consequence of the two-dimensional version of the so-called 
surface area formula of Cauchy (see [4, p. 48]). If K is strictly convex and AH(@) is 
continuously differentiable the following analytic proof of (5) is also of interest (cf. 
Minkowski [27]). If we set q= KM L(u) then H(u) = q-u and consequently 
A’(@)=q-u+q-:u'=q-w (since q’ is a tangent vector of the curve dK and 
therefore perpendicular to u). Hence, q = (q: wus (q- uw) = A(#)u + AA), 
and because of u’” = —u it follows that q’/(@) = (A(@) + A’(@))uw’ where H(@) + 
H'(@) = 0. Thus, 


27 27 27 
p(K) = ["q'() ad = f-"(a() + H°(8)) dd = [7H (8) a8. 
0 0 0 
Next we will prove (6). Assume that H is twice continuously differentiable. If we 
set ||q|| = r and let B denote the angle between the positive x-axis and the vector q, 
then 


Because of rcos(B — 6) = H(@) and r= yH(6) + H’(6)’ we obtain after a 


simple calculation r7(dB/d0@) = H(6@)* + H(@)H’(@). Therefore, 
1 pon 
a(K) = sf (11(6)° + H(8)H"(8)) a6, 


and (6) follows by an obvious integration by parts. 

For further comments on (6) see |4] and Blaschke [1] who has shown that (6) 
holds for any convex domain if H’ is properly interpreted. However for the purpose 
of proving stability results smoothness assumptions are usually no serious restriction 
since the general case can be obtained by routine approximation arguments. 

Another concept that can be defined in terms of the support function and will 
play a role in our stability statements is the Steiner point of K. It can be defined by 


s(K) = ~ ["u(o) H(0) a0, 


T “0 


where an obvious generalization of the ordinary integral to vector-valued functions 
is used. If the curve C = OK is sufficiently smooth and if one assigns to each point 
of C the curvature as density, then s(K ) is the center of mass of this “inhomoge- 
neous” curve. For more information on the Steiner point of a convex body one may 
consult [19], [25], [30], and [31]. 


3. Fourier Series. The support function H(@) of a given convex domain K is 
always continuous, bounded, and of period 27. Consequently it has a Fourier series 
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of the form 
ag + > (a,cosné + b, sinn@). 


n=1 


We wish to express p(K), s(K), and a(K) in terms of the Fourier coefficients. 
From (5) and the definition of the Steiner point and the Fourier coefficients we 
obtain immediately 


p(K) = 21a, (7) 
and 


s(K) = (a, 5;). (8) 
From (6) and Parseval’s equation 


2a, + y (a> + b>) , 


n=1 


[CH(8) 40 = 0 


0) 


together with the corresponding equation for H’(@) (if this function is also continu- 
ous), one can deduce that 

a(K) =a, — ~ ¥ (n’ - 1)(a? + b?). (9) 
Blaschke [1] has shown that (9) can actually be proved without any smoothness 


assumptions; he also noted the validity of (8). Regarding (8) see also Kubota [24], 
Meissner [26], and the historical remarks of Schneider in [31]. 


4. Deviation Measures. Let K and L be two convex domains with respective 
support functions H, and H,. The most frequently used function to measure the 
deviation of L from K (or K from L) is the Hausdorff distance. It can be defined 
by 


h(K,L)= max | H, (u) — H,(u)|. 


The dominating role of the Hausdorff distance in the theory of convexity is evident 
from such books as [2], [4], [10], [20], and [25]. However various other deviation 
measures have been proposed and used (cf. [33], [34], and the survey [18]). One such 
measure which appears to be of particular value with respect to stability problems is 
the measure that corresponds to the L,-metric in function spaces. It is defined by 


a 5 1/2 
(KL) = | [| (8) ~ He 0) a 


From the continuity of the support function one deduces easily that h,(K, L) = 0 
if and only if K = L. In the next section it will be seen that A, arises in a very 
natural way; see also [5] and [23]. The quantitative relationship between h and h, 
has been studied by Vitale [34]. Among other (and much more general) results he 
found that for any two convex domains K, L whose diameters do not exceed d the 
following relationship is always satisfied: 


1 - K 2/3 
Saha KL) < WK L) < (| (K,L). 
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If K is given we can ask which circular disc yields the best approximation of K 
in terms of the h, deviation measure. The following theorem shows that this 
problem leads again to the Steiner point of K. For the sake of convenient 
formulation we associate with each K © @? the circular disc S(K) which has 
radius p(K)/2a and center at the Steiner point s(K) of K. We call S(K) the 
Steiner disc of K. 


THEOREM 1. If K © @* and X is any circular disc, then 
h,(K, X)° =h,(K, S(K))° + h,(S(K), X)’, (10) 


where S(K ) is the Steiner disc of K. In particular, h,(K, X), considered as a function 
of X is minimal if and only if X is the Steiner disc of K. 


Proof. We only have to prove (10) and may assume that s( K ) = 0. Then, because 
of (7) and (8) the support functions Hy, Hy, and Hs-x) have, respectively, the 
following Fourier series: 


(p(K)/27) + > (a,cosné + b, sin n@), 
n=2 


(p(X)/27) + a, cos 6 + B, sin, 


p(K)/27. 


Hence, using Parseval’s equality we can deduce that 


1] o.6) 


hal KX) = 5-( p(K) ~ (20)? + aad + Be) + E (al + 62) 
h,(K, S(K))° =a E («i +b), 
1 
h,(S(K), X)° = —(p(K) ~ p(X))° + a (az + BF). 


(10) is now an obvious consequence of these three relations. 


5. Stability of the Isoperimetric Inequality. We consider now the stability of the 
isoperimetric inequality p(K )* — 47a(K) > 0 with respect to the deviation mea- 
sure h,. As pointed out in the introduction we may formulate stability results as 
strengthened forms of the original inequality. In the present case we choose the 
following formulation. 


THEOREM 2. Let K be a convex domain of area a( K ) and perimeter p(K ), and let 
S(K ) denote the Steiner disc of K. Then, 


p(K) — 42a(K) > 6nh,(K, S(K))’. (11) 


Equality holds if and only if the support function of K is of the form a, + a, cos @ + 
a,cos2@ + b, sin@ + b, sin2é. 
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Proof. Let us first assume that the smoothness assumptions under which (7), (8), 
and (9) have been derived are satisfied. Then, 


2 2,2 2 ™ = 2 2 2 
p(K)° —4ma(K) = 4a aj — 4n| maj — 5 y(n —1)(a? + b?) 
n=2 


= 2m? Y (n? —1)(a2 + b2) = 60? Y (a? + b?) 
n=2 


n=2 
= 6rh,(K, S(K))’. 


The general case can be settled either by referring to Blaschke [1] where it is shown 
that (7), (8), and (9) hold without any regularity assumptions, or by using the fact 
that arbitrary convex domains can be approximated to any desired degree of 
precision (in the Hausdorff metric) by domains with analytic boundary (cf. [4] or 
[18]). Although the remark about the occurrence of the equality sign is an obvious 
by-product of our proof it has to be noticed that it is only meaningful if there 
actually are convex domains whose respective support functions are of the indicated 
type. An example of such a domain can be constructed by considering the curve 
defined by 


x = cost + = sin’ t, y= sint + 3 cos’ t (0<t<2m). 


It is an easy (but somewhat lengthy) exercise in elementary differential geometry to 
prove that this is a convex curve whose corresponding convex domain has the 
support function 


1 
H(@) =1+ é sin’ 6. 


See Inzinger [23] for a very detailed study of support functions that are partial sums 
of trigonometric series. 

We add several remarks concerning Theorem 2 and its proof. Our proof of (11) 
actually yields an explicit expression (in terms of the Fourier coefficients) for the 
isoperimetric deficit p(K )? — 42a(K ), namely, 


p(K) — 4na(K) = 29’ (03 -- 1)(a? + b?). (12) 


This relationship can be used to obtain a stability statement in terms of the 


Hausdorff metric. Since it is easily seen that |a, cosn@ + b, sin n@| < ja? + b? it 
follows that 


|Hx(0) — Ayn) (9)| =l|a)>+ > (a,cosné + b, sinn6) 
1 


n= 


—(a,) + a,cos 6 + b, sin@) 


CO CO 
> |a,cosn6+b sinnO|< > jai +b. 
n=2 


n=2 


lA 
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Hence, using Hoélder’s inequality, we find 


oe) 00 V/27 «& fe 
h(K,S(K)) < ¥ a+b < | Y 1/(n? - 1) | y(n? — 1)(a; + bf) 
n=2 n=2 n=2 


and because of 


and (12) we finally obtain 


p(K)*° — 47a(K) = aK, S(K))’. 


But this inequality is not best possible. Better results can be deduced from a 
theorem of Bonnesen [3] which states that for any K there are two concentric discs 
of respective radii R and r such that 0K is contained between these discs and 


p(K)° — 4a(K) > 40(R-r)’. (13) 


This implies obviously that there is a disc D (of radius (R+r)/2, but not 
necessarily the Steiner disc) such that 


p(K) — 42a(K) > 162h(K, D)’. 


Inequalities of the type (13) are known as Bonnesen type inequalities. More 
information on such inequalities can be found in [4] and in the survey [28]. 

Although (11) cannot be improved for all K € @* it is possible to prove stronger 
inequalities for particular kinds of convex domains. For example, if K 1s of constant 
width the Fourier expansion of the support function of K has the property that 
a, = b, = 0 for all even values of n. Checking the proof of (11) and the subsequent 
remarks one sees that this fact leads to the inequalities p(K)* — 47a(K) > 
l6mh,(K, S(K))? and p(K)? — 47a(K) > 8977h(K, S(K))*. From the latter in- 
equality one can derive that p(K)? — 42a(K) > 277(R — r)?, which is an im- 
provement of Bonnesen’s inequality in the case of domains of constant width (cf. 
also [15)). 

The idea of using Fourier series for proving the isoperimetric inequality is due to 
Hurwitz [21]. With minor modifications this proof has been reproduced in many 
textbooks (for example in [7]). However Hurwitz did not use the Fourier expansion 
of the support function but employed two series, one for each of the coordinates of 
the points of the boundary curve. In a subsequent paper [22] he gave another proof 
of the isoperimetric inequality using the Fourier series expansion of the radius of 
curvature. A proof of the isoperimetric inequality that is based on the support 
function and Fourier series was first given by Blaschke [1] (he used the Fourier 
series of the derivative of the support function); see also Geppert [12]. But none of 
these articles deals with stability questions. Various other geometric applications of 
Fourier series have been investigated by Meissner [26]. 

Concerning the stability problem of the isoperimetric inequality in higher dimen- 
sions see [8], [29], and [32]. 


6. Mixed Areas. The isoperimetric inequality can be significantly generalized by 
the introduction of “mixed areas.” These can be defined by the following procedure. 
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If K, L€ @? and a, B are non-negative real numbers we set 
ak + BL={ap+ Bq:pEK, qeL}-. 
One can easily prove that the support function of aK + BL is aH, + BH, (where, 


of course, H, and H, are the support functions of K and L, respectively). Using 
(6) one finds immediately 


a(aK + BL) = a’a(K) + 2aBa(K, L) + B’a(L), 
with 
277 


a(K.L) = 5 [Wy (A), (8) — HE (8) HO) d8. 


In particular, a(K, K)=a(K) and, if C is a circular unit disc, a(K,C) = 
(1/2) p(K ). The generalization of the isoperimetric inequality is Minkowski’s in- 
equality 

a(K, L)” — a(K)a(L) > 0, 


where equality holds if and only if K and L are homothetic (i.e., K = aL + p with 
p € R” and a > QO). This inequality has been proved repeatedly, see for example [1], 
[4], or [11]. The above integral representation of a(K, L) or the obvious relation 


2a(K,L) =a(K+L)-a(K)-a(L) 


enable one again to use Fourier series to establish stability results. However, as 
recently shown by Wallen [35], the best approach is probably to first use Fourier 
series to prove a general inequality, known as Wirtinger’s inequality, and then to 
employ this inequality to show the stability results. Using this method Wallen has 
proved the following theorem. 


Let K and L be two convex domains and let R and r be defined by 
R=min{t: LCiK+p, ¢>0,peR’*}, 
r=max{s:sK+qCL, s>0,qeR°*}. 

Then, 
a(K)° 
4 


This inequality will fit better into the context of our stability results if it is 
reformulated in terms of the Hausdorff distance. This yields the following theorem: 


a(K, L)° — a(K)a(L) = (R—r)’. (14) 


THEOREM 3. If K and L are convex domains, then there is a homothetic copy L' of 
L such that a(.L’) = a(K ) and 


a(K, L)° — a(K)a(L) > ————A(K, L’)’, (15) 


where d(K ) denotes the diameter of K. 


Proof. In view of (14) we have only to show that there is a homothetic copy L’ of 
L such that a(L’) = a(K ) and 


h(K, L’) < Ja(K)/a(L)(R —r) d(K). 
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Since the case R = r is trivial we may assume R > r. The definition of R and r 
shows that there are points p, q © R? such that 


rK+qCLCRK +p. 


This can also be written in the form 


_ r R _ 
[K+ 5 =) CL+ p - —q cR[K +5 sa 
R- R-r R-r R- 
Hence, replacing (if necessary) K and L by suitable translates we may assume that 
rK CLC RK, 


If we set k = ja(K )/a(L), L’ =kL, r’ = xr, and R’ = kR, then a(L’) = a(K) 
and 


r'KCL'C R'K. 
It follows that o € K,o € L’, and r’ < 1 < R’. Using these facts we find 
H,)(9) — Hx (0) > rH (8) — Hg(9) = —(1 — 1’) Hg (0) = -(R'- 1’) d(K) 
and 
H,(6) — Hg (0) < R’Hg (9) — Hg (8) = (R’ - WA (8) < (R’- 1’) d(K). 


Consequently we obtain the desired result 


h(K, L’) <(R’—r') d(K) = Ja(K)/a(L)(R-1)d(K). 


A kind of n-dimensional generalization of Theorem 3 has been given by Diskant 
[8]. However in the case n = 2 the result of Diskant is weaker than (15) (see also 
Schneider [32]). 


7. The Brunn-Minkowski Theorem. The Brunn-Minkowski theorem is one of the 
most important theorems in the theory of convex sets. In the case n = 2 it is the 
following statement concerning the area of the linear combination aK + (1 — a)L 
of two convex domains K and L. 


IfK € @*, LE @*, and 0 <a <1, then 
a(aK + (1—a)L)'” > aa(K)'” + (1—a)a(L)'” (16) 
In the case 0 < a < 1 equality holds if and only if K and L are homothetic. 


For the sake of simplicity let us consider the special situation when a(K) = 
a(L) = 1. (The general case can be derived from this restricted version by simple 
substitutions.) Under this assumption (16) can be written as 


a(aK + (l-—a)L)>1. 
The stability problem associated with this inequality is to estimate the degree of 
homothetic relationship of K and L if for some a (0 < a < 1) and e > 0 
a(aK + (l-—a)L)<l+e. 


In [14] where the n-dimensional case is considered it is shown that this inequality 
implies the existence of a translate L’ of L such that 


1 
h(K, L') < a8 | +2] Del3, 


a(1 — a) 
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where D = max{d(K), d(L)}. Using Theorem 3 we can here easily prove the 
following stronger result. 


THEOREM 4. Let K and L be two convex domains with a(K ) = a(L) = 1, and let 
a be such that 0 <a <1. If for some ¢ > 0 


a(aK + (1-—a)L)<l+e, (17) 
then there exists a translate L’ of L such that 


V17 1 


h(K,L') < 2 Ja(l—a) a) 


De'”’, (18) 


where D = max{d(K), d(L)}. 
Proof. We may assume that 


V17e /2Ja(1 — a) <1, (19) 


since if this is not true then (18) is trivially satisfied. Using the definition of 
a( K, L) we see that (17) implies 
(1— a)’ + 2ae(1 — a)a(K, L) +02 <1+e. 
This can also be written as 
a(K,L) -1<e/(2a(1 - a)). 


Consequently, using also (19), we find 


a(K,L)’-1< (1+ a(K,L)) < 5 


rica? * a) 


E 
2a(1 — a) 

E 18 
a(1 — a) 17 


Hence, Theorem 3 implies that there is a translate L’ of L such that 


n(K, L 5 € 18 17 € p? 
\ < ———- — < — ——- 
SUG -a) ~ 4a(l-—a) 
and (18) follows from this. 
As an illustration of the use of Theorem 4 we mention a stability result 
concerning central symmetrization. If K € @*, we set K={-p: p© K} and 
call the set 


1 
k* = >(K + (~K)), 
the convex domain obtained from K by central symmetrization. K* is obviously a 
centrally symmetric convex domain and the Brunn-Minkowski theorem shows that 
a(K*) > a(K) 


with equality if and only if K itself is centrally symmetric. Thus we meet the 
stability problem of deciding how much K deviates from a centrally symmetric 
convex domain if a( K*) < a(K) + «. The following corollary addresses this prob- 
lem. 
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COROLLARY. Let K be a convex domain of area 1. If a(K*) < 1+ «€ (for some 


e > Q), then there is a centrally symmetric convex domain Z (which is a translate of 
K*) such that 


h(K,Z)< ~ViTe d(K) 


Proof. If Theorem 4 is applied to K and — K we obtain a translate (— K) of —K 
such that h(K,(—K)’) < yl7e d(K ). If we set Z = (1/2)(K + (—K)’), then Z isa 
translate of K* (and therefore centrally symmetric) and 


1 
| H7(9) — Hx(6)| — 5 (Ax (4) + H._xy(0)) — Hx (8) 


1 1 
5 | Ax (9) — H_xy(8)| < 7K (-K)’). 


Hence, h(K, Z) < (1/2)h(K,(—K J’) and the desired result follows. 

This Corollary has applications in the theory of packings of convex domains. It 
enables one to estimate how much a convex domain K deviates from a centrally 
symmetric convex domain if K permits packings (by translates) of high density. See 
[16] for more details. An n-dimensional version of the Corollary together with an 
application, but with poorer constants in the case n = 2, is given in [14]. 

There is another kind of stability problem that can be associated with the 
Brunn-Minkowski theorem. Instead of assuming that (16) holds for a particular 
value a one may suppose that it holds for all a. Diskant [9] has investigated the 
stability of the Brunn-Minkowski theorem from this point of view. However, in 
most applications, like that concerning central symmetrization, the available infor- 
mation is restricted to only one value of a. 


9. Stability and Projections. To indicate the wide variety of stability problems in 
the theory of convex sets we discuss now a problem that is not exactly of the type as 
described in the Introduction. If K is a convex body and E a plane in R° let K, 
denote the orthogonal projection of K onto E. Assume now that every projection 
K,, differs (with respect to some deviation measure) at most e from a circular disc. 
What can be said about the deviation of K from a ball? To give a precise 
formulation of this problem let R(M) denote the circumradius and r(M) the 
inradius of a convex body M in R’ or R*. Among much more general results 
dealing with this problem the following theorem is proved in [13]: 


Assume K € @? and e > 0. If for every plane E in R° 
R(K,) —r(Kz) <e, (20) 
then 
R(K) —r(K) < 6(1 + 2y2)e. (21) 


We can use this theorem to prove the following result which, to some extent, 
answers a question of Chakerian [6]. The question is whether one can conclude that 
the isoperimetric quotient of a convex body in R? is large (i.e., close to 1) if every 
projection has a large isoperimetric quotient. In our context it is more convenient to 
work with the isoperimetric deficits p(K,)? — 47a(K,) and o(K)'”? — 
(367)'/°u( K )'7?, where o denotes surface area and v volume. 
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THEOREM 5. Let there be given a convex body K in R?’ and an « > 0. If for every 
plane E in R? 


p(K,) — 4na(K;) <e, (22) 
then 


a(K)'”* — (367)'°v(K)'” < 6(1 + 2y2 )ve. 
Proof. From (22) and Bonnesen’s inequality (13) it follows that 


R(K,) —r(K,) < TE. 


Tz 


Since (20) implies (21) we obtain from this inequality 


R(K)-—r(K)< (1+ 2/1) —E. 


Using this we find 
1/2 4 13 
o(K)'”? — (367)'/°v(K)'” < (47R(K)’) — (360)'"{ Sark) 


= (47 (R(K) — r(K)) < 6(1 + 2y2)ye. 


10. Concluding Remarks. There are many other theorems in the theory of convex 
sets that give rise to interesting stability problems. Actually most classical character- 
izations of simplices, balls, ellipsoids etc. have significant associated stability prob- 
lems. In some cases attention has to be restricted to much smaller classes than the 
class of all convex bodies. For example, it is known that a convex domain of given 
constant width and minimal area is a Reuleaux triangle. The associated stability 
problem is to determine how much a convex domain of constant width deviates 
from a Reuleaux triangle if its area is close to the minimal area. An answer to this 
problem is given in [15] and the analogous problem for equichordal sets is consid- 
ered in [17]. Some of the stability results that have been proved by the use of 
Fourier series can be generalized to the n-dimensional situation if instead of Fourier 
series One uses expansions in terms of spherical harmonics (see, for example, the 
recent work of Schneider [32]). 
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Separable Extensions and Diagonalizability 


FRED RICHMAN 
Department of Mathematics, New Mexico State University, Las Cruces, NM 88003 


1. Introduction. The notion of a separable field extension plays a major role in 
many treatments of Galois theory. As any algebraic extension of a field of 
characteristic 0 is separable, expositions of this notion tend to focus on fields of 
characteristic p. The idea appears even more esoteric because algebraic extensions 
of finite fields are also separable. Thus in a subject that grows out of the study of 
algebraic numbers, we must use rational function fields of characteristic p to give 
content to one of the central concepts. 

By enlarging the scope of the discussion to include extensions with zero-divisors 
we can construct natural examples of inseparability in characteristic 0, we get to 
observe the connection between separability and diagonalizability of matrices, and 
we don’t have to mention the characteristic of the field. As an added bonus, our 
theorems are more general. 

In this note we establish, in this enlarged context, the fundamental results that 
the separable elements in a finite-dimensional commutative extension of a field k 
form a ring, and, if k is infinite, the ring is of the form k[@]. The basic techniques, 
which are not new, are diagonalizing a matrix whose minimal polynomial has no 
repeated roots, and simultaneously diagonalizing commuting matrices. 


2. Separability and diagonalizability. A polynomial p over a field k 1s said to be 
separable if it is relatively prime to its formal derivative, that is, if there exist 
polynomials s and ¢ such that sp + tp’ =1. The polynomials s and ¢ can be 
constructed so that their coefficients are in the field generated by the coefficients of 
p; indeed the Euclidean algorithm does just that. It 1s easy to verify that the 
polynomial (X — r,)(X — r,)--:(X — 7,) 1s separable if and only if all the v are 
distinct. Thus a polynomial is separable if and only if it has no repeated roots in any 
extension field, so any factor of a separable polynomial is separable. 

Many authors define a polynomial to be separable if it 1s a product of polynomi- 
als that are separable in the above sense. This makes the separability of p 
dependent on the field k: for example, any polynomial is then separable over an 
algebraically closed field. For irreducible polynomials it is all the same, so for 
minimal polynomials of elements in an extension field of k, we can use either 
definition. However, if we allow zero-divisors in the extension of k, then the 
minimal polynomials need not be irreducible, and the two definitions are quite 
different. 

By the field of definition of a finite set of matrices over a field, we mean the 
(countable) field generated by their entries. If matrices are linearly dependent over a 
field k, then they are already linearly dependent over their field of definition: 
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indeed if we treat the matrices as if they were row vectors, then the usual procedure 
of Gaussian elimination will either result in a nontrivial dependence relation over 
the field of definition, or will demonstrate that the rows are independent over k. In 
particular, the coefficients of the minimal polynomial of a matrix lie in the field of 
definition of that matrix, so the minimal polynomial of a matrix does not depend on 
the particular field from which the entries of the matrix are considered to be taken. 
Also, if a matrix A can be written as a polynomial in a matrix B, then the 
polynomial may be chosen with coefficients in the field of definition of A and B. 

An n-by-n matrix A with entries in a field k is diagonalizable over k if and only 
if k” is the direct sum of the eigenspaces of A, that is, if k” admits a basis of 
eigenvectors of A. The connection between separable polynomials and diagonaliz- 
able matrices is 


THEOREM 1. Let A be a matrix with entries in a field. Then the minimal polynomial 
of A is separable if and only if A is diagonalizable over some field k. In this case, the 
projections of k" onto the eigenspaces of A can be written as polynomials in A. 


Proof. Suppose the matrix A is diagonalizable over a field k, and let A be the set 
of eigenvalues of A. Then A satisfies the separable polynomial II, ~,(X — A). 
Conversely, if A satisfies a separable polynomial, let k be a splitting field for the 
minimal polynomial f(X) of A over the field of definition of A. Write f( X) = 
(X — A)g,(X) for each root A of f. 

As (A — A)g,(A) = f(A) = 0, the matrix g,(A) maps into V,, the A-eigenspace 
of A. If » # A are in A, then A — p 1s a factor of g)(A), so g)(A)V, = 0. Finally, 
the polynomial %) < ,8)(X)/g,(A) takes on the value 1 for each A in A, and has 
degree less than the cardinality of A, so is identically equal to 1. Thus 
ye a8 ,(A)/g,(A) is the identity matrix, so g)(A)/g,(A) is the projection of k” 
onto V,. O 


The last sentence of Theorem 1 says that if A is a diagonal matrix, and P), is the 
diagonal matrix with 1’s where A has A’s, and 0’s elsewhere, then P, can be written 
as a polynomial in A. This fact will play a key role in proving the existence of 
primitive elements (Theorem 5). 


3. Simultaneous diagonalizability and primitive elements. The standard result 
regarding simultaneous diagonalizability of commuting matrices 1s 


THEOREM 2. Let A and B be commuting n-by-n matrices with entries in a field k. If 
k" admits a basis of eigenvectors for each of A and B, then k” admits a basis whose 
elements are eigenvectors of both A and B. 


Proof. Let V/’ and VY’ be the A-eigenspace of A and B respectively. Then 
k" = ®,V/ = ®,V by hypothesis. As B commutes with A — X, the subspace V;! 
is invariant under B, hence also under the projection 7,° of k” onto V,’, which is a 
polynomial in B. Therefore Vi‘ = LV 9 V,2, sok" =X) ViAVE O 

If A and B are elements of a commutative ring containing k, then we denote by 
k[ A, B] the subring of elements that can be written as polynomials in A and B with 
coefficients in k. 


COROLLARY 3. Jf A and B are commuting separable matrices, then any matrix, or 
inverse of a matrix, in k[ A, B] is separable. 
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Proof. By Theorems 1 and 2 we can simultaneously diagonalize A and B. Then 
all the matrices in question are diagonalizable, hence, separable by Theorem 1. O 


Results about matrices over k can be extended to elements in arbitrary finite- 
dimensional extension rings K of k through the device of passing from an element 
a in K to the matrix of the linear transformation from K to K induced by 
multiplication by a. An element of a commutative ring extension of a field k is said 
to be separable over k if it satisfies a separable polynomial over k, or, equivalently, 
if its minimal polynomial is separable. 


COROLLARY 4. If K is a commutative ring containing a field k, then the elements of 
K that are separable over k form a ring. 


Proof. It suffices to show that if a and f are separable over k, then every 
element of the ring k[a, 8B] is separable over k. The ring k[a, 8] is a finite-dimen- 
sional vector space over k as it is generated by the monomials a‘B/ where i and j 
are less than the degrees of the minimal polynomials of a and £ respectively. Let T) 
and 7; be the linear transformations given by multiplication by a and # on the 
finite-dimensional vector space k[a, 8]. Observe that k[7,, T,] is naturally isomor- 
phic to k[a, B]. Let A and B be the matrices of 7, and 7; with respect to some 
basis of k[a, B]. By Corollary 3 any matrix in k[A, B] = k[T,, Tz | 1s separable, 
hence any element of k[a, 8] 1s separable. O 


Proofs of the existence of a primitive element for finite-dimensional separable 
extensions K of k, that is, an element @ such that K = k[@], divide into two cases 
depending on whether k is finite or infinite. The finite case follows from the fact 
that K is also finite; so we can choose @ to be a generator of the multiplicative 
group of K. If we allow K to have zero-divisors, then there need not be a primitive 
element in the finite case: for example, let K be the ring of 3-by-3 diagonal matrices 
over the two-element field. The infinite case, however, goes through in complete 
generality. By induction it suffices to show that k[ A, B] = k[C] where A and B are 
commuting separable matrices. 


THEOREM 5. If A and B are commuting separable n-by-n matrices over a field k of 
cardinality greater than n(n — 1)/2, then there exists an element c in k such that A 
and B can be written as polynomials in A + cB with coefficients in k. 


Proof. We may assume that A and B are diagonal with diagonal elements 
da,,..., a, and b,,..., b,. Choose c distinct from (a, — a,)/(b, — 6,) for each pair i 
and j such that b, #5. Then a; + cb, # a, + cb, whenever a, # a, or b, # 6, It 
follows from the last sentence of Theorem 1 that A and B can be written as 


polynomials in A+ cB. O 


If all we want is a matrix C such that k[A, B] = k[C], the lower bound 
n(n — 1)/2 for the field in Theorem 5 can be replaced by n — 1; this bound is 
sharp in that, over a field of cardinality n — 1, all the n-by-n diagonal matrices can 
be written as polynomials in two diagonal matrices, but not in one diagonal matrix. 
To establish the bound n — 1, we may assume, as before, that A and B are diagonal 
with diagonal elements a,,...,a, and b,,...,6,. It suffices to construct C in 
k[ A, B] whose diagonal entries differ whenever the corresponding entries of A or B 
differ, because then we can write A and B as polynomials in C by the Lagrange 


398 M. D. HIRSCHHORN [May 


interpolation formula. By Theorem 1 the diagonal matrix EF, whose mth entry 1s 1 if 
(a,,, b,,) = (a,, 6,), and 0 otherwise, is in k[ A, B] for each i. Note that either 
E, = E, or E,E, = 0, and the sum of the distinct E, is the identity. As k has at least 
n elements, we can let C be a linear combination of the distinct E, with distinct 
coefficients from k. 

The hypotheses of Theorem 5 can be weakened so that only A is assumed to be 
separable over k; here is a brief sketch of how to do that. Put A in block diagonal 


form, diag(A,,..., 4,,), where each block A, is a scalar multiple of an identity 
matrix, and the scalars are distinct in different blocks. As B commutes with A, we 
must have B = diag(B,,..., B,,), where the block B, is of the same size as the block 


A,. Then put each B, in Jordan canonical form; this does not affect the blocks 4, 
because A, commutes with everything. Now choose c, by Theorem 5, so that A and 
the diagonal of B can be written as polynomials in the diagonal of A + cB. It 
remains to show that the diagonal of A + cB can be written as a polynomial in 
A + cB. This is true for any block triangular matrix M in which each block has a 
constant diagonal: if the minimal polynomial f(X) of M 1s of the form 
(X — r)°e(X), and s(X)(_X — A)° + t(X)g(X) = 1, then the matrix ¢(A)g(A) is 
diagonal, with 1’s where M has X’s, and 0’s elsewhere; the diagonal of M 1s a linear 
combination of these matrices. 


A Birthday Present for Ramanujan 


M. D. HIRSCHHORN 
University of New South Wales, Kensington, NSW, Australia 2033 


Recently, Frank G. Garvan and George E. Andrews made an exciting discovery. 
Given a partition 7, let A() denote the largest part of 7, u(a) denote the number 
of ones in 7 and p(7) denote the number of parts of 7 larger than p(s). Define the 
“crank” of the partition as follows: 


Then the partitions of 5n + 4 split into 5 equinumerous classes according to the 
residue modulo 5 of the crank, the partitions of 7n + 5 split into 7 equinumerous 
classes according to the residue modulo 7 of the crank, and the partitions of 
lln + 6 split into 11 equinumerous classes according to the residue modulo 11 of 
the crank. Thus Garvan and Andrews have succeeded in giving a combinatorial 
explanation of the congruences due to Ramanujan, p(5n + 4)=0 mod 5, 
p(in + 5) = 0 mod 7 and p(i1n + 6) = 0 mod 11. 

The significance of this discovery is that the existence of such a “crank” was 
postulated more than forty years ago by Freeman Dyson, but it has remained 
hidden for so long. 
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The proof that the crank does what is claimed comes in two parts. First it 1s 
shown that the crank is generated (essentially) by the infinite product 


(4; ee (a; q)o = [] (1 - aq")}. 


(aq 'q; 4) oo (49; dD) e | n>O 


The second part of the proof consists in showing that when a is replaced by a fifth, 
seventh or eleventh root of unity and the product is expanded, certain powers of q 
do not appear. 


Thus if 7 is a fifth root of unity, we have the identity, to be found in the so-called 
lost notebook of Ramanujan, 


2 
(q; q) _ 1 (q°; g?°)_.( gs; q?°)..( 4°; gq’). 
(nga) de (450). (9°54?) 4795 9?) co 


+(n 14 n')q(q°; q°),.(q"; q?°) (4°: q°). 
—(n 414 no!) 4243: q?°),( q?°; q?°),.( 45; qs). 


(q°; q°)_.( 4; q?°)_.( 4; qs). 
(q'°: q?) (qs; q?).. 


That no powers of g congruent to 4 modulo 5 occur on the right of this identity 
provides the proof that the partitions of 5n + 4 split into 5 equinumerous classes as 
claimed earlier. 

Frank Garvan has found the corresponding identity for a seventh root of unity, 
in which no powers of g congruent to 5 modulo 7 appear, namely, 


-(n+77')q 


(93 Vac 1 


Nee tn, 49)? (28. 49)? (49. 49)? 
(0795 dQ) oc (4: Vex Ging, C4 I (as a (95 4) 


+(n-Ltn')q ass 9) .(475 1) 0978s 4) 047s 44?) 6 (9%: 9°). 
+( + 0-7) a?(q'4s a) ,,( 45; a). (4s a) = 
—(P tata +977) q3(qs a?) (47s a) (478s 4°?) 6475 9°) (43 0") 5 
—(n +0) q8(475 4 Yan as goa 4255 4 Van 425 49) oo (4°93 9°) 
—(n +1407?) q5( a7 4) 2.( 45 4) -0( 4% 4) } 


Further, he has shown that in the corresponding identity involving an eleventh root 
of unity no powers of g congruent to 6 modulo 11 appear, thereby completing the 
proof. 

I have found a simpler form than Garvan’s of this last identity, and would like to 
present it here, on the occasion of the centenary of Ramanujan’s birth, in memory 
of the man whose life and work have been an inspiration to me and to so many 
others. 
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The identity is 


(Be 
(07 "95 4) 04093 9c 


1 
= (gig 7) x {(4q”; gq) (4°; g") (q": gi) (4: ge). 


(g%3 1) (475 FD (as F4) (99s F(t; gt), 
+(n—1L+m')q(q's g?).(9%; a) ..(4s a4), 
(g% g*)2.(g775 g) (qh; g) (qs gi!) 
+(m +07) 9?(q?s G4) ,.(43s 2). (43g) x 
(45g) (gs g) (4s @) (gts gt). 
+(m + 140-9) g(q?s (44s @)..( 455 gi), 
(4°°5 g?").(4775 gi") (49s g4)2,(g74; gi), 
+n +P +l tn? +94) g4(gh; g4) (4; g4) (4g; g)..( 9%; q!4),, 
(93g) 475 @) (as FD (gs a4) (gts gi), 
-(n tor tanta yg gs gy (42s Fe) o( qs a). 
(q75 9) (as @) (gs g4) (gts gi), 
+(nt tant tay) g( gs ge) (4775 1 las Feo 9s Tew 
(G5 @) (gs F975 1) co( gts gi). (gts g4)-, 
+(n toto tay tat a4) g(t; gi). (q?s a ).(03s a). 
(g®; g4) (G25 ge) (gs gi) .( gis gi), 
—(n + 14074) g (gs a) (475 Fas 1) 04°51") 
(gs g4) (gs G4) ,,(929s F 2) 40( Gs Qe co gs G4), 
—(m + 3) q(t; gt) (4775 F095 ol as 9") 


-( ql’: gi?) (q®; g) (4; g) (gq; g) (q; gy7.}. 
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A Condition for Flatness of Curves in R” 


YAAKOV S. KUPITZ AND MICHA A. PERLES 
Institute of Mathematics, Hebrew University of Jerusalem, Givat Ram, 91904 Jerusalem 


Introduction. The following two exercises are standard in a first course on the 
differential geometry of curves, and appear in this form or another in many 
textbooks. (See, e.g., [1, Ex. 8 on p. 97] or [2, Ex. 4.10 on p. 74 and Ex. 4.20 on 
p. 77].) 

1. Prove that if all tangent lines to a curve pass through a fixed point, then the 
curve lies on a straight line. 

2. Prove that if all the osculating planes to a curve pass through a fixed point, 
then the curve lies in a plane. (An important special case: the path of a free particle 
in a central force field is planar; for example, planetary motion.) 

The connection between these two problems is obvious, and they just “call” for a 
common generalization. Unfortunately, this call is ignored in the standard text- 
books. In this note we present such a generalization: “If all the osculating k-flats to 
a curve in R” (k fixed, 1 < k < n) pass through one point, then the curve lies in a 
k-flat.” 

For k = 1 and k = 2, this generalization reduces to the assertions of exercises 1 
and 2. 

Another point left obscure in the textbooks is: what smoothness conditions are 
needed in exercises 1 and 2? The wording of the exercises and their solutions in [2] 
seem to indicate that the curves in question should be of class C**! (for k = 1 and 
k = 2, respectively). At any rate, since the definition of the osculating k-flat (see §1) 
involves the first k derivatives of the curve at a point, it is natural to assume (at 
least) that the curve is k times differentiable. We shall, in fact, prove the generaliza- 
tion under exactly this “minimal” smoothness condition, not assuming even the 
continuity of the kth derivative. See, however, §5 for a possible attempt to relax the 
smoothness conditions even further. 

For a good introduction to curves in R” we refer the reader to [3]. 


1. Statement of results. Consider a curve in R", n > 2, 1.e., a continuous map 
X:I— R". Here J is a real interval, which may be bounded or unbounded, with or 
without endpoints. For J Cc J, put X(J) = { X(t): t © J}. For fixed k, 1 < k <n, 
we shall seek conditions to ensure that X(J) is included in a k-flat, ie., in a 
k-dimensional affine subspace of R”. Since the conditions that we are going to 
impose pertain to the behavior of X at interior points of J only, and any k-flat that 
includes X(intJ) includes X(J), we shall assume, without loss of generality, that J 
is open. 


DEFINITION 1. The curve X: I > R®” is a k-curve if: 

(a) X is k times differentiable; (We denote by X(t) the ith derivative of X at t.) 
and 

(b) for each t € I, the k vectors X(t), X(t),..., X(t) are linearly indepen- 
dent. (In particular, X“(t) # 0 for all t € I.) 


Note that we do not assume continuity of the kth derivative. 
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Define also, for t € J: 
Lo(t) = span{ X(t),..., X(t) }, 
L(t) = X(t) + L(t). 


We call L(t) “the osculating k-flat of X at ¢.” 
For any subset S of R", let aff(.S) be the affine span of S. 


DEFINITION 2. A k-curve X is flat if X(1) is included in a k-flat, i.e., if aff( X(1)) 
is k-dimensional. 


THEOREM. [f all the osculating k-flats L(t) of a k-curve X: I > R" pass through a 
common point q, then X is flat. 


The proof of the Theorem will be given in §2—4. In §2 we make some general 
observations concerning k-curves. These observations imply that if the Theorem 
holds locally, in a neighborhood of each point t € J, then it holds globally. In §3 we 
prove a local version of the Theorem under an extra regularity condition. In §4 we 
use an inductive argument to prove the local version of the Theorem without the 
extra condition. 

The following example shows that the Theorem fails if the independence condi- 
tion (b) in Definition 1 is violated even at one point. 

Example. Fix k and n,1< k <n, take J = R, and put 


X(t) = (x,(¢),...,x,(t)), where x,(t)=t/ for l<j<k-1, 


t<*! fort>0 
0 fort < 0 


0 fort >0 
Xp44(t) = ( 


x,,(t) =| 


t**! fort <0 


and x(t) = 0 fork + 2 <j <n. For an example of class C™, replace t**! by a 
function @(t) of class C® that satisfies: p(0) = 0 for all i > 0, p(t) ¥ 0 for 
t # 0; e.g., the kth antiderivative of exp(—t7’). 


2. Some preliminary observations. Let X be a k-curve. 

Suppose J is an open subinterval of J, LC R" is a flat, and X(J) Cc L. Put 
Ly = L — L. (Lo is the translate of L through 0.) If t € J, then X(t) is the limit 
of scalar multiples of differences of points in X(J); hence X(t) € Lo. The same 
holds if ¢ is an endpoint of J, provided ¢ € J. Repeating the same argument, we 
find that X(t) © L, for i= 1,2,..., k, and for all t © 1M clJ and, therefore, 


L(t) = X(t) + span{ X(t),..., X(t)} C X(t) + Ly = L. 


It follows that dim L > k, and if dim 1 = k then the last inclusion becomes an 
equality. Thus we have: 


PROPOSITION 1. If J is an open subinterval of I, L is a k-flat and X( J) C L, then 
L= affxX(J) = L(t) forallte TaclJ. 


Call an open subinterval J of I a flatness interval (= FI) if X(J/) is included in a 
k-flat. From Proposition 1 it follows that any FI J is included in a unique maximal 
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FI (= MFI) J. (J is the union of all FIs that include J.) Moreover, the MFIs are 
open and pairwise disjoint. Another consequence of Proposition 1 is 


PROPOSITION 2. If J, and J, are two disjoint FIs with a common endpoint s, then 
J, U {s} UJ, is also a FI. 


Proof. By Proposition 1, X(J, U {s}UJ,) C L(s). O 


In particular, two distinct MFIs cannot have a common endpoint. 

Denote by ¥% the set of all MFIs, and put F=U,.,J. By the previous 
discussion, ¥ is just the set of connected components of F. In the next two sections 
we shall show that F = J. This will imply that = {J},1.., that J is a FI, and the 
theorem will be proved. 


3. Proof of the theorem—part I. Recall the assumptions: X: J > R” isa k-curve, 
and all the osculating k-flats L(t)(t € I) pass through a common point g. Assume, 
without loss of generality, that g = 0. (Otherwise, consider the curve X — g.) The 
meaning of the assumption 0 € L(t) is: X(t) is linearly dependent on 
XO(t),..., X(t). 

For t€J and i=0,1,2,...,k, write X(t) = (xf?(0),..., x©(4)), where 
X(t) = X(t), and consider the matrix 


X(t) 
X(t) = . = (x(t): 0<i<k,l<j<n). 
X(t) 


X(t) is a matrix of order (k + 1) X n and of rank k. 

Denote by X‘(t) [X‘(t)] the submatrix that consists of the first [resp., last] k 
rows of X(t). By assumption, X'(t) is a differentiable function of t, for t © J, and 
X‘(t) isa k Xn matrix of rank k. 

The next Lemma is a local version of the main theorem, with one extra condition. 


LEMMA. Suppose that t, € I, and that the rank of X"(to) is k (i.e., X(to) is not a 
linear combination of X(t)),..., X“~ (t,)). Then to lies in a flatness interval of X. 


Proof. The matrix X'(t,) has a nonsingular k x k submatrix, i.e., there is a set 
Kc N(= ({1,..., n}) of size k, such that the determinant 


A(t) = det(x(?(t):0<i<k-1,j€K) 


is nonzero for t = ty. Since A(t) is continuous (even differentiable), there is an open 
interval J, tj © J C J, such that A(t) # 0 for all ¢t © J. 
Consider a coordinate / € N \ K. For t € J, the columns of X(t) with indices in 
K are linearly independent, and form a basis of the column space of X(t). Thus the 
/th column of X(t) can be written as a linear combination of the basis columns. The 
coefficients of the combination can be calculated already from the submatrix X ‘(r), 
by Cramer’s Rule, as a quotient of two subdeterminants of X‘(t), where the 
denominator is A(t). Write this explicitly: 
X(t) = LA (t)xO(t). E(i, 1, t) 
JEK 
(i is the row index,O<i<k,/@N\K andteEJ.) 
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From the previous discussion it follows that the A,(t)’s are differentiable 
functions of t. Differentiating equation F(i, /, t), for 0 <i < k — 1, we obtain 


xp P(t) = © xi(0 = LA, (t) x7" P(t) 


JEK 


+ VA, (t) x(t). 


jJEk 


The first row in this equation is actually the identity E(i + 1, /, t). Subtracting it 
we get De A, x 6(2) =O0(0<i<k-l1/lEN\K,tEJ). 

For any fixed /<© N\ K and ¢ © J, this is a system of k homogeneous linear 
equations in A unknowns A, (t), and the coefficient matrix (x(t): 0<i<k-—-1, 
j © K) is nonsingular (its “determinant ‘ A(t)). It follows that X, (t) = 0 for 
[IE N\K, j€K andteJ,so 4,(t) =A,, is constant int forte J, 1EN\K 
and j © K. Thus we find that X(J) is included in the linear subspace of R” 
determined by the equations F(0, /, t), 1.e.: 


= A,X, LEN\K. 


JEK 


(For j € K, the coordinates x, are free, and for / © N\ K, x, 1s a linear combina- 
tion with constant coefficients of the coordinates {x; 7 © K}.) O 


4. Proof of the theorem—part II. Recall the notations from §2: .% denotes the 
set of all MFIs, and F is their union. All we have to show is that F = J. By the 
Lemma, F contains all points ¢ © J for which rank X‘'(t) = k. Define: S = I\ F. 
We shall complete the proof by showing that S = 9. 


PROPOSITION 3. S' has no isolated points. 


Proof. If s is an isolated point of S, then s is the common endpoint of two 
distinct components of F, ie., of two distinct MFIs. But this is impossible, by 
Proposition 2. O 


Now suppose s © S. From the Lemma we learn that rank X'(s) < k. Since 
X1(s) has k rows X(s), X(s),..., X“~(s), and the last k — 1 rows are linearly 
independent, it follows that 


X(s) is a linear combination of X(s),..., X“~(s). ( * ) 


We shall show that X(s) is, in fact, a linear combination of X(s),..., X%~?(s) 
(assuming k > 2). 

Take a set K CW of size k, and consider the k x k determinant 6,(t) = 
det(x(t): O0<i<k-—1, j © K). By the Lemma, 8x(t) = 0 for all tS, in 
particular for ¢ = s and for a sequence {s,,}%-, Of points in S\ {s} that converge 
to s. (Remember: S has no isolated points!) It follows that 6 x(5) = 0. But the 
derivative of the k x k determinant 6,(s) is obtained by differentiating the k rows 
one at a time and adding the results. Since the derivative of each row of X'(s), 
except the last, is equal to the succeeding row, all that remains from the differentia- 
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tion is 
0=8,(s) = det(x?(s): i€ {0,1,...,.k-2,k},j¢€ K). 


Since this is true for all choices of K (K CN, |K| =k), we conclude that the 
matrix whose rows are X(s), X(s),..., X*~?(s), X(s) is of rank <k, ie., 
its rows are linearly dependent. It follows that X(s) is a linear combination of 
the (linearly independent) rows X (s),..., X%~(s), X(s). This, together 
with statement (*) above, shows that X(s) is a linear combination of 
X(s),..., X%~ 2s) (since the representation of X(s) in terms of the linearly 
independent vectors X“(s),..., Xs) is unique), and this conclusion holds, of 
course, for all points s € S. 

Repeating this argument k — 2 more times we finally conclude that X(s) 
depends linearly on the empty set, 1.e., that X(s) = 0 for all s © S. Differentiating 
once more, we find that Xs) = 0 for all s © S, and therefore S = $, since by 
assumption X(t) #0foralrel O 


5. An open problem. Consider a continuous curve X: I > R", a point ¢, € int J 
and a k-flat L C R". Call L an osculating k-flat of X at ty if X(t.) € L, and 


dist( X(t) +h), L) =o(h*) ash 0. ( * *) 


If (* *) holds only for h < 0 [or for h > O], then we say that L is a /eft [resp., 
right] osculating k-flat of X at fo. 

Assume that X admits at each point ¢ € int J a unique left osculating k-flat 
L_(t), and a unique right osculating k-flat L,(¢), and that L_(t) = L,(t)(= L(2)) 
for all ¢ € int J. Assume also that all these osculating k-flats L(t) pass through a 
common point. Does it follow that X(J/) lies in a k-flat? One might add some extra 
conditions, e.g., that X(s) is parametrized by arc length. 

It is not sufficient to assume that X admits at each point ¢ € intJ a unique 
osculating k-flat, and that all these k-flats pass through one point. This is shown by 
the following example: k = 2, n = 3, 1 = R, and 


(At ¢ = 0, the unique left osculating 2-flat is the (x,, x,) plane, but every plane that 
includes the x,-axis is osculating on the right.) 
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A Counterexample on Vector Norms and the Subordinate 
Matrix Norms 


PIETER P. N. DE GROEN 
Vrije Universiteit Brussel, Department of Mathematics and Computer Science 
Pleinlaan 2, B-1050 Brussels, Belgium 


In an introductory course on numerical linear algebra, or in a good textbook 
suited for such a course (e.g., Golub and Van Loan [1]), one begins with some facts 
on vector and matrix norms. Examples of such norms are given, and it is shown that 


a vector norm || - || on a linear space EF induces a matrix norm on the set of linear 
transformations from F to E, again denoted by || - ||, 
| Ax] | 
All = (A: E> EB). 


x€F,x+#0 {|x || 


This matrix norm, called the subordinate matrix norm, has the additional property 
of being submultiplicative with the norm of the identity equal to one, 


JAB < ANSI, I= 1 (4, B, 2: E> BE, Ix = x Vx). 


Next, the Frobenius norm (||.A||Z:= ©, j4,) is given as a counterexample that not 
every matrix norm is subordinate to some vector norm. However, the Frobenius 
norm of the identity is equal to the square root of the dimension of the underlying 
vector space E£, so it does not falsify the converse question: Is every submultiplicative 
matrix norm, in which the norm of the identity is equal to one, subordinate to a vector 
norm? If you pose this question to a colleague, he or she immediately answers “no” 
(unless they consider the n X n matrix A as a transformation of the n-dimensional 
space of n X n matrices by matrix multiplication). If you continue and ask for a 
counterexample, they are unable to supply either such an example or a reference in 
which it could be found. Here is a construction of a simple counterexample. 

Let us assume that || - ||, is the following matrix norm on R?**, depending on a 
positive parameter A, 


Qi; 412 
All, = max {lay,| + Alay], |@| + Alaa |}, A i= a a) 

If A = 1, this is the matrix norm subordinate to the vector sup-norm. We shall 
show, that it satisfies the requirements mentioned above and that it cannot be 
subordinate to a vector norm for any A > 1. 

The strict positivity, the triangle inequality, and the homogeneity of degree 1 are 
trivially satisfied by this matrix function, implying that it is a norm. Moreover, the 
norm of the identity is equal to one. For the submultiplicativity we consider 


matrices A and B, 
_ [Ps bio 
b>, by 


|459| + Alayy|, 
|ba| + A] Bo]. 


Qi, 442 
A:= , 
Q5, 47 


For simplicity we assume 
Alla = az] + Alay! 
Fil, = [Pail + A|b,9| 


VW W 
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For the absolute row sum of the first row of the product AB we then find: 
|4415)) + Ay pby,| +A] Q11b15 + Ay2b| < 1a, | (bi + A|by|) + A] 4y5| (|bo9| + A|b5,|) 
< |All, IBlly, 


provided A > 1. Since the same holds for the sum of the second row, this implies 
that the proposed norm is submultiplicative for any A > 1. 

Let us now assume that there exists a vector norm, to be denoted by || - ||, such 
that || - ||, 1s subordinate to this vector norm. We first show that this norm must be 
monotone. Let e, ‘= (1,0)" and e, = (0,1)’ be the “unit” base vectors in R? and 
let A and B be the matrices 


{1 0 _{9 0 
A=(t °) and B (F ’). 


For all vectors z © R*, z = (2, z,)’ we have 


= Aly > ee en elt > levies 
— AF AIT J V1 Syl 
Iz\| (Iz\| 
Using B we likewise can show: |lz|| > |z,| |le,||, for all z © R’. 
Now we can derive a contradiction in the case A > 1. Let C and D be the 
matrices 


c=(9 1), ans =(° 9) 


0 0 1 0 
Since the matrix norm || - ||, is subordinate to the vector norm ||- ||, a nonnull 
vector x exists, such that 
| Cx] [x2] 
A=|ICll, = = lle, ||. 
Ix] Ix] 


Hence the monotonicity of the norm implies 
|X] |ler}] = Allx|| => Al] lel], = lel > Allesll. 


Likewise we show with the aid of the matrix D that |le,|| > Alle,||, which is 
impossible for any A > 1! Clearly the matrix norm is not subordinate to any vector 
norm. 

Finally we find that this example fits nicely in a more general theory and clarifies 
Stoer’s [2] characterization of the subordinate matrix norms as the minimal 
(sub) multiplicative ones. A submultiplicative matrix norm p(-) 1s called minimal if 
any other submultiplicative norm p(-) satisfying 4(A) < »(A) for all matrices A is 
equal to v. Obviously, in the class of || - ||,-norms the sup-norm is the minimal 
element. 
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Ambiguous Numbers Are Dense 
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1. Introduction. It is well known that in “classic” radix number systems with 
integer base b > 2 and digits 0,1,..., 5 — 1 every positive b-ary fraction has two 
distinct representations: for example, 


1 : 000 .. (10) =(0: 999 | (10) = 1. 


Are there number systems in which every real number has a unique representation? 
The purpose of this note is to show that for any real positional number system Y, 


1) if Y can represent all real numbers, then the set of numbers representable in 
SY in more than one way is dense in R, and 

2) if representation in ¥ is unique, then the set of numbers not representable in 
FY is uncountable and dense in R. 


The proof of these results is an instructive exercise in elementary point-set 
topology. As often in discrete mathematics, the relevant topology is that of a Stone 
space (= compact, totally disconnected Hausdorff space). 

Note that the situation is quite different with representation of integers, since in 
this case, there are many number systems in which every integer is uniquely 
representable (see Example 2.4). For related work concerning the ambiguity prob- 
lem for representation of integers, see Maurer, Salomaa and Wood [9], Culik II and 
Salomaa [2], and Honkala [3, 4, 5]. 


2. Positional number systems 


2.1 DEFINITION. By a real positional number system we mean a two-way sequence 
of sets of positional values (P,.), <z, where 

(i) for every k & Z, P, is a finite set of reals containing 0, 

(11) the series 


» max |p,.| 
k<Q Peer 


is convergent. 
2.2 Example. In our conventional decimal system, P, = {d- 10*|d = 
0,1,...,9}. Note that what we call positional values here are not digits but rather 


the numerical values represented by them. For instance, the (decimal) number 32 
has digits 3 and 2, but it has positional values 30 and 2. 


2.3. If S= (P,), ez 18 a real positional number system, let 
R, = [|] {0} x []?,, forn eZ, 
A>n An 
and 


R= UR,. 


nEZzd 
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Thus R is the set of all two-way sequences of positional values in which all positions 
from some place on (in direction of increasing subscripts) are filled with zeros. 
Define the valuation function v: R > R by setting 


v(p) = Li Pe 
k<n 

where p = --- 00p,p,-1P,—2 °** iS a sequence from R,. By Definition 2.1(ii), the 
series on the right-hand side is absolutely convergent. The number vu( p) is the value 
of p, and p is a representation of v( p). 

A representation 1s finite if p, = 0 for all k <n, for some integer n, and integral 
if p, = 0 for all k < 0. 

A number 1s representable in £ if it belongs to v(R), and ambiguous in ¥ if it 
has more than one representation in Y. 

The system ¥ is complete over R if v(R)=R, and unambiguous if v is 
one-to-one. 


2.4 Example. Let D be a finite set of reals containing 0, and |b| > 1. Let 
S(b, D) denote the radix system with base b and digit set D, that is, the positional 
number system with P, = b* - D, for k € Z. Of prime interest are of course integer 
radix systems where the base and the digits are integers. In Matula [7, 8] a digit set 
DC Zis called basic for b © Z if every integer has a unique integral representation 
in “(b, D). The following are some results about basic digit sets from [7] and [8]: 


1. If D is basic for b then Y(b, D) is complete over R. 

2. If D is basic for b then D is a complete residue system mod |b]. 

3. If D consists of |b| contiguous integers which include 0, and either D contains 
positive and negative values, or b is negative, or both, then D is basic for b. 

4. There are, however, many noncontiguous basic digit sets. In fact, if |b| > 3 
then there are infinitely many basic digit sets for b; for instance, the sets 
{ —1,0, 3” — 2} are basic for b = 3, for all positive integers n. 

5. If D is basic for b then the set of numbers ambiguous in ./(b, D) 1s infinite, 
and may even be uncountable if the digits are not contiguous. Actually, the 
construction given in [8, Lemma 11] shows that this set is dense in R. 


Thus the balanced ternary system with b = 3 and D = {—1,0,1}, the “nega- 
ternary” system with b = —3 and D = {0,1,2}, and the system with b = 3 and 
D = {-—1,0,7}, are all complete over R. Here are some examples of ambiguity in 
these systems: 


0-111... . =1-(-1(-D(FD... « = 1/72 
1 - 2020... (3 = 0- 0202... ,.,, =1/4 
0-777... 4 =(-)7- (“1-1 D... «) = 7/2. 


2.5 Example. As an example of a so-called mixed-radix system, consider the 
factorial system where the role of the powers is played by factorials and their 
reciprocals (...3!,2!,1!;1/2!,1/3!,...). The digits allowed with n! or with 
i/(n + 1)! are 0,1,..., 7, so that 


P, = {d- (\k| + I!"*|d = 0,1,...,m,}, 
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where 


(kat fk >0; 
My 


—k, fk <0. 


This system and its generalizations were treated by Cantor in [1]. It is not difficult 
to see that every nonnegative real number is representable in the factorial system, 
that every nonnegative rational number has a finite representation in it, and that 
every positive rational is ambiguous in it. For example, 


] 000... (7) = 0» 1234... (5) = 1. 


For further examples of real and complex positional number systems, and for a 
history of number representations, see Knuth [6, pp. 162-180]. 


3. Ambiguity and completeness of positional systems. Let be a real positional 
number system. For each n € Z, let v,: R, > v(R,) be the restriction of v to R,. 


3.1 LEMMA. Jf for some n © Z the mapping v, is one-to-one, then the set of 
representable numbers v(R) is meager. 


Proof. Endow each set of positional values P,, as well as the singleton {0}, with 
the discrete topology. Being a product of finite discrete spaces, each R, becomes a 
compact totally disconnected Hausdorff space. Using Definition 2.1(11), one sees 
easily that the functions v, are continuous. As they map a compact space into a 
Hausdorff space, they are closed as well. Hence each v(R,,) is a closed set of reals. 

Assume that v, is one-to-one. Because it 1s continuous and closed, it is a 
homeomorphism of R, onto v(R,,). Therefore, v(R,,) is totally disconnected. As a 
set of reals, it has empty interior. 

Let 


F,= (TI, x II {0} OR. 


By Definition 2.10), F, R, is finite, for every k € Z. This implies that F,, is 
countable. Obviously, every representable. number can be written as a sum of a 
number from v(F,) and a number from u(R,,). Therefore, 
v(R) = v(F,) + o(R,) = U(x + 0(R,)). 
x€v(F,) 
This is a countable union of translates of v(R,,), which is a closed set with empty 
interior. It follows that v(R) is a meager set. 


3.2 THEOREM. (1) If Y is complete over R, then the set of numbers ambiguous in S 
is dense in R. 

(i) If SY is unambiguous, then the set of numbers not representable in F is 
uncountable and dense in R. 


Proof. Recall that by Baire’s Category Theorem, R is not a meager set. 


(i) If Y is complete then v(R) =R and is therefore not meager. Hence it 
follows from Lemma 3.1 that no vu, is one-to-one. Choose x © R and e > 0. 
As # is complete, x = v( p) for some p = ---O00p,p,_, --: By continuity 
of v,, there is an me€Z such that y+ v(R,,) C (x — €,x + €) where 
y= v0(-:: OOP, D1 °+* Py+100--:). Since v,, 1s not one-to-one, R,, con- 
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tains two distinct representations of some z € v(R,,). But then (y + z) € 
(x — €, x + €) is also ambiguous. Thus we have shown that every interval of 
positive length contains ambiguous numbers, which means that they are 
dense in R. 

(ui) If is unambiguous, then each v, is one-to-one. By Lemma 3.1, it follows 
that v(R) is a meager set of reals. Hence its complement R \ v(R) is an 
uncountable dense set. 


3.3 Remark. It is possible to modify these results for systems with nonnegative 
positional values (such as the “classic” radix systems, or the factorial system), by 
everywhere replacing R with R*U {0}. 
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The Diophantine Equation x? + y”™ = z7", Addendum 


Davip W. BoyD 
Department of Mathematics, University of British Columbia, Vancouver, B.C., Canada V6T 1Y4 


Professor Andrzej Schinzel has kindly informed me that the main result of [1], 
that the equation of the title has infinitely many primitive solutions if (m,n) = 1, 
follows from Corollary 2 of his paper [2, p. 49]. A special case of his general result is 
that the equation in question is birationally equivalent to X? + Y(" = Z?. Since 
the trivial equation X* + Y = Z* and the Pythagorean equation each have in- 
finitely many primitive solutions, the result follows immediately. 

The idea is to multiply the equation X? + YY"? = Z* by Y74Z*° where a and 
b are chosen so that the three terms become the desired powers. This is possible by 
the Chinese remainder theorem. For example, to obtain x* + y? = z*, multiply 
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X*+ Y=Z* by Y%Z® and then define (x, y, z) = (XY‘4Z?, Y°Z?, YZ). For 
example, 1* + 3 = 2° gives the solution (2734, 273°, 2737) of x? + y? =z‘. 

Conversely, a solution of x* + y? = 24 yields the solution (X, Y, Z) = (x, y3, z”) 
of X* + Y = Z’. Applying the process of the previous paragraph to (X, Y, Z) leads 
to a solution (x’, y’, z’) of x* + y? = z* which is in the same class as the original 
(x, y, Z), Showing that this method yields all (classes of) solutions of the equation of 
interest. 

The approaches of [1] and [2] are quite different but both are suitable for 
inclusion in an elementary number theory course. One obvious advantage to the 
approach used in [2] is that the number of parameters needed to represent all 
solutions does not increase with m. 
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Errata, addenda, etc. 


1. Dr. Arthur T. Benjamin informs us that the main result of his note The 
bisection method: which root?, this MONTHLY 94 (1987) 861-863, had appeared 
previously in a note by George Corliss, Which root does the bisection algorithm 
find?, SIAM Rev., April, 1977. 


2. Jingcheng Wong, of the University of North Florida, has noted an error in On 
the mean value theorem for integrals by B. Jacobson, which appeared in the 
May, 1982 issue, pp. 300-301. Theorem 2 of that article is false. The limit 
c — a which appears in several places on p. 301 should read x — a. 


3. Dr. K. D. Magill advises us that since the publication of his article (Universal 
topological spaces 95 (1988) 942-946) he has learned that theorem 1 had 
previously appeared in On a certain class of complete regularity, by J. Ferrer 
and V. Gregori, Acta. Math. Hung., 47 (1986) 179-180. 


4. Dr. John Beebee writes that he has found an error in his note Examples of 
infinite incongruent exact covers, 95 (1988) 121-123. The set of arithmetic 
progressions {(2':2'~')|i > 1} does not cover 0. The error can be corrected by 
changing the offsets, and the conclusion is unchanged. 


5. I am advised by Mr. Hans-J. Kleiner, of Spaichingen, F.R.G., that there is an 
error in the factored form of the chromatic polynomial of the icosahedron that 
appeared in D. H. Lehmer’s article Coloring the platonic solids (93, 288-292). It 
should read as ‘... +6999¢4 — 1969877 +...’ instead of ‘...+699974 — 
1968827 +...’. 
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Singular Values and the Spectral Theorem 


K. HOECHSMANN 
Department of Mathematics, University of B.C., Vancouver V6T 1Y4, Canada 


The action of an arbitrary real or complex m X n-matrix is not at all arbitrary: it 
takes a suitable orthonormal basis of n-space into some orthogonal set in m-space; 
consequently it maps the unit n-ball onto a (possibly lower-dimensional) ellipsoid. 
This is the geometric view of the Singular Value Theorem, which is not only one of 
the nicest matrix theorems to state and to visualize but also one of the easiest to 
prove and to apply. In any introductory course on matrices it deserves a place near 
the center. Theorem 1 below is a pure matrix version of this result. One of its main 
consequences, Theorem 2 below, leads straight to the orthogonal diagonalization of 
certain matrices, 1.e., the Spectral Theorem. 

It is customary to prove Theorem 2 independently, applying either the Funda- 
mental Theorem of Algebra to a characteristic polynomial or Lagrange Multipliers 
to a quadratic form, and even to deduce Theorem 1 from it. The analytic equipment 
needed for the following proof is more modest: it suffices to know that a continuous 
real-valued function on any compact set has a maximum. For the sake of simplicity 
the argument will first be given for rea/ matrices and then (trivially) extended to 
complex ones. To prevent any suspicion that extraneous subtleties are tacitly used, 
the prefix “eigen” will be avoided. 


THEOREM 1. Let A #0 be an m Xn real matrix. Then there exist orthogonal 
matrices M and N such that 
d, 0 
| where D = — with d,>d,,,> 9. 
0 d 


r 


D O 


MAN =|F 0 


Proof. Let O(k) be the set of all k X k orthogonal matrices. For M © O(m) and 
N € O(n), let a( M,N) stand for the entry in the first row and first column of 
MAN. Clearly, a is a continuous function of the pair (M, N) and, therefore, attains 
a maximal value d, > 0 on the set O(m) X O(n), which is closed and bounded in a 
suitable Euclidean space. Say d, = a(M,, N,). Then 
d, X 
Y A} 
where A’ is an (m — 1) X (n — 1) matrix. It turns out that X and Y are actually 
zero rows and columns, respectively. Indeed, if X were nontrivial, the first row p, of 
M,AN, would have length d > d,. Then one could multiply on the right by the 
reflection H which takes p, to [d,0,...,0] and create a value a( M,N) =d> d,. 
For similar reasons, involving columns and left multiplication, it follows that Y = 0. 
Finally, none of the entries of A’ can exceed d, in absolute value (because each of 


M,AN, = | 
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them could be permuted to the upper left), and this is true for all the possible forms 
of A’. An obvious induction now finishes the proof. 


The positive numbers d, > ::- >d. are known as the singular values of A. 


Remarks. The n X n matrix A* = A’ has a very simple effect on the columns 
V,,---,v, Of N; in fact A*v, = w,v,, where wp, = d? for i <r, and = 0 beyond. In 
matrix terms, this says that A*N = NA, where A is a diagonal matrix with diagonal 
entries »., as described. It is proved by putting A = (MAN)'( MAN), which is just 
N‘A*N. 

To prove uniqueness of the singular values d, it clearly suffices to characterize the 
uw, as being the only numbers such that (A* — pl)v = 0 for some v # 0. Indeed, 
setting v = La,v,, one gets (A* — wl)v = La,(p, — w)v,, which is never 0, unless pu 
is one of the u,. More geometrically, the d, can also be retrieved from the image 
under A of the appropriate unit sphere. 

If m =n, the theorem is often written as A = SR, with S positive semidefinite 
and R = M'N’ orthogonal. It is then called the polar decomposition. 


THEOREM 2. Every real symmetric n Xn matrix A has an invariant one-dimen- 
sional subspace. 


Proof. Let v # 0 be one of the columns of N, so that (A* — pJ)v = 0, as in the 
remarks above. The symmetric nature of A makes A* = A’4 = A? and 


0 = (A* —pl)v =(A+AI)(A-AT)v, 


where \? = p. If (A — Al)v = w # 0, then w generates such a line; otherwise v 
does. 


Remarks. For symmetric A it is trivial to show that the orthocomplement of any 
invariant subspace is itself invariant. Hence, by induction, Theorem 2 provides a set 
of n mutually orthogonal invariant lines, thus proving the spectral theorem for 
symmetric matrices. Moreover, if B is symmetric and commutes with A, it defines a 
symmetric operator on the non-zero kernel of A — AI. Hence, the two matrices have 
a common invariant line, and again, by induction, a complete orthogonal set of such 
lines. 

If the use of linear transformations is didactically impracticable, the spectral 
theorem can be stated in terms of an orthogonal U making U’AU diagonal. Then 
the induction will hinge on partial diagonalizations V'’AV, where the columns of V 
form an orthonormal basis of an invariant subspace and its orthocomplement. 

Everything said after the first sentence of Theorem 1 and before the present one 
applies verbatim to complex matrices, if one changes “orthogonal” to “unitary,” 
“symmetric” to “hermitian,” and replaces the transpose A’ by its complex conju- 
gate A* (adjoint of A). Writing every complex matrix as C = A + iB, with A and B 
hermitian, one can clearly extend the spectral theorem to all C for which A and B 
commute, i.e., where C commutes with its adjoint A — iB. These are, of course, the 
normal matrices. 
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A Remark on the Shortest Confidence Interval 
of a Normal Mean 


ROBERT BARTOSZYNSKI 
Department of Statistics, Ohio State University, Columbus, OH 43210 


WAI CHAN 
Digital Equipment Corporation, AET1-2 / 7, Andover, MA 01810 


1. Introduction. In estimation theory, the shortest 95% confidence interval for 
the mean of a normal distribution of known variance o? is given by 


X + 1.960/vn (1) 


(see, for example, Lehmann [1)). 

Dr. Jones, a recent graduate from the Department of Dubious Statistics, found an 
even shorter confidence interval. First he computed the interval given in (1). He 
then pretended that o is unknown and computed the interval 


X+1S/vn —1, (2) 


where nS? = ¥"_,(X, — X)*° and t = ¢%5 is the 97.5th percentile of a student’s 
t-distribution with n — 1 degrees of freedom. Now he chooses whichever of intervals 
(1) and (2) that happened to be shorter. 

Obviously, the length of Dr. Jones’ interval is less than or equal to the length of 
interval (1). On the other hand, both (1) and (2) have coverage probability of .95, so 
Dr. Jones claims that the same must be true for his interval. 

Has Dr. Jones made a significant statistical discovery? 

This question, plus others of a more technical nature, was on the so-called 
Inference Exam in the Department of Statistics of Ohio State University in 1987. 


0.95 


00 th tt th tt 
0.0 0.5 1.0 15 2.0 2.5.30 35 40 45 50 5.5 60 65 7.0 
S 
Fic. 1. P (Coverage|S = s) 


Variance = 25 
n= 21 
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We cannot disclose the details of students’ performance. However, the question 
here 1s not entirely without curiosity value. Even experienced statisticians are known 
to have been baffled by Dr. Jones’ reasoning momentarily. The important point here 
is to note that even though the probability P(|X — p| < tS/ vn —1) is .95, the 
conditional probability P(|X — | < tS/ Vn — 1|S = s) is not constant in s. When 
ts/ ¥n — 1 < 1.960/ yn, the conditional probability P(|X — w| < tS/ vn —1|S = 
s) is less than .95. In Figure 1, we graph the conditional coverage probability of Dr. 
Jones’ interval as a function of s for n = 21 and o? = 25. Thus while the expected 
length of Dr. Jones’ interval is less than 2 X 1.960/ Vn the coverage probability is 
less than .95. We shall briefly describe the computations of the expected length and 
coverage probability of Dr. Jones’ interval. 


2. Expected length. The length of the newly defined confidence interval is a 
function of S and is therefore a random variable. To calculate this expected length, 
let f,_, denote the density function of a chi-square random variable with n — 1 
degrees of freedom. Then, 


E (|Jones’ interval | ) 


ns? ns? 
i 2x 1.960/vn x f,_4{ + ]d]| = 
{1?s?/(n—1)>(1.960)?/n} 0 0 


2x ts/vn—-1 «tral > a (3) 


+f 
{1*s*/(n—1)<(1.960)?/n} 


2x 1.960/V/n x P(nS*/o? > (n — 1)(1.96/t)’) 


+21/vn—1 f "Ja( 5} (4) 


sx tn-1 
{(ns/o)* <(n—1)(1.96/1)? } 
The first term of (4) is 


(2 x 1.960/Vn) P(x2_, > (n — 1)(1.96/2)’). (5) 
By a change of variable, the second term of (4) becomes 


2t 


7 [omer oy f(y) dy 


7 2 ['(n/2) De (n- ; 
= 210 ao) “Mal bpA x P(x? < (n— 1)(1.96/t)’). (6) 


Thus we need only a chi-square table to compute the expected length of the interval. 
For example if n = 21 and o? = 25, then using (5) and (6) we obtain, 


E(\|Jones’ interval|) = 2.61 + 1.48 = 4.09, 


whereas the length of the standard confidence interval of (1) is given in this case as 
4.28. 
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3. Coverage Probability. The coverage probability is 
P(COVERAGE) 


= P(|X — p| <1.960//n;1.960/Vn < tS/vn — 1) 
-+ P(|X — ul <tS/yn—1; 1.960/Vn > tS/yn — 1 ). 


Now by conditioning on the random variable S and using the fact that X and S are 
independent, we obtain 


P (COVERAGE) 
2 2 
= P(X — wl <1:960/ViIS = s)f.-4| "> al 


{(1.960)?/in<t*s*/(n—-1)} 


_ ns* ns? 
+ P(\X — p| <ts/vn —1|S =5)f,-.| — ]d| > 
{(1.960)?/n>1t?s*/(n—-1)} 0 
= .95P(nS?/o? > (n — 1)(1.96/t)’) 
52 ns 
Tyo, P(|X — p| < ts/vn—1)f,, (5 |a{"). (8) 
{ns*/o* <(n-1)(1.96/1)? } a? 0 


The first term of (8) is .95P(x2_, > d) where d = (n — 1\(1.96/t)”. The second 
term of (8), by a change of variable, becomes 


fo PUX- bl < toly/(n? — n) \ f(y) & 
= f° (20(0Am=1)) = 1) a) &. 


where ® is the CDF of a standard normal distribution. Thus (8) can be rewritten as 
P(COVERAGE) 


= .95 — 1.95P(x2_, )+ 2f" 3 (1 y/(n- y/(n — 1) | fy i(y) dy, (9) 


which can be evaluated numerically. For n = 21 and o” = 25, the coverage proba- 


bility is .9358. Note that whenever y < d= (n — 1)(1.96/t)*, ®(t/y/(n — 1)) < 
®(1.96) = .975. From (9), we obtain, 


P (COVERAGE) 
< 95 - 1.95P(x3_, <d) +2f°975f,_.(y) 
0) 


95 — 1.95P(x2_, <d) + 2(.975) P(x2_, < d) 

= 95, (10) 
and so the coverage probability of Dr. Jones’ interval is less than .95 as asserted. 
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For Your Eyes Only 


GARY A. MARTIN 
Department of Mathematics, Southeastern Massachusetts University, 
North Dartmouth, MA 02747 


I was startled to learn from James C. Reber’s article, “Another Required Reading 
Program for Mathematics Majors” [1], that sections from Guillen’s Bridges to 
Infinity |2] are occasionally required reading for students taking Calculus III at 
Indiana University of Pennsylvania. I see no possible reason for encouraging 
potential mathematics majors to read this book unless it is to point out the 
numerous substantive errors it contains. To be fair, some of the sections seem to be 
free of misinformation; on the other hand, they also appear to be free of mathemati- 
cal content. If it is these which are assigned, the danger is that the interested student 
will read the rest on her or his own initiative, and enter advanced courses believing 
some of the following assertions: 

(a) “On the rational number line, adjacent numbers are infinitely close together. 
But with the real number line adjacent numbers are closer than infinitely close 
together.” (page 35) 

(b) “It [2%] is the first stepping stone beyond infinity, the first transfinite 
number, which Cantor named §,. A set with 8, elements, in turn, has precisely 2™: 
conceivable subsets. This is the second stepping stone, the second transfinite 
number N,, and so forth.”? (page 46) 

(c) “If it were possible to write out explicitly the zeroth transfinite number, ®& p, it 
would be a 1 followed by an infinity of zeros. The transfinite number 8, would be a 
1 followed by more than an infinity of zeros.” (page 47) 

(d) “With such a set [the limit of the Ns, which he calls 8 ,,], he [Cantor] could 
recycle his [diagonal] argument to define an entirely new sequence of larger-than-® ,, 
or trans-transfinite, sets. One might name them after the second Hebrew letier, 3 
(beth), so that the new sequence would be 2), 2,, 3,, and so on.”* (page 47) 

(e) “There are mathematicians today who do speak of an “absolute infinite.” 
They denote it with the last letter of the Greek alphabet, omega (w), and impute it 
to be the largest conceivable infinity.” (page 47) 

(f) “To this day we still don’t know exactly how many irrationals there are, 
although it has been established that the total number cannot be more than §,. 
Cantor himself guessed that the total number of irrationals is exactly 8,, mainly 
since &, is the next largest infinity after N, defined by set theory. His guess came to 
be known as the Continuum Hypothesis. But there is still the uneliminated possibil- 
ity that the number of irrationals actually lies somewhere between 8%, and &,.’’* 


(page 50) 


"The author is incorporating the Generalized Continuum Hypothesis into his notation! 

*We know these as 8,,N.41,%o42):.. 2, is obtained by iterating the power set operator n times, 
beginning with No. 

>We use w to denote the first infinite ordinal—the order type of the natural numbers. 

“We know how big the continuum is—2*%°—but we cannot know whether that is as small as &,, or 
larger. 
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(g) “The roots of fifth-degree equations behave differently when interchanged, he 
[Galois] noticed, depending on whether or not they are all algebraic numbers. If 
they are, then interchanging them in the equation will result in another equation 
that is as algebraically sensible as the first one... . On the other hand, if some of 
the roots are transcendental numbers, interchanging them results in an algebraically 
nonsensical equation.” (page 75) 

(h) “Nim has an optimal strategy that is unfair because by following it, the first 
player will always win, and the second player will always lose.” > (page 141) 

(i) “And yet even with such assistance [computers], although it has helped us to 
formulate some successful chess strategies, we still have not been able to reason out 
the unique [?!] optimal strategy.” (page 143) 

(J) “For a nonpolynomial—or NP-class—problem, the necessary computer time 
grows exponentially with its size.”’® (page 177) 

If that were not enough, the author also manages to give what would have been a 
correct, though not particularly illuminating, proof of the uncountability of the real 
numbers by the Cantor diagonal argument had he not said “irrational” whenever he 
meant to say “real.” He later refers to the “middle” and “outer” points among three 
points on a circle—not so terrible, but he is using this as an example of a 
topological invariant! 

Bridges to Infinity might make pleasant reading for the general public, but for 
math majors, it should be assigned with extreme caution. 
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Cauchy-Bunyakowski-Schwarz Inequality Revisited 


FRANCOIS DUBEAU 
Département de mathématiques, Collége militaire royal de Saint-Jean, Saint-J ean-sur-Richelieu, 
Québec, Canada, JOJ 1RO 


Usually when we prove the Cauchy-Bunyakowski-Schwarz (CBS) inequality in 
any inner product space we consider the sign of a special quadratic polynomial, it is 
“a clever trick that is not easy to motivate” [1, p. 201]. It could be interesting to 
have other proofs of this result. For example in R” (or C”) we can present a proof 
using the principle of mathematical induction. 

Let X = (x, X5,...,x,) and y= (yj, yy,..., y,) be any two vectors of R”. Let 
us consider the Euclidean inner product x - y defined by x - y = L7_,x,y,, and the 
Euclidean length of a vector defined by ||x'|| = (x - x)”. 


> Which player has the winning strategy depends on the initial position, and it is the position, and not 
the strategy, which is unfaur. 


© Then, by definition, P # NP! 
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CAUCHY-BUNYAKOWSKI-SCHWARZ INEQUALITY. Let n > 1 and x, y © R”. Then 
IXY] <q yl. (1) 
and equality holds if and only if one vector is a scalar multiple of the other. 


A proof of this result can be done using the upwards-downwards form of the 
principle of mathematical induction. This form of induction was used by A. Cauchy 
[2, pp. 375-377] to prove the arithmetic-geometric mean inequality. It can also be 
used to prove trigonometric inequalities [3, pp. 93-95]. 


Proof. Let P(n) denote the proposition and let us start with n = 2. 
Step 1. Proof that P(2) holds. The result follows from 
(xy, + X22)” + (1 — XV)" — (xf + x3)( yy + y;). 
Step 2. Proof that P(n) implies P(2n). With a pairing of indices two by two in 


the sum, we have 


2H 2 n 2 
Pa — EE G5) v5. ba) 
1=1 


7] 2 
. | (x35 + 3) (3 +94)" 
J 


where the first and second inequalities follow from P(2) and P(n), respectively. 


Step 3. Proof that P(n) implies P(n — 1). We only have to consider x, = y, = 0 
for P(n). 
From these three steps we can conclude that (1) holds for n > 1. Equality follows 


from the condition for equality at step 1 and the arbitrary pairing in (2). 
The first step of this proof is nothing but the case n = 2 of Lagrange’s identity 


Eon] =(E[E") 


1=1 


» (x,y, — x,y,)° + 


l<i<j<n 


which can be proved by the upwards form of induction and can be used to prove the 
CBS-inequality [2, pp. 373-374]. 

Another proof of the CBS-inequality using the upwards form of induction can be 
done as follows: Step 1 as before, and 


Step 2. Proof that P(n) implies P(n + 1). We have 


n+l 


hn 
xf <} xy, 
1=1 


i=l 


2 1/2; 9 1/2 an 2 [ned 2 ve 
+ (x2) (y241) ~< » xX, » J; 5 
1=1 


i=1 


< 


n 1/27 4 1/2 
+1 Xn4 Wnt < dx | 
1=1 i=] 


where the second and third inequalities follow from P(n) and P(2), respectively. 
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Once (1) is established for R”, we can extend it to more general inner product 
spaces. For example, let w,, w,,...,w, be n strictly positive real numbers and 
consider the weighted Euclidean inner product x - y = U”_,w.x_y,. Using the change 
of variables €, = w, x, and 9, = lw, y, we obtain (1) for any weighted Euclidean 
inner product space. More generally we can extend (1) to any finite dimensional 
inner product space using coordinates of vectors with respect to any orthonormal 
basis [1, p. 225]. 

Finally, for any inner product space we can establish (1) by a slight variation of 
the proof that projection onto an n-dimensional subspace is norm-decreasing [4, pp. 
145-147]. Assume x #0 and let 6 @R be such that x- y = 8||x'|| |[y'||. Let us 
remark that equality holds in (1) if and only if |8| = 1 or y = 0. Let us consider the 
scalar X such that (y-—AX)-x=0. Hence A= X- V/|x||/? = SP || |x|] and 
|v — AX ||? = (_— 87) || ¥|I7. It follows that -1 <6 <1 and y= AX if and only if 
[5] = lor y=0. 
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An Old Max-Min Problem Revisited 


MARY EMBRY-WARDROP 
Department of Mathematics, Central Michigan University, Mt. Pleasant, MI 48859 


Recently, while teaching a beginning calculus class, I introduced the problem of 
inscribing a rectangle of maximum area in a given right triangle and asked the class 
how the rectangle should be oriented inside the triangle. I was slightly surprised 
when a student suggested that one side of the rectangle should be on the hypotenuse 
of the triangle. In previous classes the suggestions were to fit one corner of the 
rectangle into the right angle of the triangle. The class soon bogged down in trying 
to solve the problem with one side of the rectangle on the hypotenuse, but 
subsequently solved the problem using the other orientation. 

I was sufficiently intrigued to solve the problem with both orientations and was 
surprised to discover that in either case the maximum area was { bh, where b and h 
are the base and height of the triangle. Furthermore in each case corners of the 
rectangle of maximum area lie on the midpoints of the two legs of the triangle. 

At this point I was thoroughly intrigued: Could the rectangle be more advanta- 
geously oriented and for other orientations did a corner of the rectangle of 
maximum area lie on the midpoint of a leg of the triangle? 

The following assertions describe the results of my investigation. The basic 
mathematical tools used were the Law of Sines and the technique of maximizing or 
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minimizing a function, usually discussed in a beginning calculus class. I omit the 
solutions under the assumption that most mathematicians enjoy finding their own 
solutions more than reading another’s. 

Let the rectangle be oriented in the triangle as pictured in FIGURE 1. The angle a 
satisfies 6 < a < 7/2. For a given angle a let A(a) be the area of the rectangle of 
maximum area which can be so inscribed in the triangle. 


Fic. 1 


Assertion I. Given a the rectangle of largest area is obtained when 


h b sin 6 
Ising and y= 2cos(a — 6) - 
In this case 
bh sin 6 
A(a) 


7 4sin acos(a — 6) 
and P is the midpoint of the side of the triangle opposite the angle @. 


Assertion 2. The maximum area function A(a), 6 < a < 4, has largest value at 
the two endpoints of the interval: 


A(6) = 4(>) - bh 


Assertion 3. The maximum area function A(a), 6 < a < 7 has smallest value at 
the midpoint of the interval: 


1 bh sin 6 
A|~@0+ —| = —————_.. 
5 | 2(1 + sin @) 


Moreover 1n this case, the shaded triangle in Figure 1 1s isosceles with equal sides of 
length 5h. 

There are numerous problems here for a first-semester calculus class and each of 
the solutions calls upon skills that the students should have developed in geometry, 
trigonometry, and calculus. One of the interesting facets of Assertions 2 and 3 is 
that the students would be invited to consider maximizing and minimizing A(a), 
each value of which is itself a maximum. Few problems of this type seem to be 
accessible for beginning students. Furthermore, there is a collection of geometric 
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problems only touched upon in Assertions 1—3: Why is P a fixed point as a corner 
of the rectangles of maximum area? Why is the shaded triangle isosceles when A(a) 
is minimum? Why is A(a) minimum at the midpoint of the domain of A(a)? Is 
there a geometric proof that A(a) is obtained with sides given by the equation in 
Assertion 1 or a geometric proof that A(@) and A(4) are the maxima of A(a) and 
A(56 + 7) is the minimum? 

It 1s interesting to note that the problem of inscribing a rectangle of maximum 
area in an acute triangle is no more difficult than the same problem for a nght 
triangle. Let the rectangle be oriented in an acute triangle, as pictured in FIGURE 2. 
The angle a satisfies 6, < a < 7/2. 


Fic. 2 


Once again for a given angle a the rectangle of maximum area, A(a), which can 
be so inscribed is obtained when P is the midpoint of the side of the triangle 
opposite 9,. The maximum of A(a) occurs at the endpoints 6, and 5 and the 
minimum occurs at the midpoint 56, + 7 of the domain of A. Finally, if the 
triangle is obtuse, 6; > 7/2 for example, the solution is the same except that A(a) 
would only be defined for = — 6, < a < § and in this case the maximum of A(a) 
would be attained only at 7/2. 

Surely the results of this note are known. However, not a single mathematician to 
whom I have mentioned them admitted having considered the maximum area 


problem with any orientation other than the most standard one. 


Functional Equations and L’H6pital’s Rule in an Exact 
Poisson Derivation 


J. ACZEL 
Centre for Information Theory and Quantitative Economics, University of Waterloo, Waterloo, 
Ontario, Canada N2L 3G1 


In [2], Cooke derived the Poisson formula 


P(t) = Prob[n arrivals in the interval (0, r)| 


(Ar) " —Xt 
=——e (A> 0;n=1,2,...) (1) 
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from P,(t) = e~*’, L'Hépital’s rule, the differentiability of P, (n = 1,2,...), and 
the functional equations 


P(t+h) = P(t)Po(h) + P(t) P(A) (t,h>0;n=1,2,...). (2) 


Actually, (1) does not satisfy (2), but in [2] equation (2) is regarded valid only 
“for very small h.” There (2) is presented as a consequence of “independence and 
negligibility assumptions.” 

Here we state these assumptions explicitly, derive from them the exact functional 
equations in place of (2) and arrive at (1) in an exact and still elementary way, again 
by use of L’HOpital’s rule. The proof goes back at least 40 years; see [3] of which we 
give a somewhat modified form, more suitable for teaching beginners. 

A (tacit) assumption 1s that of homogeneity: the probability P,(t, — ¢,) of n 
occurrences in the interval (¢,, ¢,) depends only on (n and) the length ¢, — ¢, of the 
interval. Independence means that the number of occurrences in (¢,, f,) and (45, ¢;) 
is independent. Negligibility (or rarity) means that in “very small” intervals one 
occurrence is approximately (asymptotically) equally probable to at least one 
occurrence; in terms of a formula: 


1-— P(t 
tim Poh) _ (3) 
that 1s, 
. P,(t) + P(t) + P;(t) totes 
im ———-——_ = 1. 


Since the P,, are probabilities, we have 


} 
o.6) 


P(t)=1 and 0<P(t)<1 (n = 0,1,2,...). (4) 
n=O 
Consequently, 
Fu) 0 ( 2,3 (5) 
lim 3 a) = m=2,3,...). 


By the independence assumption (and homogeneity), 
P(t + u) — P(t) Po(u) + P,_,(t)P,(t) tote + Po(t)P,(u) 
(t,u>0;n=0,1,2,...) (6) 
(there are exactly n occurrences in (0, ¢ + uw) if there are either n occurrences in 
(0, ¢) and none in (¢,¢ + uw), or (n — 1) in (0, f) and one in (ft, + w),... or 0 in 
(0, ¢) and n in (¢, ¢ + w)). This is the sequence of (exact) functional equations from 
which we will prove (1) with the aid of (3), (4) and (5). 
We start with n = 0. Then (6) reduces to 


P(t tu) =P(t)P(u)  (t,u> 0). (7) 
The general positive, bounded solution of (7) 1s 
P,(t) =e"™ (8) 


with constant A (see Appendix). By (4), 0 < P(t) < 1, which shows that Pp 1s 
bounded and that A > 0. If positivity of P) is not required then P)(t) = 0 could be 
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a solution too, but considering (6) for n = 1,2,... would then give P.(s) = 0 for 
n=1,2,..., contrary to L*_,P,(s) = 1 in (4). Similarly, if A = 0, P(t) = 1, then 
from (4) P(t) = 0 for m = 1,2,... which is of the form (1) (if (0 - 2)° = 1), but 
uninteresting. So, in what follows, we take A > 0. 

Now we get to 7 = 1. In this case (6), combined with (8), gives 


P(tt+u)=P,(t)e “+e 'P,(u) (t, u> 0). 
If we multiply both sides by e**" we see that /,, defined by 
f(t) = e“P(t), (9) 
satisfies 
f(tt+tu) =fi(t) + f,(uv) (t, u> 0). (10) 


By (4) and (9), f, is also bounded on every finite interval. The bounded solution of 
(10) is 


f(t) = at, 
where c, is an arbitrary constant (see Appendix). By (9), 
P(t) = te, 
and by (3) and (8) 


10 Cte ™ 0 «Cyt 10 Cy C 


(here we used the L’Hépital rule—or the definition of the derivative of e* at 0), so 
that : 


P(t) =Ate™. (11) 


We finish the proof of (1) by induction. By (8) and (11) it is true for » = 0 and 
n = 1. Suppose (1) is true for n = 0,1,..., m — 1 (> 0). Then (6) for m reads 


pm i pm 2 2 


u u 


_ —Au mt ,—AL,—AU 4 ———__ } ... 
P(ttu)=P,(t)e "+ Ne “Ve (m— ii!’ (m—2)!2! 


m— 1 


tu 
+ ————— | +e *'P (u) 
1!(m — 1)! 


Ne Alp hu 


=P (the “+P (ujye “+ 
m!} 


aa Mt 


(1 +u)  —t™ = uw’. 


Multiplying this equation by et” and writing 


f(t) = Pa(tyes — 22 
gives again 
f(t+u)=f(t)+ flu) (t>0,u> 0). (12) 


With P,, also f is bounded on every finite interval and so (see Appendix), f(t) = ct 
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and (An) 
At)” 
P(t) = | + ctje"™, (13) 
m! 
Applying also (5) and (11), we get 
(Ar)” 
4 —Xt mn 
F m! “ye ; (Ar)" "cc C 
= lim ——————_———- = ——— +-]=-, 
0 Ate~™ _ m! v r 


so that c = 0 and (13) shows that (1) holds also for m, which concludes the proof. 


APPENDIX 


Of course, if the P, are differentiable, we immediately get from (7) 


h Po(t)Po(u) = Py(t + u) = Po(t) Py(v) 
so that 
P.(t) = aP,(t), P,(t) = ae* 


and, substituting back into (7), a = 0 or a =1, that is, Pj)(¢t) = 0 or P(t) = e™. 
(From (4), as we have seen, a = —A < 0.) 

The solution of (12) (and (10)) in this case is even simpler: f’(t + u) = f(t) = 
constant (say, c), so f(t) = ct + b which satisfies (12) only if b = 0, f(t) = ct. 

If, as above, we know only that f is bounded on at least one finite proper 
interval, say (0, 1], (which followed from (4)) then f(t) = ct is still the only solution 
of (12). We indicate here an elegant proof of Young [4]. Define 


g(t) = f(t) -— fQ)e. (14) 
This g too is bounded, say A < g(t) < B on (0, 1], satisfies the same equation 
g(t +u) = g(t) + g(u) (15) 
and also g(1) = 0. So g(t + 1) = g(¢) for all ¢ > O and 
A<g(t)<B forall ¢>0. (16) 


We prove that g(t) = 0. Indeed, if there existed a ¢, > 0 with g(t.) = C # 0, say 
C > 0, then by repeated application of (15) g(nt)) = ng(to) = nC > B for large 
enough n, contrary to (16). So g(t) = 0 and, by (14), f(t) = f(1)t = ct for all 
t > O, as asserted. 

As to (7), it is easy to see that P, is either everywhere or nowhere 0, thus either 
P,(t) = 0 or P, is positive for all ¢ > 0, so that logarithms can be taken in order to 
reduce it to (12). (We know now only that log Py(t) is bounded from above; P, > 0 
gives no lower bound for log P,, but a slight modification of Young’s argument (see 
[1]) gives the same result in this case too.) 
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ELEMENTARY PROBLEMS 


E 3384. Proposed by Jacob Sturm, Rutgers University, Newark, NJ. 


Let {a,}%_, be a sequence of distinct complex numbers such that |a,]| 
converges. Let {c,,}°_, be a bounded sequence of non-zero complex numbers. Prove 
that there are infinitely many positive integers k such that 


CO 
Y c,ax #0. 


n=1 


E 3385. Proposed by Solomon W. Golomb, University of Southern California, Los 
Angeles. 


For what positive integers n is it true that 


y p=n) 


p<m7(n) 


Here, as usual, (x) denotes the number of primes not exceeding x and the 
summation 1s Over primes p. 
E 3386. Proposed by Eugene F. Schuster, University of Texas, El Paso. 


Let L be the length of a (2N — 1)-game World Series, modeled as a sequence of 
independent identically distributed Bernoulli trials which terminates as soon as one 
team wins N games. (The length is the number of games actually played.) Prove the 


427 
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seemingly obvious observation that the expected length E(L) of the series is 
maximized when the two teams are evenly matched. 


E 3387. Proposed by Douglas B. West, University of Illinois at Urbana-Champaign. 


Determine the maximum possible number of edges in a simple graph on ten 
vertices that contains no 4-cycle and find all graphs achieving this maximum. 


E 3388. Proposed by Matthew Cook (student), University of Illinois at Urbana- 
Champaign, Walther Janous, Ursulinengymnasium, Innsbruck, Austria, and Marcin 
E. Kuczma, University of Warsaw, Poland. 


Let x, and x, be arbitrary positive numbers. Suppose we define a sequence 
{x,}%_, by putting x,,,=2/(x,,, + x,) for n=1,2,3,.... Prove that the 
sequence converges. 


E 3389. Proposed by Joseph H. Silverman, Brown University, Providence, RI. 


Let ||t|| be the distance from the real number ¢ to the nearest integer. Prove that 
there is a unique real number r greater than 1 such that 


[o-@) 
yr ern lel 


n=1 


is a polynomial in |j¢\|; give this polynomial explicitly for the special value of r 
referred to. 


E 3390. Proposed by Robert B. Israel, University of British Columbia, Vancouver, 
Canada. 


Let 7, v, and d be the position, velocity, and acceleration vectors of a particle at 
time ¢. Suppose the particle moves so that a is always perpendicular to both 7 
and v. 

(a) Show that v,, = lim,_,,, 0 exists and show that tu, — ris bounded. 

(b) Show that if /5°¢|a(t)| dt < oo, then lim, _, ,,(¢u,, — 7°) exists. 


i—-> oo 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Combinatorial Sum 
E 3290 [1988, 872]. Proposed by Douglas B. West, University of Illinois, Urbana 
IL, and Douglas H. Wiedemann, Institute for Defense Analyses, Princeton, NJ. 
Given nonnegative integers /, m,n with /, n < m, evaluate the double sum 
f I 
—j\{n\({m—n 
eee ee ee 
‘=0 j=0 m—1}\I}\ PT 


Solution I by Irl C. Bivens, Davidson College, Davidson, NC. The value of the 


sum 1s 2!( """). To see this, suppose that a mathematics department contains m 
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students, of whom n are male, and that all m students wish to take a class in 
combinatorics that is limited to / students and is graded pass /fail. There are 2 (" i ) 
potential grade reports under the restriction that no males take the class. We show 
that the double sum also counts this. 

The nonzero terms of the double sum have 0 < j < i < /. Given such parameters, 
the number of grade reports in which i students pass, of whom / are male, is 


(")(" ="V( a '): choose the males who pass, choose the females who pass, and from 
the remaining students choose those who do not take the course. Hence the double 
sum is the difference between the number of grade reports in which an even number 
of males pass and the number in which an odd number of males pass. 

Furthermore, the operation of changing the grade of the eldest male taking the 
course establishes a bijection between the set S of grade reports in which an odd 
number of males pass and the set T of grade reports in which an even number of 
males pass and at least one male is in the class. This involution implies that the 
double sum is the number of grade reports for classes containing no males. 


Solution II by Richard Stong, Harvard University, Cambridge, MA. The value of 
the sum is 2(” , "), Denoting the double sum by S(/,m,n), we claim that 
ee oS(1, m, n)x! = (1 + 2x)""", from which the result follows by the binomial 
theorem. 

Extending the sums over all nonnegative i and ; does not change the value; we 
do so to facilitate interchange of summation. We compute the triple sum by viewing 
the sums in reverse order and applying the binomial theorem three times. 

Est me nx! =T(-y(F JOT 7 ET a 
{ j ; 


Bia (s)E(1f)e0 89 
-cesyrEcw(as) S(T la) 
-assyrEcen(( 3) (AE) 
=(s2y™"e9"[1= 5) 


= (1+ 2x)" 


Editorial comment. Alternative solution techniques included everything from 
recurrence relations to calculus and complex analysis. David Callan and Fred 
Kochman observed that interchanging the order of summation gives an inner sum 
that is a special case of the binomial coefficient identity given as equation (24) in 
Donald E. Knuth, The Art of Computer Programming (vol. 1, second edition), p. 58, 
after which one application of the binomial theorem completes the sum. 


Solved also by J. Anglesio (France), J. C. Binz (Switzerland), D. Borwein, D. Callan, J. Duemmel, A. 
J. Humphreys (student), A. A. Jagers (The Netherlands), N. L. Johnson, F. Kochman, C. Krattenthaler 
(Austria), Y. H. H. Kwong, H. Lipman, O. P. Lossers (The Netherlands), R. Martin (student, W. 
Germany), L. E. Mattics, A. Nijenhuis, T. S. Norfolk, R. Perez-Marco (Spain), H. Steinacker (Austria), J. 
H. Steelman, J. Toth (Hungary), M. Vowe (Switzerland), A. Wender, and the proposers. 
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When the Kronecker Product is Triangular 
E 3298 [1988, 954]. Proposed by Marvin Marcus and Claire Pesce, University of 
California, Santa Barbara. 


If A and B are square matrices, we define the Kronecker product A ® B as the 
matrix 


a,B a,,B a,,B 
- _— 
a,,B a, B a,,B 
where 
ayy a)? ay 
A= a 
a a 42 oS ann 
Let A,, A5,..., A,, be square matrices over a field, and let K = A; ® A, @®--: ® 


A,,. Prove that K is lower triangular if and only if the following two conditions are 
satisfied: (a) A, is lower triangular; (b) if there exists p < m such that A,,, is not 
lower triangular but all of A,, A,,..., A py are lower triangular, then at least one of 
the matrices A,, A,,..., A, has zero main diagonal. 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. It should be assumed that none of the matrices A,, A,,..., A,, 1S a zero matrix. 
With this assumption, we prove the result by induction on m. For any square 
matrices A and B, the entries of A ®@ B that occur above the main diagonal are 
a,,b,, with i <j or with i = 7 and A </. If B # 0, then these will all be zero if and 
only if A is lower triangular and either B is also lower triangular or A has zero main 
diagonal. This verifies the statement for m = 2. 

Suppose m> 2 and K=4A,®A,®--- ®A,, is lower triangular. Since the 
Kronecker product is associative, K = A, ® (A, ® --: ® A,,). Hence the result for 
m = 2 implies that A, is lower triangular. Now suppose there exists p < m such 
that A,,, is not lower triangular but all of A), A,,..., A, are lower triangular. 
Then K = A @ B, where A= A, ®--- @A, and B=A,,,®--: @A,,. By the 
inductive hypothesis, B is not lower triangular. Therefore, A must have zero main 
diagonal. However, the main diagonal of A consists of all products formed using 
one main diagonal entry from each of Aj,..., A,. Hence one of these matrices must 
have zero main diagonal. 

Conversely, suppose m > 2 and K = A, ® (A, ® -::: ®A,,) satisfies conditions 
(a) and (b). If A, is not lower triangular, then A, has zero main diagonal, and the 
result with m = 2 implies that K is lower triangular. If A, 1s lower triangular, then 
A, ® +--+: ®A,,_ also satisfies (a) and (b) and by the inductive hypothesis 1s lower 
triangular. Now the case m = 2 implies that K is also lower triangular. 


Solved also by J. K. Baksalary (Poland), D. Callan, Y. Ikeda, R. H. Jeurissen (The Netherlands), O. P. 
Lossers (The Netherlands), A. Nijenhuis, and the proposers. One incorrect solution was received. 
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Computing the Volume of a Zonotope 


F 3301 [1989, 54]. Proposed by Hugh L. Montgomery, University of Michigan, 
Ann Arbor, and Jeffrey D. Vaaler, University of Texas, Austin. Suppose a,,@5,..., Ay 
are vectors in R™, where N > M. Prove that the set of vectors in R™ expressible in 
the form 


has M-dimensional volume 


y |det A,|, 
I 


where / runs over the subsets of {1,2,..., NM} of cardinality M and A, denotes the 
M by M matrix whose columns are the a, for which i € J. 


Solution by H Guggenheimer, West Hempstead, NY. Given N compact convex 
sets K, in R™, their Minkowski sum (or vector sum) is defined by 


K, +++: +Ky= { x! + oe +xN:x'€K,,i=1,...,N}. 
A formula of Minkowski computes the volume of the vector sum as 
V(K, +--+ +Ky)=DV(K,,...,K,,), 


where the summands are Minkowski’s mixed volumes, and the sum ranges over 
1 <i,,...,é,, < N. For these basic definitions and the results used below, see T. 
Bonnesen and W. Fenchel, Theory of Convex Bodies, translated from the German, 
BCS Associates, Moscow, Idaho (1987) or H. G. Eggleston, Convexity, Cambridge 
Tracts in Mathematics and Mathematical Physics, No. 47, Cambridge University 
Press, 1958. 

Let A, = {Aa,: 0 <A < 1} be the line segment joining the origin to the point a,, 
fori =1,..., N. Then A, + --+ +A,, 1s the set (called a zonotope) described in the 
problem. V(4,,..., 4, ) = 0 when two of the line segments that appear in the 
mixed volume are the same, and the mixed volume is invariant under the permuta- 
tions of its arguments. Hence, in this case, the preceding sum takes the form 


V(A, + ++) t+Ay) = MIDVV(A,,...,4,,), 


where the indices {i,,..., /,,} range over all M-sets taken from {1,2,..., MN}. This 
gives the required formula, because M!V(A,,..., 4,,) is the volume of the paral- 
lelopiped spanned by the segments 4,,..., 4,,, and this volume is equal to |det 4), 
where 4, is the matrix with columns a,,...,a 


Editorial comment. Richard Stanley pointed out that the formula appears in the 
paper of G. C. Shephard, “Combinatorial properties of associated zonotopes,” Can. 
J. Math. 26 (1974), 302—321, where it is attributed to Peter McMullen. Most solvers 
proceeded by showing that the given set could be dissected into parallelopipeds, 
which was the result proved in Shephard’s paper. The formula has been applied in 
P. McMullen, “Volumes of projections of unit cubes,” Bull. London Math. Soc. 16 
(1984) 278-280, and in G. D. Chakerian and P. Filliman, “The measures of the 
projections of a cube,” Studia Sci. Math. Hungar. 21 (1986) 103-110. 


Solved also by M. Golomb, L. R. King, O. P. Lossers (The Netherlands), A. Nyenhuis, and the 
proposers. 
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Averaging to Integers 


E 3302 [1989, 54]. Proposed by Jim Delany, California Polytechnic State Univer- 
sity, San Luis Obispo. 


The mean and standard deviation of any 7 consecutive integers are both integers. 
What natural numbers greater than 1 share this property with 7? 


Composite solution by Roger Douglass and Mike Breen, Alfred University, Alfred, 
NY, and Roger B. Eggleton, University of Brunei Darussalam, Borneo. We prove the 
following generalization: 


The mean and standard deviation of an arithmetic sequence of n integers with 
common integral difference d are both integers 1f and only if (n — 1)d is even and 
n is of the form [(2 + 3 )*/2] for k a non-negative integer. 

The mean and standard deviation of {a,a+d,...,a+(n-—1)d} are a+ 
d(n — 1)/2 and [(n? — 1)/3]'”? d/2, respectively. When vn is odd, the mean is 
integral, and the standard deviation is integral if and only if [(n? — 1)/3]'/? is 
integral, because it must be rational and therefore an integer (necessarily even since 
n* — 1 is divisible by 8). When n is even, the mean will be integral if and only if d 
is even, and then the standard deviation is integral if and only if [(n* — 1)/3]’”” is 
integral. In either case, (n — 1)d must be even. To complete the proof, we must 
describe the positive integers n for which there is an integer ¢ with n* — 1 = 3r?. 

The equation n* — 3t? = 1 is a Pell equation whose solutions are well-known. 
Since n = 2 and ¢ = 115s the smallest positive solution, the theory of these equations 
implies that every positive integral solution (n,, t,) satisfies n, + t,V3 = (2 + y3)‘, 
for some positive integer k. Furthermore, n, — t.V3 = (2 - J3 )*, and therefore 
n, = [2 + V3 )* + (2 — ¥3)*]/2. Since n, is an integer and 0 < 2 — y3 <1, we 
have n, = [(2 + ¥3)*/2], as claimed. 


Editorial comment. When d is odd (for example, when d = 1), the requirement 
that (n — 1)d be even forces n to be odd and thus & to be even, say, k = 2 j. Thus 
n has the form [(7 + 4y3)/ /2\ when d is odd, where / is a non-negative integer. 


Ninety-eight correct solutions, 2 “empirical” solutions and 3 incorrect solutions were received. 


ADVANCED PROBLEMS 


6631. Proposed by Ronald Evans, University of California at San Diego and Peter 
Montgomery, Unisys and University of California at Los Angeles. 


We say that f(z) = a) + a,z + +--+ +a,z* is a unimodal reciprocal polynomial 
with positive coefficients if 


O< dy = ay < a) = Ay SG, = Oy gg St SA = Ay 
(1) If z,, 25,..., Zz, are the mth roots of 1 and if 27/n < 1¢, prove that 


Il (2-2, 


larg z,|>¢ 


1s a unimodal reciprocal polynomial with positive coefficients. Here —7 < argz < a. 
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(11) If z,, 25,..., z, are the nth roots of —1 and if w/n < t, prove that 
I] (2-2) 
larg z,|>t 


is a unimodal reciprocal polynomial with positive coefficients. 


6632. Proposed by Gilbert Muraz, Institut Fourier, Université de Grenoble I, St. 
Martin d’ Heres, France, and Pawel Szeptycki and Fred Galvin, University of Kansas, 
Lawrence. 


Let E be a measurable subset of R modulo 1 having positive measure. For real ¢ 
let N, be the set of positive integers n such that nt modulo 1 is in E. Suppose 


{ a,,}%_, 1S a Sequence of positive real numbers such that Ua, = oo. Prove that 
da, = 0 
nEN, 


for almost all ¢ in [0,1]. 


6633. Proposed by Horacio Porta, University of Illinois at Urbana-Champaign. 


Suppose we are given a large positive number N and the further information that 
N =k! for some positive integer k. Show that we can determine k in at most 
C log log log N steps. 


SOLUTION OF ADVANCED PROBLEMS 
Fireworks in a Normed Linear Space 


6546 [1987, 469]. Proposed by Juan Arias-de-Reyna, Universidad de Sevilla, 
Spain. 


Suppose F is a separable infinite-dimensional, normed linear space. Construct a 
sequence X,, X5, X3,... in & such that 

(a) lim, ll&,|| = + 00. 

(b) Every x** in E** is a weak* cluster point of the sequence { x, }. 


Composite solution by the proposer, Mahlon M. Day, University of Illinois. 
Urbana, and N. Tenney Peck, University of Illinois, Urbana. Begin by observing that 
if a set D is weakly dense in a normed space E£, it is weak* dense in E** when E 1s 
regarded as a subspace of E** (this is Goldstine’s theorem). 

Now for any positive integer n, let S, be the set of points in FE of norm a. Let 
the E, be n-dimensional subspaces of E such that EF, C E,,, and U,E£, is 
norm-dense in E. Choose the F, to be finite 1/n-nets in E£, 0 S,, and set 


F=U,F,. (Cf. [1].) The following result shows that the obvious enumeration of F 
will be a sequence x,, X5, X3,... With the desired properties. 
We shall show that F is weakly dense in FE. Let x belong to E, and let U bea 


weak neighborhood of x. We may assume that 


U = {y:|f(y) —filx)| <eforl <i<m}, 
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where each f, belongs to E*. Let 


= max||f,|| 
and 
v={y:|£(»)-f()| < 5 <i<m}, 


Now choose n so that (1) 1 > k, (2) n> ||x||, (3) 2 > 2p/e, and (4) E, A V is 
unbounded. Then x belongs to V and E,, M V is unbounded, so V meets S/O E,,. If 
z belongs to the intersection V 1 S| E,, there is a y in F, such that 


1 
ly — Z|, < 7. 
A 


Finally, for each /, 


REFERENCE 


1. M.M. Day, Normed Linear Spaces, Springer-Verlag, 1958, Ch. II, §5, (2), p. 41 or 1973, Ch. II, §5, 
(2)(a), p. 48. 


No other solutions were received. 


Some Diophantine Equations Not Solved by Fermat 


6558 [1987, 884]. Proposed by Barry Powell, Kirkland, Washington. 


Let p > 3 be prime. By a theorem of Faltings [ Invent. Math. 73 (1983), 349-366] 
the Fermat equations 


2xP+yP=zP, QP xP + yP=2z?, and x? +y?=2? 


have only finitely many nontrivial integral solutions. 
(a) Show there exists a positive integer N such that for g prime, m > 1, and 
q’™ > N, the equation 


gx ?P + y?P _ 72P 


has no positive integral solutions with x, y and z pairwise relatively prime. 
(b) Show that for any positive integer n > 2, the Diophantine equation 


x? _ y® — on 
has at most finitely many integral solutions with x and y relatively prime. 


Solution by the proposer. The consequence of Faltings’ theorem was misstated; the 
given Fermat equations have only finitely many relatively prime solutions. 
Also, the solution would be more direct if the Fermat equations had read 


2xP + yP = 2?, Q2P-lyP + yP = z?P, and x?+ y?=2Z?, 


but of course we must deal with the equations as printed. For (a), let {x,, y,, z,}~., 
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be an enumeration of all solutions to the given Fermat equations. We choose 
N=4 max {[x,|’, [y17, [Z1? }- (1) 
l<i<k 


Suppose first that g # 2, and assume that there exist positive pairwise relatively 
prime integers x, y, and z such that g”x°*? = 27? — y*? = (z? + y?)(z? — y?). 
Since gcd( y, z) = 1, we get that gcd(z? + y?, z? — y?) = 1 or 2. Hence, there exist 
relatively prime positive integers a and b such that x = ab or 2ab and at least one 
of the following holds: 


ga’? =zP?+y? and b?P = 7P + y?, (2) 
2g"a2P =zP+y? and 22Pp-lp2p = oP y?, (3) 

Or 
2?P—-lgmag?P =z?+y? and 2b7? =z? F y?. (4) 


Considering (1) and the equation involving a which holds above, we get that for 
l<i<kK, 


2|2z,|? < 3N < 39" < 3q"a°? < |z? + y?| < 2I2|?. 


Thus, |z,| <|z| where z belongs to a solution (b, y, z) of one of the equations 
involving b above. But this contradicts the requirement that (b7, + y, z) (in the case 
of (2) or (4)) or (257, + y, z) (in the case of (3)) is some (x,, y,, Z,). 

For the case g = 2 the argument is the same, but one should replace (3) and (4) 
in the list above with 


2m—lg?P = 2P ty? and 2b? = 2? Fy? 


This concludes the proof of (a). 

For (b), there are no nontrivial solutions for n = 3 by Fermat’s Last Theorem for 
exponent 3. The usual method of proof by infinite descent for exponent 3 can also 
be used to rule out nontrivial solutions for n = 2. So suppose there exist integers 
x, y, z with (x, y) = 1 for which x® — y® = z” with n > 4. Now exactly one of x, 
y, or z is even. If it is z, then without loss of generality 


x3—y?=2" la” and x? + y? = 2b" (5) 
where (a, b) = 1 and b is odd. Make the substitutions x + y = 2e and x — y = 2f. 
Then e and f are integers and (e, f) = 1. Also, a congruence argument modulo 4 
shows that exactly one of e or f is even. We obtain 

2f(3e2 +f?) =2"-'a" and 2e(e* + 3f7) = 2b". (6) 


Now either (1) 3 + ef or (i1) 3|ef. 
In case (1) it follows from (6) that there are integers a and Pf such that 


f = 2"-°B", e=a". (7) 
Hence, by (6) again, there is an integer y such that 
B02" + 227 4B 20 = yn, (8) 


since (3e* + f*, f) = 1. Now by Faltings’ Theorem, equation (8) has only finitely 
many integer solutions {a, 6, y}. Hence 


max(a|, |B], |y|) < M (9) 
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for some positive integer M. Hence |e| and |f| are both less than 2”"~*M” and we 
see that 


max(|x|, |y|) < 2"71M". (10) 


Hence, the original equation has only finitely many solutions x, y, z with (x, y) = 1, 
z even and 3 + ef. 

For case (ii) let’s assume 3/e (the case 3|f can be handled in a similar manner). 
By (6) it follows that there are integers a, 8, y such that 


e= 3" ly", f _ 27-28" (11) 
and, by (6) again, 

Ban yen +3. 227 4Ren _ 3y" (12) 
since (e* + 3f*,e) = 3. Now by Faltings’ theorem equation (12) has only finitely 
many solutions in integers, and this implies (as in (i)) that the original equation has 
only finitely many solutions with (x, y) = 1, z even, and 3l]ef. 


It remains to consider the case of odd z. Since (x, y) = 1 there are relatively 
prime odd integers a and b such that 


xe-ye=a" and x? +y? =b". (13) 
The substitutions e = x + y and f= x — y yield 
f(3e7 + f?) =4a" and e(e? + 3f7) = 4b", (14) 


where clearly (e, f) = 1. We again distinguish the two cases (i) 3 + ef and (ii) 3 lef. 
These lead, respectively, to equations of the form 


3q2" 4 Br _ 4y" and gan lyen 4 B?" _ Ay", (15) 
and again the result follows from Faltings’ Theorem. 


Editoral comment. Of related interest is the paper by B. Powell and P. 
Ribenboim, “Note on a paper by M. Filaseta regarding Fermat’s Last Theorem,” 
Ann. Univ. Turku, Ser AI, 187 (1985). 

The editors thank Michael Filaseta for considerable help in the editing and 
verification of this solution. 


No other solutions were received. 


Coverup Revealed in Hilbert Space 


6577 {1988, 665]. Proposed by B. Bagchi, G. Misra, and N. S. N. Sastry, Indian 
Statistical Institute, Calcutta. 


(a) Let H be an infinite-dimensional inner-product space. Suppose that finitely 
many closed balls cover the surface S of the unit ball B in H. Prove that these balls 
also cover the center of B. 

(b) Does the above assertion remain valid if H is any infinite-dimensional, 
normed linear space? 


Combined solution by (independently) the Florida State University Mail Room 
Problem Group, O. P. Lossers (The Netherlands), and John Henry Steelman, 
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Indiana University of Pennsylvania. The answer to (b), and hence to (a), is yes. If m 
balls B,,..., B,, (or any m closed convex sets) cover the surface of the unit sphere 
of H, the same situation holds for any m-dimensional subspace of H. Inside this 
subspace, if B, fails to cover 0, there is a hyperplane P through 0 that does not 
intersect B,. Thus the m — 1 closed convex sets B, 1 P,..., B, A P cover SO P, 
the unit sphere of an (m — 1)-dimensional space. This step may be iterated; if 
B,,..., B,,_, also fail to cover 0, then B,, will contain the surface of a one-dimen- 
sional unit sphere and hence 0 by convexity. 


Editorial comment. Most solvers stayed in infinite-dimensional space and applied 
some variation of the Hahn-Banach theorem. A few (S. K. Chung (Hong Kong), 
Jesus Ferrer (Spain), Pei Yuan Wu (Taiwan)) based their solution on results about 
the weak topology. After his solution, Gerd Herzog (West Germany) remarked that 
Borsuk proved the following much deeper result (see, e.g., K. Deimling, Nonlinear 
Functional Analysis, Springer-Verlag, Berlin, 1985, p. 22). Let H be a normed linear 
space, n < dim H, and B,,..., B, a cover of S such that S 1 B, is closed. Then for 
some x in S both x and — x belong to the same B,. In response to a query, Tenney 
Peck showed that in Hilbert space there is no positive r such that if finitely many 
closed balls cover the surface of the unit ball, then at least one of them contains the 
ball of radius r centered at the origin. 

Solved also by Mahlon Day, K. P. Hart (The Netherlands), H. Hunziker and V. Mascioni (Switzer- 


land), Miguel Lacruz, Reiner Martin (West Germany), Eero Posti (Finland), William H. Ruckle, and 
Thomas Starbird. Part (a) was solved by A. A. Jagers (The Netherlands) and the proposers. 


An Application of the Three Square Theorem 


6580 [1988, 774]. Proposed by P. A. Batnik, University of Illinois at Urbana- 
Champaign. 


Prove that if n is an odd positive integer, there exist integers x,, x5, x3, x, such 
that 


n=x~p+x34+xj+ xj, x, > |va] -1. 


Editorial remark. As Godfrey Isaacs and Joseph Lehner pointed out to the 
editors, the published statement of the problem contained a serious typographical 
error, namely, the symbol > was inadvertently replaced by the symbol >. As the 
case n = 23 shows (or, more generally, the case n = 16k* + 7 for k = 1,2,3,...), 
the assertion of the problem is false with strict inequality. 


Solution by J. L. Selfridge, Northern Illinois University, DeKalb. We prove the 
stronger result that if n is a positive integer not divisible by 16, then there exist 
integers x,, Xj, X3, X, satisfying 


n=x?+x24+x24+x2, 2x, > [vn] -1. (*) 


This amounts to showing that we may take x, =|v¥n]—1 or x, =|vn]|. We 
indicate our choice of x, from among these two alternatives by specifying the parity 
of x,. We require the following classical result. 
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LEMMA. A positive integer can be written as a sum of three squares of integers if 
and only if it cannot be expressed in the form 4°(8b + 7), where a and b are 
non-negative integers. 


If n=1 (mod 4), we choose x, to be the even number in the set 
{|vn | —1,|[Vn J}. Then n — x? =1 (mod 4) and so n— x? is a sum of three 
squares by the above lemma. 

If n = 2,3,4,6,7 (mod 8), we choose x, to be the odd number in the set 
{|va | —1,|Vn J}. Then n — x? = 1,2,3,5,6 (mod 8) and so again n — x} is a 
sum of three squares by the above lemma. 

If n= 8 (mod 16), we choose x, to be the even number in the set 
{|va | — 1,|va |}. Then n — xf = 4,8 (mod 16) and so once more n — x? is a sum 
of three squares by the above lemma. 

In all three of these cases there exist integers x,, x5, x3, x, such that (*) holds. 

The assertion of the problem is not in general true for integers n divisible by 16; 
for example, it fails when n = 1136, since 1136 — 33? = 47 and 1136 — 32? = 47+ 7. 
Further, if » = 27’! where r > 3, the only way of expressing n as a sum of four 
squares 1s (up to permutations) 


n = (2")° + (2°)? + 0? + 0? = (yn/2 yr + (/n/2 ) + 02 + 02, 
where 2” = /n/2 < [Vn | - 3. 


Solved also by David Callan, R. J. Chapman (England), M. Chiebik (Czechoslovakia), Carl Cotner 
and San Ling (students), O. P. Lossers (The Netherlands), Nick MacKinnon (England), and Carl 
Pomerance. 


An Integral for Jacobi 


6582 [1988, 774]. Proposed by N. J. Fine, Deerfield Beach, FL. 


Prove that 


Solution by A. A. Jagers, Universiteit Twente, Enschede, The Netherlands. We 
start with 


> g” _ IT @ _ gq") 4 g2n-hy’, 


a special case of Jacobi’s triple product identity [Theorem 352 of the fifth edition of 
An Introduction to the Theory of Numbers by G. H. Hardy and E. M. Wright]. Since 


I] @ 4 q?"*) _ 
n=1 


(1 + q")(1 + q?")* 


_18 


= 
| 
_ 


_T (1 —@°")(1— 4") 


n=1 (1 _ q”)( — q*") 
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we have 


On the other hand, 


4 
aa 2 aa md 
q” —-1+8 
(o | x 1 — q 
m#0 mod 4 


by Theorem 385 of Hardy and Wright, and logarithmic differentiation yields 


d | i a _ Wo - qi") 


dq\on=1 1Q~q")’ } nei (L—q")! 
oC m m CO 4m 4m 
x18 5 7 _g a 
m=1 1 — q m=1 1 — q 


Hence, 


dq n=] @! — gq") 
Now recall that the Dedekind eta-function y(7) 1s defined by 
n(r) = qi7™* I] (1 —q"), 
where g = exp(2zir) and Im(7) > 0. Thus 


on oO 1 — 2n\20 oO 1] —e 4™ 
(Tare Toe 


(1 ~q")” 


(ef S| - See 


since n(—1/7r) = V— itn(7). 
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Editorial comment. Jagers’ solution is essentially the same as the proposer’s. Jesse 
Deutsch remarks that the integral appears in Nathan Fine’s book Basic Hypergeo- 
metric Series and Applications, Mathematical Surveys and Monographs, No. 27, 
American Mathematical Society, on p. 89 (equation 33.43). Both Kee-Wai Lau 
(Hong Kong) and Klaus Zacharias (Berlin) presented solutions based on elliptic 


integrals and the Legendre relation 
T 
E(k) K(k’) + E(k’) K(k) — K(k) K(k’) = 5° 


The solutions mentioned were the only ones received. 
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The Mathematics of Plato’s Academy. A New Reconstruction. By D. H. Fowler, 
Oxford Science Publications, the Clarendon Press Oxford University Press, New 
York, 1987, xxi + 401 pp. 


RICHARD K. GUY 
Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada, T2N 1N4 


In each of FIGURES 1-3, a side AB of unit length of a regular polygon is folded 
onto the part AE of a diagonal of length x (x is different in each figure, of course) 
forming two similar triangles ACF and CDE. FIGURE 1 shows the relationship 
between what we now call golden section (a name apparently coined by Martin 
Ohm, younger brother of the physicist, around 1834) and the regular pentagon 
(Euclid’s Elements: II, 10; [V, 10 and 11). In fact, 


DE CF 
CE AF’ 
x-1 1 1 1 1 1 1 1 1 
=—(*)x=1+-=14+- -=1+-> - = =: 
1 x x 1+ x 1+12+14+1¢ 
In each of FIGURES 2 and 3, 
| AC CD x l-y 
CF DE’ 17 iy 
Add 
ly 
ly 
to the two sides (componendo): 
1+x 1 y 1 
1 yp? 1 +x 
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In FIGURE 2, 
x-1l1=y, x=lt+y=1+ 
y y tos (*) 
1 1 1 1 1 1 1 1 1 
x=1+-— =l+= = =1l+- +> = = 
2+1+x 24+2+14+x 24+424+24+2+¢+ 
In FIGURE 3, 
x-1 2 14 1+x 
— = + 5 =j]+ 
y lv - y y (*) 
2 1 1 1 1 1 1 
x=1l+2y=1+ yt aya lty —~J4-— _ 
1+ 1l+y 1+1+x 1+2+1+y 
1 1 1 1 1 1 1 1 1 
=]+- —- = yt —- FOS 
1+2+1+1+4 1+2+1+2+1+4y 
1 1 1 1 1 1 
—~]J4-— 


1+24+14+2+1+2+4+— 


In present day language, we have found the continued fraction expansion for the 
golden mean, (1 + J5 )/2, for ¥2, and for V3. Each of the three arguments, at least 
as far as (*), have used only the idea of the ratio (Jogos) of lengths of line segments, 
very familiar to the Greeks. The claim is that the Greeks could also well understand 
the anthyphairetic ratios that we now call continued fractions. The author derives 
the name anthyphairesis from the Greek verb anthuphairein, which Euclid uses at 
Elements VII, 1 and 2, and X 2 and 3, for the operation of reciprocal subtraction, 
whereby he finds the greatest common measure: the process we now call the 
Euclidean algorithm, or more exactly, algorism, as the author prefers. In fact he 
suggests the ancient to modern “proportion”: 


(anthyphairesis) :(algorisms) : :(continued fractions) :(algorithms) 


If we look at the process in one direction, it is not difficult to see how the Greeks 
discovered the incommensurability of /2 and other surds. In the other direction, it 
is not unduly fanciful to visualize the infinite process which generates ever closer 
rational approximations, while never attaining the limit. Compare several recent 
notes [3, 13, 14, 15]. 

How did the Sumerians, a millenium before Pythagoras, obtain the remarkable 


approximation 
24 51 10 
1 — 
60 60° 60° 
to V2? If the side of the square in Yale Babylonian Collection 7289 [9, Plate 6) were 
a mile, then, to mix metraphors, the error in its diagonal would be less than a tenth 
of a millimetre! Since when has man been able to measure as accurately as that? 
Perhaps the Sumerians used the ‘“‘Heronian algorithm,” which we now recognize 
as a simple application of the Newton-Raphson process. One is too small, and two 
too big, so try their average, 3/2. On dividing into two, 4/3 1s too small and 3/2 is 
too big, so try their average: 
3742-2? 17 177+2-12% 577 


—=-—: Try again: ——— = —. 
2-2-3 12 2-12-17 408 


442 RICHARD K. GUY [May 


Not as good as the Sumerians, since this answer is nearer to 11 in the third 

sexagesimal place. But they must have noticed that the arithmetic mean was greater 

than the geometric mean, and that their approximations, after the first, were 

over-estimates, and they had the good judgement to take 10/60° instead of 11/607. 
If we look at the convergents 


. ar 3 7 17 41 99 239 577 


—_—_ ell 


1 2 5 12 29 70 169 408 


to the continued fraction we obtained near the outset, we see the approximations as 
the first, second, fourth and eighth [compare 2, 4]. 

This is not necessarily to suggest that the Sumerians had the concept of 
continued fractions, but the Greeks could have had. 

The Babylonian astronomers found (but with not quite this accuracy!) that the 
seasons recur every 365.242199 days and that the period of the moon’s phases is 
29.530588 days. The ratio of these numbers gives the continued fraction 


111 1 21 ~=21 ~21 
W+t-= = 


2414+24+14+1+4+174+2 
as far as (or slightly further than) even modern astronomical measurements justify 
our going. The convergents 


12 25 37 99 136 235 4131 8497 


are roughly the number of “months” in a “year.” The ratio 235:19, found by 
truncating the large partial quotient 17, is a very good approximation, and 1s called 
the Metonic cycle since it was known to the Athenian astronomer Meton, who 
flourished a little earlier than Plato. It was almost certainly also known to earlier 
Babylonian astronomers and it is still used to determine both the Jewish calendar 
and the date of Easter. 

The use of continued fractions to calculate gear ratios is now well known. But the 
corresponding results were known to, and implemented by, the Greeks [11, 12]. 

Fowler invents illustrative conversations between a putative slave-boy of remark- 
able intelligence and prolonged youth, and each of Socrates, Eudoxus and Archytas, 
in which various aspects of anthyphairesis are developed, including the relationship 
with music theory. It is a pity that this has to be one of the most speculative areas as 
far as our knowledge is concerned. The possibility is explored of compounding v2: 
1 = [1,2,2,2,...] and v3: 1 = [1,1,2,1,2,1,2,...] to give V6: 1 = 
[2, 2,4, 2,4, 2,4,...], although even the slave-boy is unable to explain what is going 
on. Indeed, had he tried to add these ratios, the results would have been even more 
bewildering, and Fowler quotes Khinchin [6, p. 20] as Gosper had done before him: 


There is, however, another and yet more significant practical demand that 
the apparatus of continued fractions does not satisfy at all. Knowing the 
representations of several numbers, we would like to be able, with relative 
ease, to find the representations of the simpler functions of these numbers 
(especially, their sum and product). In brief, for an apparatus to be suitable 
from a practical standpoint, it must admit sufficiently simple rules for arith- 
metical operations; otherwise, it cannot serve as a tool for calculation. We 
know how convenient systematic fractions are in this respect. On the other 
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hand, for continued fractions there are no practically applicable rules for 
arithmetical operations; even the problem of finding the continued fraction for 
a sum from the continued fraction representing the addends is exceedingly 
complicated, and unworkable in computational practice. 


It is not as well known as it should be that Bill Gosper has in fact met this 


“significant practical demand.” Some indications of this are found in [1] and in [7] 
and [8], but not in the book under review. 


There are more misprints and inaccuracies than one expects from Clarendon 


Press. However, it 1s a scholarly and well-documented work. On the other hand it is 
highly idiosyncratic and many will find it over fanciful. At least the author warns us 


at 


the outset by quoting Neugebauer’s closing paragraph [9, p. 177]: 


In the “Cloisters” of the Metropolitan Museum in New York there hangs a 
magnificent tapestry which tells the tale of the Unicorn. At the end we see the 
miraculous animal captured, gracefully resigned to his fate, standing in an 
enclosure surrounded by a neat little fence. This picture may serve as a simile 
for what we have attempted here. We have artfully erected from small bits of 
evidence the fence inside which we hope to have enclosed what may appear as 
a possible, living creature. Reality, however, may be vastly different from the 
product of our imagination; perhaps it is vain to hope for anything more than 
a picture which is pleasing to the constructive mind when we try to restore the 
past. 
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Elementary, T(13). Primer for College Algebra. 
Karl J. Smith, Patrick J. Boyle. Brooks/Cole, 1990, 
xii + 563 pp. [ISBN: 0-534-12288-4] Excellent text 
“with the goal of preparing mature beginning stu- 
dents with the ...skills for college algebra.” Also 
appropriate for students needing review. Good ex- 
planations and pointers in worked examples. Has 
diagnostic tests prior to each chapter. Not enough 
graphs. From beginning concepts to conic sections, 
exponents and logs, sequences, series, and the bino- 
mial theorem. FR 


Precalculus, T(13: 1), S. Precalculus: Func- 
tions and Graphs, Sizth Edition. Earl W. Swokow- 
ski. PWS-Kent, 1990, xvi + 676 pp. ([ISBN: 0- 
534-92086-1] Changes from the Fifth Edition (TR, 
June-July 1987): solutions to several examples have 
been rewritten; more applied problems have been 
added; and exercises have been expanded. SP 
History, S*(15-16), P*, L*. The History of Mod- 
ern Mathematics. Eds: David E. Rowe, John Mc- 
Cleary. Academic Pr, 1989. Volume I: Ideas and 
Their Reception, xvi + 453 pp, $39.50 (ISBN: 0- 
12-599661-6]; Volume II: Institutions and Applica- 
tions, xvi + 325 pp, $37.50. [ISBN: 0-12-599662-4] 
Twenty-four essays on the mathematics of the 19th 
and 20th centuries. Volume I focuses on the emer- 
gence and reception of new ideas in pure mathemat- 
ics, e.g., three articles on the assimilation and ex- 
pansion of ideas of Gauss and Riemann. Volume IJ is 
concerned with ideas from mathematical physics and 
applied mathematics. Representative articles are the 
work of Liouville using Gauss’ intrinsic approach 
to surfaces to derive the Hamilton-Jacobi equations, 
and the revolutionary impact of high-speed comput- 
ers on numerical analysis through the work of von 
Neumann. Very interesting reading. MPR 


Logic, T**(14), L. Introduction to Proof in Ab- 
stract Mathematics. Andrew Wohlgemuth. Saunders 
College, 1990, xii + 366 pp. [ISBN: 0-03-026782-X] 
Very interesting text attempting to ease the tran- 
sition to abstract mathematics for the average stu- 
dent. Focuses on proof structures, techniques, and 
ideas. Concepts we assume the students just “pick 


up” along the way are clearly presented in simple 
contexts. Topic selection is excellent: sets and rules 
of inference; functions; relations; operations and in- 
tegers (including an introduction to groups); proofs 
in analysis; cardinality; and groups. Truth tables are 
mercifully relegated to an appendix. Plenty of exer- 
cises and examples. Highly recommended for transi- 
tion courses. MPR 


Graph Theory, P, L. Labyrinth Problems. Armin 
Hemmerling. Teubner-Texte zur Math., Band 
114. BG Teubner Leipzig, 1989, 215 pp, 27 DM (P). 
[ISBN: 3-322-00704-9] There are two classical prob- 
lems in labyrinth theory. The first is the search 
problem: to find an algorithm guaranteed to even- 
tually reach every point in the labyrinth. The sec- 
ond, clearly a special case of the first, is the escape 
problem. Interest in these problems is motivated by 
the fact that the possible configurations of a non- 
deterministic Turing machine can be modeled as a 
network. This book contains the main results, tech- 
niques, and open questions of labyrinth theory. SM 


Graph Theory, P. Algorithms On Graphs. H.T. 
Lau. TAB Books, 1989, vi + 231 pp, $29.95. [ISBN: 
0-8306-3429-0] Explicit Fortran programs for 20 
problems in graph theory, including connectivity, 
shortest paths, traversability, node coloring, and pla- 
narity testing. From the Preface: “The main objec- 
tive of this book is to provide computer programs 
that can be used with minimal effort for problem 
solving without much concern for their underlying 
methodology and implementation.” BC 


Graph Theory, P. Coloring Theories. Steve Fisk. 
Contemp. Math., V. 103. AMS, 1989, xv + 164 pp, 
$31 (P). [ISBN: 0-8218-5109-8] Global properties of 
colorings and maps of simplicial complexes. BC 


Combinatorics, T**(14-17: 1, 2), L**. Con- 
crete Mathematics: A Foundation for Computer Sci- 
ence. Ronald L. Graham, Donald E. Knuth, Oren 
Patashnik. Addison-Wesley, 1989, xiii + 625 pp. 
[ISBN: 0-201-14236-8] “What do Graham, Knuth, 
and Patashnik know that I don’t?” Here is a good 
partial answer in an incredibly well-written presen- 
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tation of the sort of material that lives in Knuth’s 
mathematical foundations in The Art of Computer 
Programming. This book represents nearly twenty 
years of classroom experience with material rooted in 
many fields: discrete mathematics, an antidote to the 
excesses of the “new math,” and the need for bring- 
ing together the spirit of mathematics which inspires 
those neat algorithms some computer scientists use. 
Presented with grace and (marginal) graffiti. Con- 
tains among other things chapters on recurrent prob- 
lems, sums, integer functions, special numbers (Stir- 
ling and the like), and asymptotics. Each chap- 
ter has many exercises (with answers) running from 
warmups through occasional research problems. JAS 


Combinatorics, T*(13-17: 1, 2), S, L. Intro- 
ductory Combinatorics, Second Edition. Kenneth P. 
Bogart. Harcourt Brace Jovanovich, 1990, xvii + 622 
pp, $38. (ISBN: 0-15-541576-X] In addition to a num- 
ber of improvements in clarity and style, there is new 
material throughout (e.g., more algorithmic content 
related to trees, matchings, block designs, coding, 
interval orders), and a new chapter on counting un- 
der the action of a group. Additional exercises and 
examples, and answers to odd numbered exercises. 
(First Edition, TR, May 1984.) LCL 


Discrete Mathematics, T(14-15: 1), L. Discrete 
Mathematics, Second Edition. Richard Johnson- 
baugh. Macmillan, 1990, xiv + 705 pp. [ISBN: 0-02- 
359690-2] Features extensive revision and expan- 
sion (from 349 pages to 582 pages) over the original 
edition (Extended Review, June-July 1987). Begins 
with new chapters on logic and proof, the language 
of mathematics, and algorithms; many changes and 
reorganization in later chapters as well as added ex- 
amples and exercises. Appendix, references, hints 
and solutions to exercises. JS 


Number Theory, P. Automorphic Forms, Shimura 
Varieties, and L-functions. Eds: Laurent Clozel, 
James S. Milne. Academic Pr, 1990, $49.50 each. 
Volume I, Perspectives in Math., V. 10, xii + 437 
pp [ISBN: 0-12-176651-9]; Volume II, Perspectives 
in Math., V. 11, xii + 391 pp. [ISBN: 0-12-176652-7] 
Expanded lectures from a July 1988 conference held 
at the University of Michigan. May be viewed as an 
update of the proceedings of the 1977 AMS Summer 
Research Institute at Corvallis. Articles tend to be 
advanced surveys rather than focused on one indi- 
vidual’s work. GG 


Number Theory, P*. Transcendental Numbers. 
Andrei Borisovich Shidlovskii. Stud. in Math., V. 
12. Transl: Neal Koblitz. Walter de Gruyter, 1989, 
xix + 466 pp, $89.50. (ISBN: 3-11-011568-9] Defini- 
tive exposition of the author’s extensive work on 
transcendental numbers. A major focus is the tran- 
scendental nature of values of E-functions, a partic- 
ular class of entire function. Includes a long bibliog- 
raphy and some supplementary remarks written for 
the English edition. A valuable reference book. SG 


Linear Algebra, T(13-14: 1, 2). Elementary Lin- 
ear Algebra with Applications, Second Edition. W. 
Keith Nicholson. PWS-Kent, 1990, xvi + 576 pp, 
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$34. [ISBN: 0-534-92189-2] Changes from the First 
Edition (TR, February 1987) include highlighted def- 
initions, new material on Euclidean n-space, some 
new exercises and examples, new sections on LU- 
factorization and complex matrices, and some rewrit- 
ing and reorganization. RH 


Algebra, T(15-17: 1, 2), L. Abstract Algebra, 
An Introduction. Thomas W. Hungerford. Saun- 
ders College, 1990, xxi + 575 pp, $49.25. [ISBN: 
0-03-030558-6] Departs markedly from the pattern 
of most recent texts. Part 1: The Core Course, begins 
with the ring of integers through modular arithmetic, 
introduces abstract rings, and moves on to poly- 
nomial rings and congruences, ideals, and quotient 
rings. Groups are introduced in Chapter 7. Part 2: 
Advanced Topics, treats field extensions, more group 
theory, and Galois theory including solvability by 
radicals. Part 3: Excursions and Applications, has a 
selection of applications such as geometric construc- 
tions and coding theory. Appendices, bibliography, 
answers and hints for exercises. JS 


Differential Equations, T(16-18), S, L. Elemen- 
tary Stability and Bifurcation Theory, Second Edi- 
tion. Gérard Iooss, Daniel D. Joseph. Undergrad. 
Textsin Math. Springer-Verlag, 1990, xxili + 324 pp, 
$49.95. [ISBN: 0-387-97068-1] A substantially re- 
vised edition with efforts towards simplification and 
clarification, especially of material found in the final 
two chapters of the earlier version. An additional 
chapter on stability and bifurcation in conservative 
systems is included. KS 


Differential Equations, P. Lectyre Notes in Con- 
trol and Information Sciences-126: Stochastic Differ- 
ential Systems. Eds: N. Christopeit, K. Helmes, 
M. Kohlmann. Springer-Verlag, 1989, ix + 342 
pp, $47 (P). [ISBN: 0-387-51299-3] Proceedings of 
the Fourth Bad Honnef Conference held in June 
1988. Twenty-nine papers with thematic emphasis 
on adaptive methods in stochastic systems analysis 
and on the theory of random fields. JO 


Partial Differential Equations, P. Nonlinear 
Wave Equations. Walter A. Strauss. CBMS Reg. 
Conf. Ser. in Math., No. 73. AMS, 1989, x + 91 pp, 
$21 (P). [ISBN: 0-8218-0725-0] A survey of and in- 
troduction to central aspects of nonlinear wave equa- 
tions in absence of shocks. Chapters include invari- 
ance, singularities, solutions of small amplitude, scat- 
tering, stability of solitary waves, Yang-Mills equa- 
tions, and Vlasov-Maxwell equations. SP 


Partial Differential Equations, T(15: 1), L. 
Introduction to Partial Differential Equations from 
Fourier Series to Boundary-value Problems. Arne 
Broman. Dover, 1989, vi + 183 pp, $5.95 (P). [ISBN: 
0-486-66158-X] An unabridged and corrected repro- 
duction of the original (1968 Addison-Wesley text, 
TR, October 1971). Contains a bibliography new to 
this edition. MLR 

Partial Differential Equations, P. Lecture Notes 
in Mathematics-1402: Nonlinear Hyperbolic Prob- 


lems. Eds: C. Carasso, et al. Springer-Verlag, 1989, 
249 pp, $24.30 (P). [ISBN: 0-387-51746-4] Proceed- 
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ings of a conference held in Bordeaux, France in June 
1988. Contains sixteen papers (thirteen in English, 
three in French) addressing a variety of nonlinear 
problems, both numerically and analytically, rang- 
ing from fluid flow to stability of the Minkowski met- 
ric. SP 


Partial Differential Equations, P. Variational 
Methods for Boundary Value Problems for Systems of 
Elliptic Equations. M.A. Lavrent’ev. Transl: J.R.M. 
Radok. Dover, 1989, 153 pp, $4.95 (P). [ISBN: 
0-486-66170-9] Translation of a 1963 monograph, 
with corrections and an index. Approaches classical 
boundary value problems via geometric properties of 
conformal and quasi-conformal mappings, using the 
basic scheme of Hilbert and Tonnelli. For mathe- 
maticians and theoreticians in mechanics. DFA 


Functional Analysis, T(16-17), S, L. Functional 
Analysis with Applications. B. Choudhary, Sudarsan 
Nanda. Wiley, 1989, xii + 344 pp, $34.95. [ISBN: 0- 
470-21564-X] Starts with introduction-level chapters 
on metric, normed and Hilbert spaces, Banach alge- 
bra and spectral theory, and operator theory. Con- 
cludes with chapters on fixed point theory, nonlinear 
operators, and sequence spaces. Intended as a text 
with exercises of varying difficulty closing each chap- 
ter. KS 


Functional Analysis, T(15-17: 2), P, L. Fourier 
Analysts. T.W. Korner. Cambridge U Pr, 1988, xii 
+ 591 pp, $34.50 (P). [ISBN: 0-521-38991-7] Paper- 
back reprint of a 1988 text (TR, December 1988) con- 
taining over 100 short interconnected essays which, 
taken together, lead the reader through the the- 
ory and applications of Fourier analysis. The short 
chapters (the longest is ten pages) makes working 
through this text seem less foreboding. No formal 
exercises. SM 


Analysis, P. I.J. Schoenberg, Selected Papers. Ed: 
Carl de Boor. Birkhauser Boston, 1988, $150 set 
(ISBN: 0-8176-3378-2]. Volume 1, xviii + 405 pp; 
Volume 2, xvi + 441 pp. This two-volume sam- 
pler contains about 45 of I.J. Schoenberg’s 173 pa- 
pers. Schoenberg’s publications span the period 
1928-1987; subjects range from splines to motion of 
billiard balls to placement of frets on guitars. Pa- 
pers in Volume 1 cover number theory, geometry, real 
analysis, and complex analysis. Volume 2 emphasizes 
aspects of spline theory, especially cardinal splines. 
Interspersed in both volumes are brief commentaries 
by experts, summarizing Schoenberg’s contributions 
in various areas. PZ 


Analysis, S(18), P. The Collected Works of Arne 
Beurling, Volume 2: Harmonic Analysis. Eds: L. 
Carleson, et al. Birkhauser Boston, 1989, xx + 389 
pp, $39. [ISBN: 0-8176-3412-6] The second of two 
volumes on complex analysis and harmonic analysis, 
respectively, of papers and seminar notes by Arne 
Beurling who did pioneering work in e.g., extremal 
length, inner functions, and spectral theory. The 
present volume contains 28 published papers, 1937- 
1970, together with hitherto unpublished seminar 
and lecture notes, the latter from a 1977-78 Mittag- 
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Leffler series on harmonic analysis. A brief Foreword 
by L. Ahlfors and L. Carleson assesses Beurling’s 
mathematical contributions. PZ 


Differential Geometry, P. Spin Geometry. H. 
Blaine Lawson, Jr., Marie-Louise Michelsohn. Math. 
Ser., V. 38. Princeton U Pr, 1989, xii + 427 pp, 
$55. [ISBN: 0-691-08542-0] Presents the fundamen- 
tal concepts of the geometry of spin manifolds (man- 
ifolds with a simply connected structure group) us- 
ing Clifford algebras and their representations as a 
principle theme. The main tools used included the 
various index theorems of Atiyah and Singer and the 
theory of Cé,-linear elliptic operators. The last chap- 
ter is devoted to applications. MPR 


Differential Geometry, T(18), S, P. Lectures on 
Minimal Surfaces, Volume 1: Introduction, Funda- 
mentals, Geometry and Basic Boundary Value Prob- 
lems. Johannes C.C. Nitsche. Cambridge Univer- 
sity Pr, 1989, xxv + 563 pp, $135. [ISBN: 0-521- 
24427-7] An English translation from the original 
German text of the first five chapters of Vorlesungen 
uber Minimalflachen supplemented with 66 pages of 
appendices and a collection of open problems. This 
broad survey of parametric minimal surfaces in Eu- 
clidean space is self-contained and includes consid- 
erable historical information. Bibliography contains 
nearly 1600 references. SP 


Geometry, P. Geometric Aspects of Banach Spaces. 
Eds: E. Martin-Peinador, A. Rodés. London Math. 
Soc. Lect. Note Ser., V. 140. Cambridge U Pr, 1989, 
194 pp, $24.95 (P). [ISBN: 0-521-36752-2] Contains 
15 essays in honor of Antonio Plans. MLR 


Geometry, T(16-17: 1, 2), L*. Metric Affine Ge- 
ometry. Ernst Snapper, Robert J. Troyer. Dover, 
1989, xx + 435 pp, $10.95 (P). [ISBN: 0-486-66108-3] 
Republication of the 1971 edition by Academic Press 
(TR, March 1972; Extended Review, March 1973). 
Based on Snapper course at First Cooperative Sum- 
mer Seminar at Cornell University in 1964. Read- 
ers should have had a course in linear algebra and 
must know some elementary group theory. Exercises 
complement and extend the text. On the virtues 
of linear algebra as source for the axiom systems 
of all affine and projective geometries, both metric 
and nonmetric, the last three lines of the book say it 
best: “If he (Euclid) had lived today, what would he 
have done? He would have told us to do away with 
Hilbert’s system and use linear algebra instead. And 
what a book he would have written!” JDEK 


Geometry, P, L”*. Teztual Studies in An- 
cient and Medieval Geometry. Wilbur Richard 
Knorr. Birkhauser Boston, 1989, xvil + 852 pp, 
$119. [ISBN: 0-8176-3387-1] Complements the au- 
thor’s The Ancient Tradition of Geometric Problems 
(Birkhauser Boston, 1986, TR, October 1987). Ex- 
position of the problems of cube duplication and an- 
gle trisection by Pappus and Eutocius. Treatment of 
Archimedes’ Dimension of the Circle by Pappus and 
Theon and parallel versions in Arabic, Hebrew, and 
Latin. Three main parts: I. Ancient Texts on Ge- 
ometric Problems; II. Arabic Geometric Texts and 
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Their Ancient Sources; and III. The Textual Tradi- 
tion of Archimedes’ Dimension of the Circle. Nu- 
merous notes and references. JDEK 


Geometry, T*(13: 1). Essentials of Geometry for 
College Students. Margaret L. Lial, Arnold R. Stef- 
fensen, L. Murphy Johnson. Scott Foresman, 1990, 
xiii + 431 pp, $28. (ISBN: 0-673-38419-5] Geome- 
try needed for further courses in college mathemat- 
ics. Requires only beginning algebra. Easy to read; 
pace is slow. “Applications.” Exercise sets, some of 
which are topped off with “old chestnuts”—a definite 
plus! Two-column proofs. Chapters on solid geom- 
etry, analytical geometry, and trigonometry. Nice 
collection of marginal notes and photographs. Good 
figures throughout. Test manual, solutions manual, 
transparencies are available for instructor; solution 
manual (odds plus) available for student. JDEK 


Geometry, S(13-18), P, L**. Recent Advances 
in Geometric Inequalities. D.S. Mitrinovi¢, J.E. 
Peéari¢, V. Volenec. Math. & Its Applic. Kluwer 
Academic, 1989, xix + 710 pp, $149. [ISBN: 90-277- 
2565-9] The “Bible of Bottema” (Geometric In- 
equalities, O. Bottema, et al., 1969; TR, February 
1970; Extended Review, June-July 1971) contains 
about 400 varied geometric inequalities related to fig- 
ures in the plane, and about 225 authors are cited. 
This monumental work updates and extends this be- 
ginning to several thousand inequalities, not only for 
figures in the plane but for figures in space and hyper- 
space as well, and over 750 names are cited. More- 
over, the work contains extensive references, various 
methods of proof, as well as conjectures and unsolved 
problems. A voluminous and encyclopedic reference 
that should be in every mathematics library. LCL 


Algebraic Topology, T(18: 1, 2), P. Lecture 
Notes in Mathematics-1408: Transformation Groups 
and Algebraic K-Theory. Wolfgang Luck. Springer- 
Verlag, 1989, xii + 443 pp, $42.70 (P). [ISBN: 0-387- 
51846-0] Self-contained introduction to the prob- 
lem of classifying group actions on manifolds and its 
connection to algebraic K-theory. Divided into three 
main chapters: geometrically defined invariants, al- 
gebraically defined invariants, and RI'-modules and 
geometry. Chapters 2 and 3 contain previously un- 
published results. Each section begins with a brief 
summary of what is to be covered and ends with 
both an exercise set (containing some results not dis- 
cussed in the section) and a comments section with 
references for further reading. SB 


Topology, P. Lecture Notes in Mathematics-1411: 
Topological Fized Point Theory and Applications. 
Ed: Boju Jiang. Springer-Verlag, 1989, vi + 203 
pp, $20 (P). (ISBN: 0-387-51932-7] Proceedings of 
a conference at the Nankai Institute of Mathematics, 
Tianjin, China held April 1988. Twenty short papers 
from various areas of fixed point theory. SP 


Dynamical Systems, P. Dynamical Systems IV: 
Symplectic Geometry and its Applications. Eds: V.I. 
Arnol’d, S.P. Novikov. Encycl. of Math. Sci., 
V. 4. Springer-Verlag, 1990, 283 pp, $59. [ISBN: 
0-387-17003-0] Contains three somewhat indepen- 
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dent sections translated from Russian. The first is 
a survey of the fundamental concepts of symplectic 
geometry. The latter two articles address applica- 
tions to the theory of integrable systems and quan- 
tigation. KS 


Dynamical Systems, P. The Connection Between 
Infinite Dimensional and Finite Dimensional Dy- 
namical Systems. Eds: Basil Nicolaenko, Ciprian 
Foias, Roger Temam. Contemp. Math., V. 99. AMS, 
1989, xii + 357 pp, $41 (P). [ISBN: 0-8218-5105- 
5] Eighteen papers on the prospect that infinite- 
dimensional dynamics may not be too much harder 
than their finite-dimensional cousins (which are al- 
ready hard enough). BC 


Dynamical Systems, S(16-17), L. Statistical 
Analysis and Control of Dynamic Systems. Hiro- 
tugu Akaike, Toichiro Nakagawa. Trans: Hirotugu 
Akaike, Maki Akaike Momma. Math. & Its Ap- 
plic. Kluwer Academic, 1988, xiii + 210 pp, $99. 
[ISBN: 90-277-2786-4] Nicely illustrates use of sta- 
tistical techniques to automating operation of ce- 
ment kilns. Carefully defines problem ane develops 
necessary statistical background. Includes Fortran 
listing and review of the TIMSAC package. Note 
price. RWJ 


Control Theory, S(18), P. Adaptive Control & 
Identification: Conflict or Confluz? J.W. Polder- 
man. CWI Tract, V. 67. Mathematisch Centrum, 
1989, 117 pp, Dfl. 17,80 (P). [ISBN: 90-6196-377-X] 
Concerned with problems of adaptive control. What 
distinguishes this theory from the classical control 
theory is that the laws governing the systems to be 
controlled are no longer completely known. (Reprint 
of a doctoral thesis.) KS 


Control Theory, T(15-17), L. Linear Systems 
and Optimal Control. C.K. Chui, G. Chen. Ser. in 
Inform. Sci., V. 18. Springer-Verlag, 1989, viii + 155 
pp, $59.50. (ISBN: 0-387-18737-5] Seeks to present 
a brief and reasonably complete study of linear sys- 
tems in both continuous time and discrete time set- 
tings. Optimal control of these systems is also cov- 
ered. The usual topics are covered, namely state 
space descriptions, controllability, observability, sta- 
bility and optimal control via variational techniques. 
Plenty of exercises after each chapter. SM 


Control Theory, P, L. Lecture Notes in Control 
and Information Sciences-136: Stochastic Systems 
and Optimization. Ed: J. Zabqzyk. Springer-Verlag, 
1989, vi + 373 pp, $59.50 (P). [ISBN: 0-387-51619-0] 
Contains the proceedings of the Sixth IFIP Working 
Conference on Stochastic Systems and Optimization 
which was held in Warsaw, Poland during September 
1988. The papers are organized into three categories: 
finite-dimensional systems, infinite-dimensional sys- 
tems, and control and estimation. SM 


Stochastic Processes, P. Optimization of Stochas- 
tic Systems: Topics in Discrete-Time Dynamics, 
Second Edition. Masanao Aoki. Economic The- 
ory, Econometrics, & Math. Econ. Academic Pr, 
1989, x + 417 pp, $64.95. (ISBN: 0-12-058851-X] 
A graduate-level introduction to sequential decision 
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analysis under uncertainty. This edition has been ex- 
tensively revised to make it more accessible to grad- 
uate students and professionals in the social sciences 
(especially economics). MLR 


Elementary Statistics, T(13-14: 1, 2). Intro- 
duction to Statistics. Anthony A. Salvia. Saunders 
College, 1990, xv + 603 pp, $45.25. [ISBN: 0-03- 
012634-7] Covers descriptive statistics, basic prob- 
ability models, estimation, hypothesis testing (in- 
cluding ANOVA and chi-squared tests), correlation, 
and regression. Chapters generally contain a section 
devoted to nonparametric procedures and indicate 
when these may be preferred to the corresponding 
parametric procedures. Some computer output in- 
cluding MINITAB. Requires only high school alge- 
bra. RWJ 


Statistics, S(18), P. Bayesian Analysis of Time 
Series and Dynamic Models. Ed: James C. Spall. 
Statistics: Textbooks & Monographs, V. 94. Marcel 
Dekker, 1988, xxvii + 536 pp, $99.75. [ISBN: 0-8247- 
7936-3] This book presents Bayesian techniques for 
analyzing dynamic systems, systems which change 
over time. Each chapter is written by a different 
author or authors who are experts in the fields of 
statistics, control systems, engineering, or economet- 
tics. The models presented can be broken up into two 
general types: traditional ARMA time series models, 
and linear and nonlinear state-space models. The 
book begins with an historical overview of the field. 
Each chapter contains references to other work in the 
field. JJ 


Programming, P. Ada Quality and Style: Guide- 
lines for Professional Programmers. The Software 
Productivity Consortium. Van Nostrand Reinhold, 
1989, ix + 230 pp, $26.95 (P). [ISBN: 0-442-23805-3] 
A well-focused technical essay on such things as in- 
dentation, notational conventions, and programming 
style with the aim of promoting readable, reliable, 
and portable Ada programs. JAS 


Programming, P, L. Weaving a Program: Lit- 
erate Programming in WEB. Wayne Sewell. Van 
Nostrand Reinhold, 1989, xx + 556 pp, $34.95 (P). 
[ISBN: 0-442-31946-0] A comprehensive overview of 
the WEB programming environment created by Don- 
ald Knuth as a tool for literate programming. A 
WEB source file is a mixture of program code frag- 
ments (usually in Pascal) and explanatory text (usu- 
ally in TeX ) that is processed by a program called 
“weave” to produce a TpX input file which will yield 
neatly printed human-readable listings of the WEB 
source file to serve as documentation for the program, 
which is itself compiled from Pascal source code (not 
human-readable) that is created from the same WEB 
source file by the program “tangle.” Discusses is- 
sues of implementation on various machines and of 
adaptations to different languages. Includes source 
listings for “weave” and “tangle,” and many useful 
examples, tables, and details. LAS 


Languages, P. SGML: An Author’s Guide to the 


Standard Generalized Markup Language. Martin 
Bryan. Addison-Wesley, 1988, xvii + 364 pp, (P). 
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[ISBN: 0-201-17535-5] As advertized, a description 
and explanation of this ISO standard for a computer 
description of documents for the publishing indus- 
try as adopted by the U.S. Department of Defense 
and European Community Office of Official Publica- 
tions. JAS 

Computer Systems, T(14-15: 1), L*. Computa- 
tion Structures. Stephen A. Ward, Robert H. Hal- 
stead, Jr. MIT Pr, 1990, xx + 789 pp, $49.95. [ISBN: 
0-262-23139-5] This textbook was developed for use 
in the sophomore “core” course taken by all com- 
puter science and electrical engineering undergradu- 
ates at M.I.T. Covers the design and implementation 
of computers from the level of logic gates to machine 
language. A very interesting book. AO 

Computer Graphics, T(17-18: 1, 2), P. The 
Mathematical Structure of Raster Graphics. Eu- 
gene L. Fiume. Academic Pr, 1989, xvi + 221 pp, 
$29.95. [ISBN: 0-12-257960-7] A step, possibly a 
big one, towards a formal mathematical system for 
raster graphics. Requires a knowledge of mathemat- 
ics through the beginning graduate level in algebra 
and analysis as well as upper-level experience with 
graphics including illumination. JAS 


Artificial Intelligence, P. Applications of Ezpert 
Systems. Ed: J. Ross Quinlan. Addison-Wesley, 
1987, xvi + 223 pp. [ISBN: 0-201-17449-9] Papers 
from the Second Australian Conference on Applica- 
tions of Expert Systems. The focus is on the move 
from the experimental expert system to the nature 
and role of the commercial tool. JAS 


Artificial Intelligence, P. Case-Based Planning: 
Viewing Planning as a Memory Task. Kristian J. 
Hammond. Perspect. in Artif. Intellig., V. 1. Aca- 
demic Pr, 1989, xxiv + 277 pp, $34.95. [ISBN: 0-12- 
322060-2] Case-based planning is based on the idea 
of modifying an old plan rather than building a new 
one from first principles. Memory of past successes 
is used to find the best plan to modify for current 
goals, and problems and new successes are saved and 
indexed so they can be used to satisfy similar goals 
and avoid similar problems. Theory is implemented 
in the program CHEF, whose domain is Szechuan 
cooking. RBK 


Computer Science, P. Formal Development of 
Programs and Proofs. Ed: Edsger W. Dijkstra. 
Addison-Wesley, 1990, xiv + 241 pp. [ISBN: 0-201- 
17237-2] Papers from, inspired by, and related to 
the sixth (of six) Programming Institute which was 
part of the year of Programming at the University 
of Texas at Austin. The focus is on those aspects 
of programming which appear to be mathematically 
treatable. JAS 


Computer Science, P. Lecture Notes in Com- 
puter Sctence-393: Categorical Methods in Computer 
Science With Aspects from Topology. Eds: H. 
Ehrig, et al. Springer-Verlag, 1989, vi + 350 pp, 
$28.10 (P). [ISBN: 0-387-51722-7] Proceedings of 
a September 1988 international workshop held in 
(West) Berlin. JAS 


Applications, S(18), P. Mathematical Psychology 


1990] 


in Progress. Ed: Edward E. Roskam. Recent Res. 
in Psych. Springer-Verlag, 1989, viii + 385 pp, $53 
(P). [ISBN: 0-387-51686-7] Third volume of papers 
originating from recent annual meetings of the Euro- 
pean Mathematical Psychology Group. Papers par- 
titioned into three groups: theory, metatheory, and 
measurement; choice, perception, cognition, and per- 
formance; psychometrics and theory of data. Papers 
represent “work in progress” and are high in mathe- 
matical content. WE 


Applications, P, L. Mathematics in Industrial 
Problems, Part 2. Avner Friedman. IMA, V. 24. 
Springer-Verlag, 1989, xii + 183 pp, $24. [ISBN: 0- 
387-97139-4] A collection of eighteen mathematical 
problems motivated by specific “grass roots” situa- 
tions encountered in industry. A very wide range of 
problems are encountered, the common thread be- 
ing the application of mathematical method to real 
problems as opposed to the imagined problems of- 
ten encountered in textbooks. Each of the eighteen 
chapters is self-contained. The collection ends with 
brief discussions of solutions to problems raised in 
the first volume of this work. SM 


Applications, P. Lecture Notes in Statistics-61: 
The Analysis of Directional Time Series: Applica- 
tions to Wind Speed and Direction. Jens Breckling. 
Springer-Verlag, 1989, viii + 238 pp, $23.80 (P). 
[ISBN: 0-387-97182-3] The first half of the book 
concentrates on the empirical aspects of directional 
time series by performing a comprehensive analysis 
of hourly records of wind speed and direction at the 
port of Fremantle, Australia. The wind records can 
be explained by a geostrophic component and daily 
component, both related to the season and weather. 
The second half of the book looks at the problem in 
general from a theoretical standpoint using two types 
of models: the von Mises and wrapped (normal) au- 
toregressive processes. The monograph is based on 
the author’s Ph.D. thesis. JJ 


Applications, P. Theory and Practice of Geomet- 
ric Modeling. Eds: Wolfgang Strafer, Hans-Peter 
Seidel. Springer-Verlag, 1989, x + 547 pp, $89.50. 
[ISBN: 0-387-51472-4] Expanded lectures from a 
conference held in Blaubeuren, West Germany in Oc- 
tober 1988. Emphasis on bridging the gap “between 
theoretical results, performance of existing [Com- 
puter Aided Design] systems and the real problems of 
users” leads to subjects ranging from abstract differ- 
ential geometry and analysis of algorithms to descrip- 
tions of new software. Some nice four color graph- 
ics. JO 


Applications (Biological Science), P. Lecture 
Notes in Biomathematics-80: Depth Perception in 
Frogs and Toads: A Study in Neural Computing. 
Donald House. Springer-Verlag, 1989, vii + 135 
pp, $16.20 (P). [ISBN: 0-387-97157-2] Models for 
monocular and binocular cooperation in neural com- 
puting algorithms for navigation and prey localiza- 
tion, with implications for animal experiments and 
robotics. BC 
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Applications (Biological Science), P, L. Lec- 
ture Notes in Statistics-55: Estimation and Analy- 
sts of Insect Populations. Eds: L. McDonald, et 
al. Springer-Verlag, 1989, xiv + 492 pp, $54 (P). 
[ISBN: 0-387-96998-5] Consists of thirty-five papers 
presented at the symposium/workshop on The Es- 
timation and Analysis of Insect Population held in 
January 1988 in Laramie, Wyoming. The papers 
evaluate current methodological and present promis- 
ing new approaches. Four subject areas are repre- 
sented: analysis of stage-frequency data, modeling 
of population dynamics, analysis of spatial data, and 
general sampling and estimation methods. SM 


Applications (Fluid Dynamics), P. Annual Re- 
view of Fluid Mechanics, Volume 22, 1990. Eds: 
John L. Lumley, Milton van Dyke, Helen L. Reed. 
Annual Reviews, 1990, x + 592 pp, $38. [ISBN: 0- 
8243-0722-4] Essay on the history of the Reynolds 
number, followed by seventeen recent papers on top- 
ics including the design of yachts, human-powered 
aircraft, and offshore structures. JO 


Applications (Information Theory), P. Lecture 
Notes in Control and Information Sctences-129: Ad- 
vances in Communications and Signal Processing. 
Eds: W.A. Porter, S.C. Kak. Springer-Verlag, 1989, 
vi + 376 pp, $47 (P). [ISBN: 0-387-51424-4] Pro- 
ceedings of an international conference held in Ba- 
ton Rouge, Louisiana during October 1988. Contains 
thirty-four selected papers written by researchers in 
various fields. Other papers appear in Advances in 
Computing and Control(TR, February 1990), also in 
the Springer-Verlag LE NCIS series. SP 
Applications (Social Science), S(16), C, P. 
Simulating K=3 Christaller Central Place Struc- 
tures: An Algorithm Using a Constant Elasticity of 
Substitution Consumption Function. Daniel A. Grif- 
fith. Mono., No. 10. Institute of Mathematical 
Geography (2790 Briarcliff, Ann Arbor, MI 48105), 
1989, v + 103 pp, $15.95 (P). This monograph 
presents a description of and the source code for a 
program that simulates K=3 Christaller central place 
structures using a constant elasticity of substitution 
consumption function. It includes a discussion of 
the salient geographic theory and mathematical un- 
derpinnings. Also includes a list of references. The 
source code is in Basic for PC compatibles. CEC 


Reviewers 


DFA: David F. Appleyard, Carleton; SB: Steve Benson, 
St. Olaf; BC: Barry Cipra, St. Olaf; CEC: Clifton E. 
Corzatt, St. Olaf; WE: William Etter, Macalester; SG: Steven 
Galovich, Carleton; GG: George Gilbert, St. Olaf; RH: 
Rhonda Hatcher, St. Olaf; RWJ: Roger W. Johnson, Car- 
leton; JJ: Jason Jones, St. Olaf; RBK: Roger B. Kirchner, 
Carleton; JDEK: Joseph D.E. Konhauser, Macalester; LCL: 
Loren C. Larson, St. Olaf; SM: Steve McKelvey, St. Olaf; 
JO: Jeff Ondich, St. Olaf; AO: Arnold Ostebee, St. Olaf; 
SP: Samuel Patterson, Carleton; MLR: Margaret L. Reese, 
St. Olaf; MPR: Matthew P. Richey, St. Olaf; FR: Flauren 
Ricketts, Macalester; KS: Karen Saxe, St. Olaf; JS: John 
Schue, Macalester; JAS: J. Arthur Seebach, Jr., St. Olaf; LAS: 
Lynn Arthur Steen, St. Olaf; MU: Milton Ulmer, Carleton; 
PZ: Paul Zorn, St. Olaf. 
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Carus Mathematical Monograph No. 20 


THE GENERALIZED 
RIEMANN INTEGRAL 


Robert M. McLeod 
275 + xill pages. Hardbound. 


List: $24.95 MAA Member: $22.00 


Until recently the most powerful and beautiful tools of integration theory have been 
accessible only to the privileged few whose studies extended through the Lebesgue 
integral. Now a new integral, a generalization of the familiar Riemann integral, has been 
discovered which has all the power and range of the Lebesgue integral but which can 
be readily understood by anyone who has studied calculus through the sophomore level. 

The Generalized Riemann Integral is the first book-length presentation of this 
exciting new development in integration theory. Because of the clarity and organiza- 
tion of the exposition and the inclusion of exercises designed to actively engage the 
reader in the material, it is eminently suitable for use as a textbook at the advanced 
undergraduate or beginning graduate level. Furthermore, because it presents within 
a single volume results which were previously scattered throughout many research 
publications, it will undoubtedly also be of considerable interest to specialists in inte- 
gration theory. 


List of Contents: Definition of the Generalized Riemann Integral ¢ Basic Properties of the Inte- 
gral e Absolute Integrability and Convergence Theorems « Integration of Subsets of Intervals ¢ 
Measurable Functions ¢ Multiple and Iterated Integrals © Integrals of Stieltjes Type «© Compari- 
son of Integrals « Appendix: Solutions of In-Text Exercises 


Order From: 

Mathematical Association of America 
1529 Eighteenth Street NW 
Washington DC 20036 
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. Complex From our mail... 
4 Variables "Your f(z) software is one of the most inter- 


esting mathematical tools | have come 


i It's a whole new mathematical 
Complex plane or Riemann sphere. = 7°/°SS-- 
P P P world,” S.S.M. 


See how functions combine - apply 
composition, integration, differen- 
tiation - in any combination. 


"Excellent, beyond my imaginings...many 
thanks once again for reopening my eyes to 
the many beauties of mathematics. . . | had 
never dreamed it would be so readily and 
easily available." D. H. - Australia 


f(z) can handle poles and knows 
how to pick a continuous branch of 
the logarithm. "f(z) is spectacular..| only wish | had had this 
when | was back in college!" W. - Ravoh 
Generate animation as t paramter mes rrayenem 


varies. "Great program, terrific graphics, never 


seen anything quite like it before." S. P. 
Watch the actual complex " graph” as 


it rotates in four dimensions! 


Includes f(z)-Mandelbrodt For PC or Macintosh 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


TOPOLOGY OF 4-MANIFOLDS 


Michael Freedman and Frank Quinn 


One of the great achievements of contemporary mathematics is the new understanding of four 
dimensions. Michael Freedman and Frank Quinn have been the principals in the geometric and 
topological development of this subject, proving the Poincaré and Annulus conjectures respectively. 
In Topology of 4-Manifolds these authors have collaborated to give a complete and accessible account 
of the current state of knowledge in this field. 
Princeton Mathematical Series, 39. 79 illustrations. 
Cloth: $49.50 ISBN 0-691-08577-3 


SPIN GEOMETRY 


H. Blaine Lawson, Jr., and Marie-Louise Michelsohn 


This book offers a systematic and comprehensive presentation of the concepts of a spin manifold, 
spinor fields, Dirac operators, and -genera, which, over the last two decades, have come to play a 
significant role in many areas of modern mathematics. Since the deeper applications of these ideas 
require various general forms of the Atiyah-Singer Index Theorem, the theorems and their proofs, 
together with all prerequisite material, are examined here in detail. 
Princeton Mathematical Series, 38 
Cloth: $55.00 ISBN 0-691-08542-0 


Princeton University Press 


mj} 41 WILLIAM ST. e PRINCETON, NJ 08540 ¢ (609) 258-4900 
ORDERS 800-PRS-ISBN (777-4726) 


FIRST CONCEPTS OF TOPOLOGY: 
The Geometry of Mappings of Seg- 
ments, Curves, Circles, and Disks 


W.G. Chinn and N.E. Steenrod 


This clear and winning little book, for 
readers willing to come to genuine 
grips with the idea of a mathematical 
proof, presents topology . . . as math- 
ematicians see it. The parlor tricks are 
gracefully alluded to here and there, 
but they are distinctly for after hours. 
The center of interest is the stuff itself: 
powerful notions of set theory are. . . 
exploited to define open sets and their 
coverings, and from them [to prove] 
the key theorems. .. One cannot any 
longer doubt that a single stroke of a 
knife exists that divides any irregular 
ham sandwich so that ham and both 
bread slices can be shared with perfect 
fairness by two consumers. 


Philip Morrison in Scfentific American 
Contains over one hundred and fifty prob- 
lems and solutions. 

160 pp., 1966, ISBN 0-88385-61 8-2 

List: $10.00 MAA Member: $8.00 

Catalog Number NML-18 


FROM PYTHAGORAS TO EINSTEIN 
K.O. Friedrichs 


Starting with area and dissection proofs of 
the Pythagorean theorem, Friedrichs pro- 
ceeds gently, step by step, to cover vectors, 
coordinates, elastic and inelastic impacts, 
and relativistic space-time—ending with a 
derivation of a contemporary formula rival- 
ing the Pythagorean theorem in fame, # = 


mc’. 


88 pp., 1965, ISBN 0-88385-616-6 
List: $9.50 MAA Member: $7.60 
Catalog Number NML-16 


GEOMETRY REVISITED 
H.S.M. Coxeter and S.L. Greitzer 


After seeing a proof that a segment has 
a unique midpoint, students may wonder 
whether there are any interesting theorems 
in geometry. If either they or their teachers 
get a hold of this book, they will learn that 
there are many beautiful and nontrivial theo- 
rems in geometry. Among those found here 
are the theorems of Ceva, Menelaus, Pap- 
pus, Desargues, Pascal, and Brianchon. A 
nice proof is given of Morley’s remarkable 
theorem on angle trisectors. The transfor- 
mational point of view is emphasized: re- 
flections, rotations, translations, similarities, 
inversions, and affine and projective trans- 
formations. Many fascinating properties of 
circles, triangles, quadrilaterals, and conics 
are developed. 


193 pp., 1967, ISBN 0-88385-619-0 
List: $11.75 MAA Member: $9.50 
Catalog Number NML-19 


The very lucid presentation takes the 
reader from the elementary problems 
of plane Euclidean geometry to the fun- 
damental concepts of non-Euclidean 
geometry . The book is rich in 
remarkable facts and thereby is very 
effective in promoting the significance 
and the value of geometry in mathe- 
matical teaching, a promotion which is 
very necessary. . . The always origi- 
nal developments use very simple tools 

. and soon proceed to higher con- 
figurations. 


K. Strubecker in Mathematical Reviews 
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In as few as 20 minutes! 
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TESTS: 
NU 


COLLEGE-LEVEL MATHEMATICS 


This fall you can place your entry-level students in advanced mathematics courses in 
as few as 20 minutes per student! There’s no quicker, easier, or more precise method 
to find out what they’ve mastered and whether they’re ready for intermediate 
algebra, college algebra, pre-calculus, or first-level calculus. 


Now—preview this exciting breakthrough in the College Board’s 
Computerized Placement Tests! 


Here is your opportunity to preview a set of examination disks—and experience a 
powerful innovation in computer adaptive testing called Seamless Serial Testing. 

It tests students across a wide range of concepts with fewer questions than conven- 
tional tests, and provides more accurate measurement than conventional tests. Based 
on their answers, individual students are routed automatically to more advanced or 
more basic material. You can administer this new test module independently or 
combine it with lower-level CPTs in Arithmetic and Elementary Algebra. Operates 
on IBM-PC, XT, AT, PS/2 and compatibles, including portables and laptops. 


Order your set of examination disks today! 


See for yourself how College-Level Mathematics and the other four CPTs in Read- 
ing, Writing, Arithmetic, and Elementary Algebra can handle all your institution’s 
placement needs in a fraction of the time it now takes. Your examination set is just 
$25 (credited toward your first order). You’ll also receive a FREE diskette 
demonstrating Accuplacer, the College Board’s exclusive computerized assessment, 
placement, and management software. Please specify 5 1/4" or 3 1/2" disks. 


m4 The College Board 
| Attn: L.J. Abernathy 
45 Columbus Avenue 
New York, New York 10023-6992 


Copyright © 1990 by the College Entrance Examination Board. College Board and the acom logo are registered trademarks of the College 
Entrance Examination Board. Accuplacer, College Board Computerized Placement Test, and Seamless Serial Testing are trademarks owned 
by the College Entrance Examination Board. 


The International Mathematical Olympiads, 
1959-1977 and 1978-1985 provide a com- 
pilation of problems and solutions of arrest- 
ing ingenuity; all accessible to secondary 
school students. The alternative solutions 
are particularly interesting because they 
show that there are many ways to solve a 
problem. 


INTERNATIONAL MATHEMATICAL 
OLYMPIADS, 1959-1977 


Compilation and solutions by S.L. Greitzer 


116 problems 

204 pp., 1978, ISBN 0-88385-627-1 
List: $11.50 MAA Member: $9.50 
Catalog Number NML-27 


INTERNATIONAL 
MATHEMATICAL 
OLYMPIADS 1959.16 
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INTERNATIONAL MATHEMATICAL OLYMPIADS 


INTERNATIONAL MATHEMATICAL 
OLYMPIADS , AND FORTY 
SUPPLEMENTARY PROBLEMS, 
1978-1985 


Compilation, solutions, and 40 additional 
problems by Murray S. Klamkin 


88 problems in all 

150 pp., 1986, ISBN 0-88385-631 -X 
List: $12.95 MAA Member: $10.50 
Catalog Number NML-31 


This sequel of problems used in the In- 
ternational Mathematical Olympiad cer- 
tainly matches the description ‘meant 
to challenge.’ The problems contained 
in this publication are the most chal- 
lenging | have seen. For the seri- 
ous problem solver, this is an excellent 
collection of truly challenging problems 
designed to give instruction in the use 
of sophisticated methods and modes of 
attack. 


Richard J. Wylie in The Mathematics Teacher 


Both international Olympiad books avail- 
able as a package: 


List: $21.00 MAA Member: $17.50 
Catalog Number IMO 
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GARDNER’S WHYS AND WHEREFORES 


Martin Gardner 


Cloth $19.95 272 pages 


Now in Paper— 


lvar Ekeland 
With a Foreword by Felix Browder 


Paper $8.95 160 pages 


John Allen Paulos 


Joseph Williams, New York Times 
Paper $10.95 124 pages 


Math & More 


“Highly engaging and entertaining. . 


The well-known wit and encyclopedic knowledge of Martin Gardner are on display as he 
tackles topics from the “anthropic principle” to the idiosyncrasies of Allan Bloom, critiques 
fantasy writers from Lord Dunsany to Ray Bradbury, and questions whether physicists 
are really on the verge of discovering Everything. 


MATHEMATICS AND THE UNEXPECTED 


. . Ekeland uses celestial mechanics, Poincare 
maps for the Henon system, the cat map, the Lorenz model, and Sitnikov’s three-body 
problem all to good effect.’-—Ronald F. Fox, American Journal of Physics 


5 halftones, 39 line drawings 


MATHEMATICS AND HUMOR 


“[An] illuminating account of the relationship between humor and mathematics.’— 


6 line drawings, 29 figures 


University of Chicago Press 


5801 South Ellis Avenue Chicago, IL 60637 


Math T, exts 


ESSENTIAL RESULTS OF 
FUNCTIONAL ANALYSIS 
Robert J. Zimmer 


This slim book concisely presents fundamental 
ideas of functional analysis, providing essential 
results from the many domains of mathematics 
and mathematical physics to which functional 
analysis makes contributions. Zimmer sum- 
marizes measure theory and the elementary 
theory of Banach and Hilbert spaces, then discus- 
ses topological vector spaces, seminorms defin- 
ing them, and natural classes of linear operators. 
He then presents basic results for a wide range of 
topics: convexity and fixed point theorems, com- 
pact operators, compact groups and their repre- 
sentations, spectral theory of bounded operators, 
ergodic theory, commutative C*- algebras, Fourier 
transforms, Sobolev embedding theorems, dis- 
tributions, and elliptic differential operators. 

Paper $14.95 (est.) 168 pages (est.) 

Library cloth edition $34.95 (est.) 

Chicago Lectures in Mathematics series 


Now in Paper 


THE THEORY OF LINEAR 
ECONOMIC MODELS 
David Gale 


This outstanding introduction to mathematical 

economics is now available in a paperback edition. 
“An excellent introduction to linear program- 

ming, Neumann models, and matrix games, as 

well as to the theory of linear inequalities which is 

fundamental for these subjects ."— 

Lionel W. McKenzie, University of Rochester 


Paper $14.95 356 pages 


University of Chicago Press 
5801 South Ellis Avenue Chicago, IL 60637 


MEGA 
Random Walks and 
Electric Networks, 


by J. Laurie Snell and Peter Doyle 


xiii + 159 pages. Hardbound 
List: $25.00 MAA Member: $19.00 


In this newest addition to the Carus Mathematical Monographs, the authors 
examine the relationship between elementary electric network theory and ran- 
dom walks, at a level which can be appreciated by the able college student. We 
are indebted to them for presenting this interplay between probability theory 
and physics in so readable and concise a fashion. 

Central to the book is Polya’s beautiful theorem that a random walker on an 
infinite street network in d-dimensional space is bound to return to the starting 
point when d= 2, but has a positive probability of escaping to infinity without 
returning to the starting point when d= 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
networks, establishing the connection between the electrical concepts of cur- 
rent and voltage and corresponding descriptive quantities of random walks re- 
garded as finite state Markov chains. Part II deals with the idea of random walks 
on infinite networks. 


Table of Contents 


Part I: Random Walks on Finite Networks 
Random Walks in One Dimension 
Random Walks in Two Dimensions 


Random Walks on More General Networks 
Rayleigh’s Monotonicity Law 


Part II: Random Walks on Infinite Networks 
Pélya’s Recurrence Problem 
Rayleigh’s Short-Cut Method 
The Classical Proofs of Polya’s Theorem 
Random Walks on More General Infinite Networks 


Order From: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C, 20036 
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ATHEMATICS 


Queen&Sernant 
of Science 


ERIC TEVIPLE BELL 


MATHEMATICS: 
QUEEN AND 
SERVANT OF SCIENCE 


E.T. Bell 


An absorbing account of pure and applied 
mathematics from the geometry of Euclid to 
that of Riemann and its application in Ein- 
stein’s theory of relativity. The twenty chap- 
ters treat such topics as: algebra, number 
theory, logic, probability, infinite sets and 
the foundations of mathematics, rings, ma- 
trices, transformations, groups, geometry, 
and topology. Republished in 1987 with cor- 
rections and an added Foreward by Martin 
Gardner. 


454 pp., ISBN 0-88385-446-3 
Paperbound 


List: $15.95 MAA Member: $11.95 
Catalog Number QAS 


The book deserves a place in today’s 
market. It is amuch more popular work 
than most histories of the subject, and 
that is exactly what makes it acces- 
sible to students as well as to non- 
mathematicians. It is rewarding read- 
ing for. . . teachers and students at 
all mathematical levels. 


Morris Kline of The Courant Institute 


Order from: 


| WANT TO BE A MATHEMATICIAN 
An Automathography in Three Parts 


Paul R. Halmos 


This is a book to be read with interest 
by all those who know, or might want to 
know, what mathematicians and mathemat- 
ical careers are like. Paul R. Halmos be- 
gins with his school days and carries the 
reader swiftly through a career that has sus- 
tained itself at a high level since his first 
post-doctoral days at the Institute for Ad- 
vanced Study in 1939, where he worked 
with John von Neumann among others. Still 
going strong in 1988, Halmos has con- 
tributed much to logic, operator theory, er- 
godic theory, and the literature in general. 


442 pp., 1988, Paper, ISBN 0-88385-445-7 
List: $18.00 MAA Member: $15.00 


i | Catalog Number IWM 


It is a truly unique book, which nobody 
but Paul Halmos could have written. 
| think it will be a classic. 


Constance Reid 
The book is exciting, witty, and well 
worth the time invested in its study. It 


communicates what it means to be a 
mathematician. 


John Dossey in The Mathematics Teacher 


The Mathematical Association of America 


<< y, 2 1529 Eighteenth Street, N.W. 


2 s Washington, D.C. 20036 
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Studies in the History of Mathematics, Esther R. Phillips, Editor. 


Volume #26 in MAA Studies in Mathematics 
320 pp., Hardbound, ISBN-0-88385-128-8. 
Catalog Number - MAS-26 

List: $36.50 MAA Member: $28.00 


Esther R. Phillips has brought together 
a collection of articles showing the 
sweep of recent scholarship in the his- 
tory of mathematics. The material cov- 
ers a wide range of current research 
topics: algebraic number theory, geom- 
etry, topology, logic, the relationship 
between mathematics and computing, 
partial differential equations, and alge- 
braic geometry. This volume will show 
teachers and their students the histori- 
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The Geometry of Harmonic Oscillators* 


KENNETH R. MEyER, University of Cincinnati 


KENNETH R. MEveEr: I received my Ph.D. from the University of Cincinnati 
in 1964, was an Assistant Professor at Brown University, was an Associate 
Professor at the University of Minnesota and am Professor at the University 
of Cincinnati. I learned the geometry of harmonic oscillators from the late 
Charlie Conley. 


I. Introduction. About a year ago an engineering colleague dropped by my 
office to ask some questions about the analysis of a system of differential equa- 
tions. After hearing the problem, I said he needed to know about the geometry of 
pairs of harmonic oscillators. He looked a little hurt at my suggestion, but 
consented to indulge me for a while. After I lectured at the chalk board for about 
an hour, he admitted that before my lecture he had thought he understood the 
harmonic oscillator, but now it was clear that he hadn’t. 

Although an introduction to the harmonic oscillator comes early in the study of 
the calculus and it is extensively used throughout physics, the beautiful geometry of 
harmonic oscillators is seldom given in differential equations texts of any level. 
Even some researchers in oscillation theory are unaware of the interrelationship 
between the Hopf fibration, the ergodic flow on a torus, the rational approxima- 
tions of irrational numbers, the topology of the 3-sphere and the lowly harmonic 
oscillator. The material presented below has been known since the time of 
Poincaré and Birkhoff and so nothing given here is new. It is essential in Conley’s 
analysis of periodic solutions of the restricted three body problem [4], Weinstein’s 
normal mode theorem [11] and the generalizations of these theorems found 
in [5, 7, 8]. 

First I will present the geometry of a single harmonic oscillator from a 
dynamical systems point of view with particular emphasis on the use and interpre- 
tation of polar coordinates. Then a model for the 3-sphere as an identification 
space is obtained by using polar coordinates for a pair of harmonic oscillators. 
Many basic geometric facts can be gleaned from this simple model of the 3-sphere. 
The Hopf fibration, the knotting and linking of solutions, and the irrational flow 
on the torus are a few of the topics presented. The classical regularization of the 
Kepler problem (the central force problem) is given as an example of a non-trivial 
physical problem which can be reduced to a pair of harmonic oscillators. Thus the 
geometry discussed here sheds some light on the classical two body problem. 


Il. One Harmonic Oscillator. From the point of view of dynamical systems 
theory, a differential equation defines a flow on a space and the solutions are 
curves in that space. Historically, the problems come from classical dynamics and 
so the independent variable is ‘time’ and will be denoted by t. Newton’s notation 
for fluxions persists with 


. a . a? 
= — and ~ (1) 


This work was supported by a grant from ACMP/DARPA administered by NBS. 
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Thus, the harmonic oscillator, 
¥+w*x =0, (2) 


where w is a positive constant, is written as a system of first order equations by 
introducing the new variable u = x/w, to become 


xX =wu 
u = —ox. (3) 


The variable u is a scaled velocity and so the x,u plane is essentially the 
position-velocity plane, or the phase space of physics. The basic existence and 
uniqueness theorem of differential equations asserts that through each point 
(X,,U,) in the plane there is a unique solution which passes through this point at 
any particular epoch ft). These solutions are given by the formula 


X(t, to, Xo, Ug) cos@(t—ty)) sinw(t — ty) \[ xo (4) 
The solution curves are parameterized circles. The reason that one introduces the 
scaled velocity instead of using the velocity itself, as is usually done, is so that 
the solution curves become circles instead of ellipses. In dynamical systems the 
geometry of this family of curves in the plane is of prime importance. 

The system admits an integral (energy in physical problems) 


T=x*+u? (5) 


which is constant along the solutions of (3), since J = 2xt + 2uu = 0 by virtue of 
(3). Since a solution lies in the set where J = constant, a circle in the x,u plane, 
the integral alone gives the geometry of the solution curves in the plane. See 
FIGURE 1. 


Fic. 1 
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If we introduce polar coordinates r? = x? + u*, 6 = tan” 'u/x then the equa- 
tions (3) become 


r=Q, 0 = -a. (6) 


Here again we see that the solutions lie on circles about the origin since, 7 = 0, 
and that the circles are swept out with constant angular velocity. 

In many cases the harmonic oscillator is only a first approximation to a physical 
problem. For example one gets a harmonic oscillator from the pendulum equation 
by making the small angle approximation sin @ = 6. Some physical problems are 
better approximated by nonlinear oscillators such as 


r=0, 6=-wtart:::, (7) 
where a # 0 is a constant. In such cases the solution curves will still be circles, 
since r = 0, but the angular frequency, —w +ar+-::-, varies from circle to 


circle. 


II. Pairs of Oscillators. Consider a pair of harmonic oscillators 


¥t+ow*x=0, yptp’y=0 (8) 
which as a system becomes 
x= wu y= pv 
u = —wXx p= — py. (9) 


In polar coordinates it becomes 


r= p=0 
d= —w b=x-—p (10) 
and it admits integrals 
T=x*+ue J=y?4+v". (11) 


In many physical problems these equations are only the first approximation and 
the full system does not admit the two individual functions J and J as integrals but 
does admit the sum 


E=J+J=x*+ur*+y*+v* (12) 


or total energy as an integral. In this case the 3-sphere, S? = E~'(1), is an 
invariant set for the flow, (i.e. a solution which starts in this set stays in this set.) 
Think for example of a pea, of unit:mass, rolling around in a bowl above the x-y 
plane with its minimum at the origin. If the only force acting on the pea is the 
constant force due to gravity, the potential, V(x, y) of the pea would have a 
minimum at the origin. Let V have a Taylor expansion of the form 2V(x, y) = x? 
+y? +--+. Then the equations of motion of the pea are ¥ = —V.= —x+---, 
y= —-V,=—-y+--- and total energy is (x7 + y*)/2 + (x7 + y* + +++ )/2. Thus 
the linearized system at the minimum would be of the form (9) with w = uw = 1. 

Consider the general solution through ro, Po, 99, @) at epoch t = 0. The solu- 
tions with rp = 0 and py, > 0 or py = 0 and r, > 0 lie on circles and correspond to 
periodic solutions of period 27/u and 27/w, respectively. These periodic solu- 
tions are special and are usually called the normal modes. 
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The set where r =r, > 0 and p = p, > 0 is an invariant torus for (9) or (10). 
Angular coordinates on this torus are @ and ¢ and the equations are 


6 = —W db = ~H, (13) 
the standard linear equations on a torus. 

If 6/u is rational then w = pr and w = qr where p and q are relatively prime 
integers. In this case the solution of (13) through @), ¢) at epoch t= 0 is 
A(t) = 0) — wt, b(t) = by — mt and so if T = 27/7 then O(T) = 0, + p2m and 
Q(T) = by + G27. That is, the solution is periodic with period T on the torus (see 
FIGURE 2a) and this corresponds to periodic solutions of (9). 


FIG. 2 


If 0/p is irrational then none of the solutions are periodic. In fact the solutions 
of (13) are dense lines on the torus (see FIGURE 2b) and this corresponds to the 
fact that the solutions of (9) are quasi-periodic but not periodic. See Coddington 
and Levinson [3] or Hale [6] for a discussion of the irrational flow on the torus. 

We can use polar coordinates to introduce coordinates on the sphere provided 
we are careful to observe the conventions of polar coordinates: i) r > 0, ii) @ is 
defined modulo 27 and iii) r = 0 corresponds to a point. That is, if we start with 
the rectilinear strip r > 0,0 < 6 < 27 (FiGuRE 3a), identify the 6 = 0 and 6 = 27 
edges to get a half closed annulus (FiGuRE 3b), and finally identify the circle r = 0 
with a point, then we have a plane (FiGuRE 3c). 

Starting with the polar coordinates, r, 6, p, @ for R* we note that on the 
3-sphere E =r? + p* = 1 so we can discard p and have 0 <r < 1. We will use 
r,0, @ as coordinates on S°. Now, r;6 with 0 <r < 1 are just polar coordinates for 
the closed unit disk. For each point of the open disk there is a circle with 
coordinate @ (defined mod 277), but when r = 1, p = 0 so the circle collapses to a 
point over the boundary of the disk. The geometric model of S° is two solid cones 
with points on the boundary cones identified as shown in Ficure 4. Through each 
point in the open unit disk with coordinates r, @ there is a line segment (the 
dashed line) perpendicular to the disk. The angular coordinate ¢@ is measured on 
this segment; @ = 0 1s the disk, @ = 7 is the upper boundary cone, @ = —7 is the 
lower boundary cone. Each point on the upper boundary cone with coordinates 
r, 0,6 = 77 is identified with the point on the lower boundary cone with coordinate 
r,0,@ = —v7r. From this model follows a series of interesting geometric facts. 
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r=0 


identify 


FIG. 4 
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For a, 0 < a < 1, the set where r = a is a 2-torus in the 3-sphere and for a = 0 
or 1 the set r= a is a circle. Since r is an integral for the pair of oscillators these 
tori and circles are invariant sets for the flow defined by the harmonic oscillators. 
The two circles, r = 0,1 are periodic solutions, called the normal modes. The two 
circles are linked in S°, i.e. one of the circles intersects a disk bounded by the 
other circle in an algebraically non-trivial way. The circle where r= 1 is the 
shaded disk in FiGurE 5 and the circle r = 0 intersects this disk once. It turns out 
that the number of intersections is independent of the bounding disk provided one 
counts the intersections algebraically. See Rolfsen [10]. 


Fic. 5 


Consider the special case when w = uw = 1. In this case every solution is 
periodic and so its orbit is a circle in the 3-sphere. Other than the two special 
circles, on each orbit as @ increases by 277 so does @. Thus each such orbit hits the 
open disk where ¢ = 0 (the shade disk in FiGurE 5) in one point. We can identify 
each such orbit with the unique point where it intersects the disk. One special 
orbit meets the disk at the center and so we can identify it with the center. The 
other is the outer boundary circle which is a single orbit. When we identify this 
circle with a point, the closed disk with its outer circle identified with a point 
becomes a 2-sphere. Thus: 


Proposition. The 3-sphere, S°, is the union of circles. Any two of these circles are 
linked. The quotient space obtained by identifying each circle with a point, is a 2-sphere 
(the Hopf fibration of S°). 


Let D be the open disk @ = 0, the shaded disk in Fiure 5. The union of all 
the orbits which meet D is a product of a circle and a 2-disk, so each point not on 
the special circle r = 1 lies in an open set which is the product of a 2-disk and a 
circle. By reversing r and p in the discussion given above the circle where r = 1 
has a similar neighborhood. So locally the 3-sphere is the product of a disk and a 
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circle, but the sphere is not the product of a two manifold and a circle. (The 
sphere has trivial fundamental group but such a product would not. See 
Rolfsen [10].) 

When w = p and uw =q, p and g relatively prime integers all solutions are 
periodic and the 3-sphere is again a union of circles but it is not locally a product 
near the special circles. Arnold [1] calls this decomposition into circles near the 
special circles the Seifert foliation and shows its connection with bifurcation 
theory. The nonspecial circles are p-q torus knots. They link p times with one 
special circle and g times with the other. 

The linking statements follow by a slight extension of the ideas of the previous 
proposition. A p-q torus knot is a closed curve which wraps around the standard 
torus in R° in the longitudinal direction p times and in the meridional direction q 
times. If p and g are different from 1 the knot is nontrivial. We cannot go into 
details here, but FiGuURE 6 shows that the 3-2 torus knot is the classical trefoil or 
clover-leaf knot. In FiGuRE 6a the opposite sides of the squares are to be 
identified so that it represents a torus. The line of slope 2/3 in 6a wraps around 
the torus 3 times in one direction and 2 times in the other. This line is the 
important object so henceforth the sides of the torus will be suppressed in the 
drawings. Think of rolling the square in 6a into a cylinder by rolling the top half of 
6a back of the bottom half. Now the line wraps over and under the cylinder and 
has two end points at each end of the cylinder. Join the end points with additional 
curves to get FiGuRE 6b. Note that the crossing patterns for the curves in FIGURES 
6b and 6c are the same and so represent the same curve. FiGurE 6c is the standard 
representation of the trefoil knot. See Rolfsen [10] for more information on knots. 


FIG. 6a 


FIG. 6b FIG. 6c 
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Sometimes one needs to consider nonlinear oscillators. A simple and somewhat 
typical nonlinear system 1s 


?=0 6 =0 
J=wtar d=p+ Bp, (14) 


where a and ®# are constants. In this case we still have the special circles, the 
decomposition into tori, r = constant as described above, and the flow on the tori 
is still linear. However, now the slope of the linear flow on the tori varies from 
torus to torus. Sometimes all orbits are dense and sometimes all orbits are 
periodic. 


IV. The Kepler Problem. As stated above, pairs of linear oscillators are often 
the first approximation to interesting nonlinear physical problems, but also they 
completely solve the Kepler problem of celestial mechanics. Consider a body of 
mass M fixed at the origin in the plane and another free body of mass m whose 
position in the plane is given by the vector g. Assume that the free body is 
attracted to the fixed body by the force given by Newton’s law of gravity. Then the 
equation of motion for the free body is 

. __—GMq (15) 
q=-—_3.> 
lai? 


where G is the universal gravitational constant. By changing the time scale if 
necessary the constant GM can be taken as 1. There are many ways to solve this 
system of differential equations, but I think one of the most informative ways is the 
regularization procedure of Levi-Civita. This method removes the singularity at the 
origin and reduces the equation (15) to a pair of harmonic oscillators with equal 
frequencies. 

Consider g as a complex number, so multiplication makes sense and the norm 


| - || can be replaced with absolute value |: |. Since the bounded orbits of the 
problem occur for negative energy, write the energy integral as 
Iq\? — 2/\q| = —C, (16) 


so C is the negative of the total energy. The Levi-Civita change of coordinates is 
q = w* together with a change of time dt = 4|w|? dr. The transformation from w 
to gq is a 2 to 1 transformation except at the origin. After some careful computa- 
tions the equation of motion in the new coordinates becomes 


dy 


Te + 4Cw = 0, (17) 
a pair of harmonic oscillators with frequencies 4¢C when C > 0. The computations 
are carried out in full in Szebehely [9]. 

Thus the flow of the Kepler problem (15) on a negative energy level is the 2 to 1 
projection of the flow on the 3-sphere defined by (17) where the projection is the 
squaring map. In the new coordinates w = 0 corresponds to a collision of the two 
bodies but the equations (17) do not have a singularity there. Thus these coordi- 
nates ‘regularize the collision singularity’. 
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1. Card games and dynamical systems. In the past thirty years the abstract 
qualitative theory of ordinary differential equations and of difference equations 
has been cast into the theory of dynamical systems with continuous time and with 
discrete time, respectively. In general, in the theory of dynamical systems the state 
space is a smooth finite-dimensional manifold. Depending on the dimension of this 
manifold, its topological structure, and whether time is continuous or discrete, the 
qualitative behavior of trajectories may be more or less complicated. For continu- 
ous time, in dimension one, every bounded trajectory converges to a stationary 
point. On the sphere of dimension two, the theorem of Poincaré and Bendixson 
ensures, that (for vector fields with finitely many hyperbolic stationary points) limit 
sets are points or periodic orbits or cyclic chains of heteroclinic orbits connecting 
saddle points [7]. For dimensions greater than three the possible behaviors of 
trajectories are very complicated. Dynamical systems with discrete time already 
show very complicated behavior in one: dimension (comparable to differential 
equations in three-dimensional space). However, for monotone systems every 
bounded trajectory converges to a stationary point. In nonmonotone systems cycles 
appear, typically of order 2 or a power of 2 (period-doubling, Feigenbaum 
sequence [1], [2]) or more complicated behavior (Sharkovskij families of periodic 
orbits, “chaotic” behavior [2], [6]). In the following we shall study even more 
discrete dynamical systems, namely systems on the set of integers Z. Of course, 
supplied only with the discrete topology, one loses much structure, but general 
questions about trajectories and their asymptotic behavior, about the structure of 
limit sets, the number of periodic orbits etc. remain the same. One is even 
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rewarded by the appearance of the new concept of stop time, which in some sense 
replaces the rate of convergence to a stationary point. The most famous dynamical 
system on Z is the 3x + 1-game, a very simple recursion formula for which the 
convergence problem is unsolved for more than fifty years despite many efforts 
(see [5] and the references therein). Here we study a much simpler class of 
dynamical systems on Z which are related to the floor and ceiling functions. The 
point of depart towards these systems is located in a quite different field of 
mathematics, it is a combinatorial problem related to a certain game of cards. This 
game seems, indeed, to be mathematical folklore, and is described in a book of 
that title by Gancev et al. [3], for a special case where certain divisibility conditions 
are satisfied (which are not satisfied for the practical cases of 32 or 52 cards). 
Nevertheless in [3] a recurrence relation for a bound on the stop time in terms of 
the floor function is given. We observe that this recurrence relation can be solved 
by using an obvious functional equation for the floor function. Even more, using 
the functional equation, the exact stop time can be determined in the general case. 
What seems equally interesting, is that the functional equation characterizes the 
floor and ceiling functions. Of course the result on card games and on recurrence 
relations could be formulated without reference to dynamical systems, but the 
language of dynamical systems seems to be most appropriate since the discrete 
analogon of bifurcation to periodic orbits occurs. 

Consider a well-known playing card trick. Player A has a deck of 32 cards. 
Player B remembers a card, A deals the cards, one by one, cyclically into three 
packs of 11, 11, 10 cards. B indicates which pack contains his card. A collects the 
cards into one deck, whereby he puts the indicated pack between the other two in 
such a way that a pack with 11 cards goes on top, deals again, and so on. After four 
steps of this procedure, the remembered card is known to A, it assumes the 
seventeenth position in the deck. There are several variants of this trick depending 
on how the cards are collected into one deck and how they are redistributed. We 
shall consider generalizations of two versions of this game which essentially differ 
in their mathematical properties. 

Now we give an exact description of the first of these procedures. The cards are 
in the deck, face down, numbered from 1 to 32 


— 1 
— 2 
J 

1. — 31 

— 32 
They are distributed into three packs, face up, 
— 31 — 32 

— — — 30 
T T T 
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If the indicated pack is the first, then the second is taken first, reversed, then the 
first is packed underneath, reversed, finally the third, to give 


— 2 
J 

— 32 

— 1 
J 

— 31 

— 3 
U : 

— 30 


a face down deck with the 25th card in position 20. It is easy to see that for each 
card of the indicated pack the new position x, is obtained from the previous 
position x, by the formula 


x, =11+ | = =| + =|. (1) 


where |x], for real x, denotes the ceiling function, i.e., [|x| is the smallest integer 
number not less than x. (The floor, or ENTIER function, i.e., the largest integer 
not greater than x, will be denoted by |x], observe [x] = —|-—x].) Hence the card 
game induces a dynamical system on the set {1,2,...,32}. The system depends on 
a parameter, the number of packs, in this case 3. In a general version considered 
here the game depends on three parameters N,b, p. There are N cards in the 
deck, these are numbered from 1 to N from top, face down. They are distributed 
into b packs, face up. After reversing the packs, p cards are put on top of the 
indicated pack and the remaining cards are put underneath. Then instead of (1) 
the new position x, of the card x, is given by 


Xo 


X,=pt =] (2) 


Hence the parameter N is not essential. The general procedure can be interpreted 
as a dynamical system on Z 


Xper=f(xs), n=0,1,2,..., (3) 


where f: Z — Z is given by 
x 


fxy=a + |E, (4) 


for given a € Z,b EN \ {I}. 
In the second variant the player will deal cards from the top of a face up deck 
and will never reverse them. It turns out that this apparently simpler procedure is 
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mathematically more difficult to analyze. The arrangement 


— 1 
t 
— 32 
becomes 
— 31 — 32 
— — — 30 
i) i) i) 
— 1 — 2 — 3 


Now player A puts a pack of 11 underneath, the indicated pack in the middle, and 
the last one on top. If the indicated pack is the first, then the result is 


— 30 
t 

— 3 

— 31 
t 

— 1 

— 32 
t 

— 2 


The 25th card moves into position 13. The distribution procedure reverses the 
order in each pack, and this reversed order is retained when the packs are 
collected into the deck. We shall see that this reversal leads to a different behavior 
of the iteration. The general procedure is as follows. A deck of N cards, face up, is 
numbered from 1 to N. The cards are distributed from the top into b packs. The 
pack that contains the desired card is taken first, then p cards are put underneath 
and the remaining cards on top. Then the new position of the card at x, is given 
by 


u-N+1-p-|>I. (5) 


470 PETER EISELE AND K. P. HADELER [June-July 
In the above example N = 32, b = 3, p = 11. Hence, games of cards with reversal 
lead to dynamical systems of the form 

Xnarp =S(X,)s n=0,1,2,..., (3) 


f(x) =a - [5 (6) 


Some other variants of the card games may at first sight lead to recursion formulas 
containing the floor instead of the ceiling function. However, notice that the 
dynamical systems 


Xp : 
= + —_— 
treat |] (7) 
and 
Yn 
Yer nat || (8) 
are equivalent (conjugated): put x, = —y, and use [x] = —|—x]. Thus one can 


always replace the floor by the ceiling function and vice versa, but one cannot 
change the minus sign into a plus sign. We shall see later that the sign distin- 
guishes two very different classes of problems. 

The following questions arise: Are there stationary points of the dynamical 
systems (3)(4) and (3)(6) (fixed points of the mapping f)? How many? Does every 
trajectory converge to a stationary point? To which stationary point does a given 
trajectory converge? Are there cycles? What is the stop time for given x,? 

We shall prove that in the dynamical system (3)(4) there are at most two 
stationary points, every trajectory converges to one of these. We give an explicit 
expression for the stop time. As a consequence we know for the general game of 
cards how many steps are necessary to determine the unknown card. For the 
system (3)(6) we can distinguish the cases with stationary points from those with 
periodic orbits of order two. The key fact of the trick with 32 cards and 3 packs is 
that, in the first version, every trajectory with x, € {1,...,32} reaches the only 
stationary point x = 17 after at most 4 steps. 


2. Dynamical systems on Z. First we consider a general dynamical system (3). 


LEMMA 1. Let f: Z — Z be any function with the following properties. 


i) x<y = f(x) <f(y). 
ii) 4x, x © Z_ such that f(x) <x holds forx > x, 
f(x) > xholds forx <x. 


Then every trajectory of the dynamical system (3) is bounded and converges to a 
stationary point. 


Proof. If x9 is such that x, > x, then {x,} is nondecreasing. If x, is such that 
x, <x, then {x,} is nonincreasing. Hence {x,} is nondecreasing for x, < 0 and 
nonincreasing for x, large. Hence for all x, the sequence {x,} is monotone and 
bounded. Since it is integer valued, it becomes eventually constant, and the limit is 
trivially a stationary point. O 
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For typographical convenience we write 


d=[], =]. (9) 
The function 3: R > Z, (and also 3: R — Z) has the following properties (see 
Sect. 3) 


i) For allx € R, c € Z holds 


O(x +c) = (x) +. (10) 
ii) Forallx € R,n EN holds 
1 
o{—o(ns)| = 9(x). (11) 
iii) 3 is nondecreasing. (12) 
Now we start the investigation of the dynamical system 
Xni1 =f (x,)s n=0,1,2,..., (3) 
f(x) =at+ dB(x/b). (13) 


THEOREM 2. Consider the dynamical system (3)(13). 
i) Ifa # Omod(b — 1) then 


z-9(-~-) (14) 


is the only stationary point, and every trajectory converges monotonely to x. 
ii) Ifa = 0 mod(b — 1) then 
- ab - - 
t= 7 X,=xX,+1 (15) 
are the only stationary points, and every trajectory converges monotonely to the nearest 
stationary point. 
Proof. Let 
a=p(b—-1)+ 9, O0<p<b-1 
and assume that 
x=mb++r, 0O<r<b, meZ, 
is a Stationary point. Then 
mb+r=pu(b-1)+p+m4+ d(r/d) 


or 
(m —1)(b-1) =p—r+ d(r/b). 


Since p < b — 1 and r < b, this inequality implies m = pw and 
r—O(r/b) =p. 
Case 1: p > 0, i.e., a = 0 mod(b — 1). Then r # 0,1 and r = p + 1, and we obtain 
a b-1- op 


=pb+pt+1l=at+pt+1=a+— + —— 
oO P au B aT 5-17 b-1 


9 a 9 ab - 
-#+ 055) -o(5—a} -# 
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Case 2: p = 0,i.e., a = 0 mod(b — 1). Then r = 0 or r = 1. With r = 0 we obtain 


a ab 


x= bb =a+pr-at 7 = 7 = %. 


With r = 1 we obtain 


One checks immediately that the obtained numbers are stationary points. The 
convergence is evident from Lemma 1. O . 


Since f as defined in (13) has the monotonicity properties required in Lemma 1 
the convergence problem becomes so simple, quite in contrast, say, to the 3x + 1 
map (cf. [5]). Also in the theory of monotone dynamical systems with continuous 
time the limit sets have a particularly simple structure (cf. [4]). 


THEOREM 3. Consider f as in (13). Then forx € Z 


(*) o a ia bok k =0,1,2 (16) 
fO(x) = +l at k= 01,20. 
Proof. For k = 0 the assertion is trivial. Assume (15) is true for some k. Then 
(kD x 
fx) =at+9 f ‘ 
9 - ab ab h-k 
aaa) +s | 
9 a ab pk 1 
=at paths] | 
a ab 
~ 9 _ ~(k+1) 
ate, [ —) | 
9 ab ab 4+) 
- Stl FS | : 


Remark. The formula (16) implies convergence to a stationary solution for 
k + o, Furthermore the formula exhibits, for each x € Z, the limit of the 
trajectory, if the discontinuities of 0 are carefully inspected. 

For given x, the stop time js the smallest n such that x, is a point of a periodic 
orbit. For large |x| the difference between a +|[x/b| and x/b is relatively small 
(as compared to |x|). Therefore the stop time is close to (log|x|)/log b for large 
|x|. In fact the stop time can be given explicitly. 


CorROLLARY 4. Consider the dynamical system (3)(13). Let a,b be fixed. Let 
t = t(x) be the stop time for the trajectory starting at x. Let B = ab/(b — 1). Then 
i) if b — 1 does not divide a and x > 3(ab/(b — 1)) 


x-—B 


T(x) = 8 8 5(B) -B 


/ log b 
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ii) if b — 1 does not divide a and x < 8(ab/(b — 1)) 


_ B-x 
= 0| log ——________~ /log b} + 1. 
iii) if b — 1 divides a andx > ab/(b — 1) +1 
log(x — B) 
= 3¢| —__ 
(x) | log b | 
iv) if b — 1 divides a and x < ab/(b — 1) 
_{ log( B — x 
r(x) = (| +1. 
log b 
The proof follows from Theorem 3 by solving for k. O 
Next we consider the dynamical system 
Xnat_ =f ,), n=0,1,2,..., (3) 
x 
f(x) =a - 0(5}. (17) 


Now f is not monotonely increasing and the system 1s somewhat more compli- 
cated. Let F =f f. Then from f(x) = —(9(x/b) — a) = —8(x/b — a) 


F(x) = (fo f(x) =a = 0 0(2 ~a)| 


5°(; -a)} (18) 


THEOREM 5. Every trajectory of the system (3)(17) becomes eventually constant or 
evolves into a cycle of order 2. Furthermore 
i) Ifa # 1 mod(b + 1), then the function f has exactly one fixed point 


_  —{ ab 
7-9/5"). (19) 
ii) Ifa = 1 mod(b + 1), then f° f has exactly two fixed points, namely, 
b(a — 1) 
1 = IT X,=xX,+1 (20) 


which form an attracting two-cycle of the dynamical system, and f has no fixed point. 


Proof of Theorem 5. The function F is monotonely increasing and satisfies, 
since b > 2, the hypothesis of Lemma 1. Hence, for each trajectory {x,} the 
subsequences {x,,} and {x,,,,} converge, and the limits are trivially fixed points 
of F. 

1) Let 


a=p(b+1)+ 9, O<p<b+1. 
Assume that 
x=mb +r, O<r<b 


474 PETER EISELE AND K. P. HADELER [June-July 


is a stationary point. Then 
mb+r=pu(b+1)+p—m-— d(r/d) 
or 
(m—pu)(b+1) =p —r—d(r/b). 
This equality implies m = mw and 
r+ d(r/b) =p. 
Hence p # l,i.e., a # 1 mod(b + 1). 
Case 1: p = 0. Then r = 0 and 


a ab - 
x= pbb=-a-pm-a- 7 7 =e =k. 
Case 2: p > 1, then r+ 1 =p, and 
a p 
= mb +p-1=a-p-1=a-(—— - F411] 


9 a 5(_% - 
—4 testhe [aa) <* 


On the other hand, if p # 1, then X is a stationary point. Hence i) is proved. 
2) It remains to analyze the case where there is no stationary point or, 
equivalently, p = 1. A fixed point of F corresponds to a pair x,, x, such that 
Xx, =a — B(x,/b), 
Xy =a — 3(x,/d). (21) 
Assume 
x, =mb-+r, X,=nb+s, O<r,s <b. 
Case 1: r,s > 0. From 
mb+r=a-n-1 
nb+s=a-m—1 
it follows that (m — n)(b — 1) =s —r. Thus b — 1 divides s — r. But |s —r| < 
b—1. Hence s—r=0 and m=n. The point x, =x, is a fixed point of f. 
Contradiction. 
Case 2: r= s = 0. From 
mb=a-—n 
nb=a-—m 
it follows that (m — n)(b — 1) = 0, thus m =n, and x, =x, is a fixed point of f. 
Contradiction. 
Case 3: r = 0, s > 0 (or vice versa). From 
mb=a-—n-—1, 
nb+s=a-—m 
it follows that (m —n)(b — 1) =s — 1. Hence b — 1 divides s — 1. Buts—1< 


b — 2. Therefore s = 1 and m =n, and a = m(b + 1) + 1. The numbers x, = mb 
and x, = mb + 1 satisfy equations (21) and are equal to x, and x,. O 
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Unfortunately, we have not been able to find an “explicit” solution formula or to 
determine the stop time. 


Remark. Since the systems (7) and (8) are equivalent, all results can be carried 
over to the case where 3 =| | (and, of course, 3 =[ ]). In particular, in Theorem 
2,ii, one has %, = ab/(b — 1) and x, =x, — 1, Theorem 3 remains as it is, in 
Corollary 4 one obtains only slightly different expressions. Finally, in Theorem 5 
the special case to be observed is a = b mod(b + 1), where <, = b(a + 1)/(b + 1) 
and x, =x, —1. 


To prove Theorem 3 one only needs the properties (9), (10). Here some hope 
may arise that a more general class of functions could be considered. See, however, 
the next section. 


3. Characterization of the floor and ceiling functions. 
Lemma 6. The function 3(x) = [x] has the properties (9)(10)(11). 
Proof. (9)(11) are evident. (10) is equivalent to 
B(O(x)/n) = O(x/n). (22) 


Assume x = mn+r,0<r<n,m &Z. For r= 0 the assertion is evident. Now 
let r > 0. Then 


0(d(x)/n) = 8(9(mn +1) /n) 

= 8((mn + B(r))/n) 

= (m+ B(r)/n) =m4+1. 
On the other hand 

O(x/n) = B(m+r/n) =m +1. O 
_ Lemma 7. If a function & satisfies (9)10)(11) then the function %, as defined by 
O(x) = —3(—x), satisfies (9)(10)(11). 
Proof. 
O(x +c) = —8(-x-—c) = —8(-x) +c = B(x) +c, 


3( Fm) -9(-—3(n)| = -0(—9(-n0) 
—3(—x) = 8(x). 0 


Finally we show that the conditions (9)(10) do not lead to any generality. The 
proof is surprisingly complicated (unless a simple argument has escaped our 
attention) and for this reason it will be given here. 


THEOREM 8. Assume 9: iy) — Z has the following properties 
i) For all x EG, c €Z holds 


o(x+c)= B(x) 4+. 
ii) For all x EG, n EN holds 


9(--9()] = 9(x) 
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or, equivalently, 


[| 


1 
o(-9()| = o| 
n 
Then either 3(x) =[x| for all x — Gor O(x) = 
Proof. 


x| forallx € @. 


m— 


1. For all x EG, O(9(x)) = 3(x), ie., all images of 3 are fixed points. 
2. Assume 3(0) = z € Z. Then 


0(0) =z=0(z) =0(04+2z) = 3(0) +z, 
hence z = 0, 


8(0) = 0. (23) 
Thus for c € Z 
O(c) =c. (24) 
3. Assume 0(1/2) =z € Z. Fork EN, 
1 2k +1 1 
z= 0| ———_ 0 | ——— ]] = Olk+— 
2k +1 2 | 2k +1 | | 
k + d(4) k+z 
~~ \ oaked | lok 41) 


If z < 0 then put k = —z to obtain z = 9(0) = 0. If z > 1 then put k =z + 1 to 
obtain z = &(1) = 1. Hence 


5} -omals)-» 


(25) 
4, Assume 3(1/2) = 0. Claim: 


1 
o[— | = oforn EN, n> 1. (26) 
nh 


Proof by induction. The assertion is true for n = 2. Assume it is true for some 
n—1> 2. Then 


1 9 n 
<)> [—] 
and thus 


s(t) -o(2els))<olcts)-° 


5. Still assume 3(1/2) = 0. Claim: 


k 
9] = Oforn,k EN, k <n. 
n 


Proof by induction over k. The assertion is true for k = 1. Assume 


(27) 


py 
o(- | = QOfor v=1,...,k —landn>vp. 
n 
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Letn>k andn=(m-—D)k+r,0<r<k, thus m > 1. If r= 0 then k divides 
nand 3(k/n) = 30 /m — 1) = 0. If 0<r<k, then 


n+k-—-r 
m= i ; 
mk k-r 
—_—=1+ , 

n n 

and thus, by assumption 
mk k-r 
| =-1+% |- 
n n 


Now 


(S))-)- 


Hence it is proved that 0(1/2) = 0 implies 0(x) = [x] for all x © Z. 
6. Now assume that 0(1/2) = 1. Then 9(1/2) = 0, thus &(x) = |x] for all 
x E@, &(x) = [x] for all x Ee @. O 


The authors thank a referee for useful suggestions. 
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The standard curriculum for a first course in linear algebra will normally 
contain a proof of the Cayley-Hamilton theorem, a discussion of criteria for the 
diagonalizability of complex square matrices and perhaps some mention of the 
functional calculus for such matrices. In this article we describe a decomposition 
theory from which these topics are easily developed, and which we think is 
sufficiently simple for inclusion at this level. The theory and its applications show 
how much can be achieved that is based purely on the use of elementary 
polynomial algebra. In particular we have been able to omit Jordan canonical 
forms, and yet retain areas to which this topic applies. 

The content of the theory is not new (see, e.g. [4, pp. 220-223]), but it seems to 
us that it has never been presented at an elementary level. Our approach is 
elementary, and we find that our students have accepted it readily. We therefore 
propose it now to teachers of linear algebra as a possible curriculum development. 
We also present a possibly novel approach to the functional calculus for square 
matrices, based on decomposition theory, which we have found successful in our 
teaching. 

The decomposition theory is outlined in §1, drawing from [4]. In the following 
sections, we explain how the topics mentioned above are developed by means of 
this theory. The definition of f(A) in §3 is suggested by Taylor’s theorem. Indeed, 
by decomposition theory, A can be expressed as the sum of a diagonalizable matrix 
D with a nilpotent matrix N, and the expansion of f(D + N) about D is a finite 
sum due to the nilpotency of N. This definition, although equivalent to that given 
by other authors has, to the best of our knowledge, not been previously presented 
in the literature. We conclude in the final section with a discussion of some 
interesting examples used to illustrate the theory and its applications. 
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We should like to thank the referee for some very helpful comments concerning 
the presentation of the paper. 


1. Throughout, A is an n X n complex matrix. Let 
k 


pA) = TT ~ 6)" 
be any polynomial which annuls A, and for 1 < j<k put 


f(A) = TT —-e,)". 


LA] 


We use partial fractions to write 


1 k Pe 
-ry—L, 
p(a) i=1j=1 = — o) 
and it follows that 
k 
uA) f(A) = 1, (1.1) 
where 
P, 


u(A)= » a; (A — c,)? (1l<i<k). 


j=l 
We now define, for 1 <j <k, 
E, = uj( A) f(A), (1.2) 


so that each E; is a polynomial in A which can be calculated by the simple 
algorithm just described. These matrices have the following properties. 


Lemma 1.1. For li <i<k,1<j <k, 


k 
LE,=1, EF,=0 (i#j), EP=E,. 


J J 
i=1 


Lemma 1.2. For each j, the range-space R, of E, coincides with the null-space of 
(A —c,] ys, 

Simple proofs of these lemmas—using (1.1), (1.2)—will be found in [4]. Using 
the last part of Lemma 1.2, we note that E,; # 0 if, and only if, c; is an eigenvalue 
of A. For this reason we assume throughout that c,,...,c, are the distinct 
eigenvalues of A. 

We now define two further polynomials in A by 

k 
D= JY c¢E, N=A-D. 
i=1 
These have the following properties. 


THEOREM 1.1. Jf A = D + N, where D and N are determined as above, then D is 
diagonalizable, N is nilpotent, and ND = DN. Moreover D and N are determined 
uniquely by these properties. 
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A simple proof—based on the above lemmas, and on simultaneous diagonaliz- 
ability for the uniqueness—is to be found in [4]. Note that the uniqueness of D 
implies the uniqueness of the matrices E,. In fact, Lemma 1.1 shows that 


k 
D’ = YiciE, (r>1), 
i=] 


and an application of Cramer’s rule to the first k of these equations determines 
each E, as a polynomial in D with coefficients fixed by the eigenvalues. 


2. Let m(A) be the minimal polynomial for A, and write 
k 
m(A) = [la —c¢,)"". 


We now suppose that the algorithm for determining the matrices E,, D and N is 
initiated by m(A). The following lemma plays a central role in what follows. 


LEMMA 2.1. For each j, the matrix NE, is nilpotent of index m,. 
Proof. By Lemma 1.1, for each J, 
NE, = (A —_ CE, “oS c, E,)E; = (A —_ cI)E.. 
and 
(NE) =(A-c¢I)E,  (r2>1). 
Hence, 
(NE,)"’ = (A —¢,1)"'u,(A)f,(A) = u,(A)m(A) = 0. 

Suppose now that (NE;) = 0. Then (A — ¢,J)'E; = 0. Let x © C” and put 


pa) =TTA-e«)". 
i#j 
The vector p,(A)x is in the null-space of (A — c,J), and so, by Lemma 1.2, in 
R,. Hence, by Lemma 1.1, pj Adx = E, pf A)x, and we have 
(A —c,1) p(A)x =(A- c,1) E,p,(A)x =0 
for all x in C”. It follows that (A — c;)’p(A) annuls A, and consequently r > m,. 
We can now establish the following result. 


THEOREM 2.1. Jf N is the nilpotent part of A, then the index of nilpotence v for N 


is given by v = max mM, 
l<j<k 


From the case v = 1 we see that A is diagonalizable if, and only if, the minimal 
polynomial of A factors into simple linear factors. This, of course, is a well-known 
criterion for diagonalizability. 


Proof. Lemma 1.1 shows that for r > 1 
N'=N'E, + N'E, + ++: +N'E, = (NE,)’ + (NE) + ++ +(NE,)’, 


and, hence, that N’ = 0 if, and only if, (NE,)" = 0 (1 <j < k). Lemma 2.1 shows 
that this is true if, and only if, r > v. 
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We next prove 


THEOREM 2.2. If D is the diagonalizable part of A, then A and D have the same 
characteristic polynomial. 


Proof. Since D and N commute they are simultaneously triangulisable (see, 
e.g., [6], p. 319). A triangular matrix similar to N is nilpotent and so has zeros on 
its main diagonal. It follows that A and D are similar to triangular matrices with 
the same main diagonal, and so they have the same characteristic polynomial. 

As a corollary we obtain a result containing the Cayley-Hamilton theorem as a 
special case. 


CoROLLARY 2.3. The characteristic polynomial of A is a multiple of its minimal 
polynomial. 
k 
Proof. As in §1, let D= )/c,E;, and let R; be the range-space of E,. Put 
i=1 
d,; = dim R,. Using Lemma 1.1 we obtain 


k 
Dx = | > abe = ¢;Xx 

i=1 
for any x in R,. Now form a basis for C” by first selecting a basis for each R,; and 
then forming their union. With respect to this basis, D is represented by 


diag(c,l,,...,¢,J,), where J, is the d; x d; identity matrix. Thus the characteristic 
polynomial of D, and so of A, is 
k 
Tla- c)). 
i=1 


The index of nilpotence of NE; cannot exceed d,, and hence m, < d;, by Lemma 
2.1. This completes the proof. 

The bases for the range-spaces R,, mentioned in the proof above, can be chosen 
so that their union forms a basis for C” with respect to which the action of A is 
represented by a (similar) matrix in Jordan canonical form. The method of choice 
depends on the theory of nilpotent operators (see e.g. [3, §57]), and this theory is 
more subtle than anything presented here. As we shall see in §4, there are 
examples to which the simpler decomposition theory can be applied as an alterna- 
tive to using Jordan forms. This suggests that these forms could be left as an 
‘optional extra’ in a course which involves this theory. 


3. The functional calculus. Let A be a complex n Xn matrix with distinct 
eigenvalues c; (1 <j <k), and let D, N and the matrices E; be constructed as in 
§1. Let f be a complex-valued function which has derivatives up to order n — 1 at 
each c,, and denote by the class of all such functions f. For f © F, we define 


n-1 £02)(D 
f(A) = > PD) im 


m=0 


— (3.1) 


where, for each m, 


k 
f(D) = X FO (ee) E;. (3.2) 
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As far as we are aware, this definition—which seems to us particularly natural 
because of its connection with Taylor series—has not been given previously in this 
form. To make it acceptable to our students we show that it corresponds to 
“replacing A by A in f(A)” for an appropriate class of functions. For the reader, 
we should perhaps also show that it is equivalent to the other definitions of f(A) 
to be found in the literature (see e.g. [1, chapter 5], [2, chapter 5], and more 
recently [5]). As a by-product, we obtain some simple results of independent 
interest. 


THEOREM 3.1. If f,, f, © F, then 
(Ai fi + Agfa)(A) = AV fiCA) + Arfa( A), 
(fifi)(A) =fi( A) ACA), 
where ’,, A, are complex scalars. 


Proof. The first of these results follows immediately from (3.1) and (3.2). For 
the second, we use additionally the nilpotence of N and Lemma 1.1 to write 


rote f(D) f(D) 


f( AACA) = DNL 
n—-1 NYr 
- LU Ulm ) EAmeyh ™(e;) Ey. 


Changing the order of summation, and using Leibnitz’s rule, the right-hand side 
becomes (f, f,)(A). 

It is easy to verify that f(A) =A if f(A) = A, and that f(A) = if f(A) = 1. It 
follows from this, and from Theorem 3.1, that p(A) is obtained by substituting A 
for A in p(A), whenever p is a polynomial. We can extend this to rational 
functions, but before doing so we first prove the following theorem. 


THEOREM 3.2. If f © FY, then f(D) is the diagonalizable part of f(A), and both 
matrices have as their characteristic polynomial 


(A — f(e,)) “(A — f(eq))? + (A = flex), (3.3) 
where d, = dimR.,. 


Proof. Since f(D) = Li_, flc,)E;, it follows, as in the proof of Corollary 2.3, 
that f(D) is diagonalizable, and that its characteristic polynomial is the one given 
in (3.3). Write 


n—-1 f£(™)( fh 
f(A) =f(D) +N ar / . a (3.4) 


Since N commutes with each E,, it follows that N commutes with each of the 
matrices f(D) occurring in (3.4). So f(A) — f(D) is nilpotent. Moreover this 
matrix commutes with f(D), since each matrix is expressible as a polynomial in A. 
Hence f(D) is the diagonalizable part of f(A). An application of Theorem 2.2 
completes the proof. 

Now let p and q be polynomials, and assume that qg is non-zero at the 
eigenvalues of A. Then p/q € ¥. Theorem 3.2 shows that 0 is not an eigenvalue 
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of g(A) and so q(A) is invertible. Since by Theorem 3.1 (p/q)A)q(A) = p(A) 
we deduce that (p/q)(A) = p(A)(qg(A))*. 

For f € F, it follows from (3.4) that whenever A is diagonalizable then f(A) is 
diagonalizable. We now state a partial converse. 

For a general n X n matrix M, a(M) will denote the set of eigenvalues of M. 


THEOREM 3.3. Let f € FY and suppose that f(A) # 0 whenever X € o( A). If 
f(A) is diagonalizable then A is diagonalizable. 


Proof. Assume that f(A) is diagonalizable. Let 


n—-1 £(™M) 
> f (P) mt 


X= 
m! 


m=! 
where D and WN are, respectively, the diagonalizable and nilpotent parts of A. We 
have, as in (3.4), f(A) = f(D) + NX, and Theorem 3.2 shows that f(D) is the 
diagonalizable part of f(A). It follows that NX = 0. An argument similar to the 
one used in the proof of Theorem 3.2 shows that f’(D) is the diagonalizable part 
of X. Theorem 2.2 now implies that 


a(X) =o(f(D)) = {f(A):A € o( AD}. 


The hypothesis shows that 0 ¢ gs(X), and so X must be invertible. We deduce 
that N = 0 and hence that A is diagonalizable. 
Using Theorems 3.2 and 3.3 we can prove 


THEOREM 3.4. Let f be an entire function. Then the implication 
f(A) =0 =A _  diagonalizable 
holds for all complex n X n matrices A if, and only if, each zero of f is simple. 


Proof. If each zero of f is simple then f(A) #0 whenever f(A) = 0. In 
particular if f(A) =0 then f(A) #0 whenever A € a(A), since for such A, 
f(Aeo(f(A)) = {0}. The diagonalizability of A now follows from Theorem 3.3. 
Suppose, conversely, that A is diagonalizable whenever f(A) = 0. Assume that Aj 
is a zero of f which is not simple, so that, for some entire function g, f(A) = 
(A — Ao)’g(A). Let the n X n matrix A be given by {a,,}, where a;, = Ay (1 <i <n), 
ay, = 1, and aj, = 0 otherwise. Then (A — Ay)? = 0, and so 


f(A) = (A — Agl)°g(A) = 0. 


However A is not diagonalizable. This contradiction shows that each zero of f 
must be simple. 

Since N, and each of the matrices E,, is a polynomial in A, it is clear from (3.1) 
and (3.2) that f(A) is a polynomial in A whenever f € ¥. Different functions f 
and g in Y may therefore give rise to the same matrix. The result stated below 
gives a necessary and sufficient condition for this to happen, and also shows (see 
[5]) that alternative definitions of f(A) involving Jordan canonical forms, or 
interpolation polynomials are equivalent to the one we give here. 


THEOREM 3.5. Let f and g be in .. Then f(A) = g(A) if, and only if, 
Fc) =e9°™c) U<ji<k, 0O<sm< m, — 1), (3.5) 
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where for each j, m, is the exponent of A — c, in the factorization of the minimal 


J 


polynomial for A. 
Proof. From (3.1), (3.2) we have 

n—-1 N™ 

f(A) — (A) = YE (f(D) — gD) — 
m=) ° . ° 
n-1 k m 

=) | » (f°(¢;) — gc) )E; mm! 

m=0 \j=1 ° 


Interchanging the summations, using Lemma 2.1, and noting that N’"E, = (NE,)” 
for m > 1, the right-hand side can be written as 


k [m,—1 NE. 
E[E (1%) ep) ) 
j=l 


! 
m=0 m. 


We deduce from Lemma 1.1 that f(A) = g(A) if, and only if, (fC4) — gCA)E, = 0 
for 1 <j <k, and so if, and only if, 


myo NE, 
yu (f°(¢;) — g™(c;)) a = 0 (3.6) 
m=0 ° 


for 1<j <k. Hence f(A) = g(A) if we have (3.5). Suppose conversely that 
f(A) = g(A). Then the conditions (3.6) hold. Multiplying both sides of (3.6) by 
N™~* shows that f(c;) =g(c,), since by Lemma 2.1, N"E, #0 for 0<m< 
m,; — | whereas NE; = 0 for m > m,. Subsequent multiplication of both sides of 
(3.6) by N™. for m = m, — 2, m, — 3,..., successively gives the remaining equa- 
tions in (3.5). 


4. For some years we have given a first course in linear algebra which includes 
both the decomposition theory and parts of the functional calculus (as outlined 
here), as well as the more standard topics. We feel that the inclusion of the “new” 
topics provides an opportunity to set some instructive and sometimes important 
problems. For example a student can be asked to find the D + N decomposition 
for a matrix A (usually 3 x 3 to keep things simple), and then to determine f(A) 
for a function f (often the exponential function). 

If, for example, 


2 0 -3 
Av\={|-1 -1 -3], 
1 3 6 


then the characteristic polynomial of A is given by 


p(A) = (A = 3)(A - 2)’. 


With c, = 3, cy = 2 the algorithm of §1 now gives 


—-3 -9 —-12 
0 0 0 ? bk, = 
1 


3 4 


£,= 
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so that 
—1] -9 -—-12 3 9 9 
D= 0 2 0], N= {-1 -3 -3 
1 3 6 0 0 0 


An application of (3.1) with f(A) = e* gives 
e4 = 3 E, + e°E, + (e°E, + e7E,)N + 4(e3E, + e72E,)N?. 


Rather than substitute directly for E,, E, and WN at this stage, we encourage 
students to use some theory. Thus one of the factors in p(A) is linear and by 
Lemma 2.1 this implies that E,N = 0. Lemma 1.1 gives E, + E, =I and hence 
E,N = N. The highest exponent occurring in the factorization of p(A) is 2, and so 
Theorem 2.1 shows that N* = 0. This results in the simplified calculation 


7—3e 18-—9e 21- 12e 
e4 = e7E, + e7?(E, +N) =e? —1 —2 —3 
—-l+e -3+3e —3+ 4e 
and illustrates a practical application of the theory. 
As an illustration of the importance of the technique for finding the exponential 
of a square matrix, a student can be asked to solve linear differential equations of 
the form 


? 


y(t) = Ay(t) 
where y(t) is a vector-valued function of the real variable t, and where A is a 
square matrix with real or complex terms. The solution, of course, involves the 
matrix e4’, 

In conclusion we remark that Theorem 3.3 and Theorem 3.4 (see §3) are not 
included in our course work. Our knowledge of them enables us to set problems 
which can be both interesting and challenging. For example, Theorem 3.3 shows 
that “A is diagonalizable whenever e“ is diagonalizable.” Again, Theorem 3.4 
shows that “A is diagonalizable whenever sin A = 0”. Our students, who are not 
familiar with these theorems, have to use some ingenuity to prove results of this 


type. 
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1. Introduction. In the late eighteenth century, Lagrange proved that every 
positive integer can be represented as a sum of at most four squares of integers. 
This cannot be improved to three squares as there are infinitely many integers 
(e.g., 7) which cannot be represented as a sum of three squares of integers: 


Lemma 1 (Legendre, 1798). A positive integer n cannot be expressed as the sum of 
three squares of integers if and only if n is of the form n = 27"(8m + 7) where m and 
k are non-negative integers. 


In this article we shall prove two recent conjectures about whether various 
binomial coefficients can be written as the sum of less than four squares, using 
methods from the elementary theories of numbers, stochastic matrices and of 
graphs. First we will prove a conjecture stated by Alvan Beall, Blair Kelly, and Bob 
Morris at the Western Number Theory Conference in December, 1987: 


THEOREM 1. For every positive integer n, except 1, 2, 3, 4, 5, 9, 14, 17, 18, 20, 21, 
35 and 41, there exists an integer m,0 < m <n, divisible by 4, for which (” ) cannot 
be represented as a sum of three squares. 


We will also prove a conjecture of Neville Robbins [3]: 


THEOREM 2. The set of integers n, for which (2”) cannot be represented as a sum of 
three squares, has asymptotic density 1/8 in the set of all natural numbers. 


Our proof of Theorem 1 will rely on the following result, which we shall prove 
in Section 2: 


PROPOSITION 1. Suppose that m and n are positive integers with m divisible by 4. If 
(") cannot be written as the sum of three squares then neither can (5") nor (2% * '). 


486 
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We define E, to be the set of exceptional n in the hypothesis of Theorem 1. We 
define S to be the set of positive integers that cannot be written as the sum of 
three squares of integers. We can now give the 


Proof of Theorem 1. Let E, and E, be the sets of values of n that are given in 
Tables I and II respectively. We see there that for each such n there exists a value 
of m for which (") © S; moreover if n € E,, then m is divisible by 4. 


TaBLeE I. The least m with (7) © S, for each n € E). 


7, 28, 71, 284 

8, 11, 43, 57, 136 
82 

10, 34, 83 

19, 165 


10000 1010001 
100010000 101001000 
100010001 101001001 
10001001 101001010 
1000101 101001011 
1000110 10100110 
1000111000 10100111 
1000111001 101010 
100011101 1010110 
10001111 1010111 
1001000 10110 
10010010 10111 
10010011 110 
1001010 1110000 
1001011 1110001 
100110 1110010 
100111 1110011 
1010000 11101 
1111 


One can also see, from Table II, that for any positive integer n, not in EF, U E,, 
there exists an integer e © EF, such that the first few digits in binary notation of n 
are precisely those of e: in other words, there exist integers k and r such that 


0<r< 2“ —1 where 


n=e:2k4r. 


(As an example, suppose that n = 13 = (1101), in binary notation. Now 6 = 
(110). € FE; and so 13 = 6.2 + 1. Try also 575 = (1000111111).; we have 143 = 
(10001111). € E; and so 575 = 143.2 + 3.) 
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We shall prove that for any positive integer n € EF, U E, there exists a value of 
m, divisible by 4, for which (”) © S, by induction on k. 

For k = 0, this is trivial as n = e € E,, and the value of m is given in Table IL. 
So suppose the result holds for k — 1: 

For n = e2* +r let s =r/2(r even), (r — 1)/2 (r odd) and p = e2*7! +5. As 
0<s < 2*~! —1, we have a value of t, divisible by 4, for which (?) © S, by the 


induction hypothesis. Therefore, (".) © S by Proposition 1. 

Remark. We see, from the proof, that if n = e2“ +r where e € E, and (*) 
cannot be written as the sum of three squares, then neither can ( akon ). 

We will prove a stronger result than Theorem 2 in Section 3: 

THEOREM 3. Fora = 0, 1, 2 and 3, 

#{n <x:n =a(mod4),n!]€ S} =x/32 + o(x). 

We see that Theorem 2 follows easily from Theorem 3 by the following result, 
which we’ll prove in Section 2: 

PROPOSITION 2. For any positive integers m and n, 


neS ifandonly mnesS. 


Proof of Theorem 2. 
#{n <x: (2"} = s| = #{n <x:(2n)!€ S$} _ by Proposition 2 


= #{m <2x:m=O0(mod2) and mie S$} 
=x/8+0(x) by Theorem 3. 


Theorem 2 is perhaps surprising, as the positive integers n <x, for which 
n © S, actually have density 1/6. By Lemma 1, 


#{n<x:nES}= Yo #{(n<x:n=27*(8m+7) forsome m> 0} 
k>0 


log x 


= bi {x/2°*** + 0(1)} 


k=0 
= x/6 + O(log x). 

However, the reason that we get 1/8 in Theorem 2 (instead of 1/6) can most 
easily be explained by realizing that the parity of the power of 2 dividing n! is 
equally distributed between even, and odd, whereas this is not the case for ordinary 
integers. 

2. Sequences of 0’s and 1’s (Proofs of Propositions 1 and 2). Any positive 
integer ” can be written in the form 2°b where a and 5b are nonnegative integers, 
and b is odd. Let v(n) be the residue class of a(mod 2) and f(1) be the residue 
class of b(mod 8). Lemma 1 may be expressed as 

nes iff v(n)=0 and f(n)=7. (1) 

We also note that 


v(mn) = v(m) +v(n) and f(mn) =f(m)f(n). (2) 
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for any integers m and n. We can immediately give the 
Proof of Proposition 2. By (2) we see that 
v(m?n) = 2v(m) + v(n) =v(n) and f(m?n) = f(m) f(n) = f(n). 
Thus, by (1), n € S iff v(n) = 0, and f(n) =7 
iff v(m?’n)=0, and, f(m?n) =7 
iff m’nesS. 


Define w(n) = v(n!) and g(n) = f(n!). Also, for any fixed sequence aB --: x 
of 0’s and 1’s, define o,,... (1) to be the number of occurrences of this sequence 
in the binary expansion of n. We prove 


LEMMA 2. For any given positive integer n, 
(a) w(n) =n — o,(n)(mod 2), and 
(b) g(n) = 370110) +1000) 7 1010) + 71100) (mod 8). 


Proof. Suppose that the binary expansion of n is L¢., a;2' where each a, = 0 
or 1. Then the power of 2 dividing n! is 


nh d d U 
Ya) D La2i= Daya 
jzl 2 j2lisj i=1 j=l 
d . 
= Yia2'-1)=n- a(n). 
i=1 
Now, as 
m 1 m =0,1,2,7 
llij=(3 if m=3,4 (mod 8), 
=] _ 
} ad 7 m=5,6 
we can see that 
n | [n/2*] 
g(n) = 5k = I] j(mod 8) 
k>0i=1 k>0 j=l 
2* Ili j odd 


_ ZAK > 0:7,=3 or 4} HK >0:7,=5 or %(mod 8) 


where 7, = 4a,,5 + 2a,,, + a,, which establishes the result. 
By noting that 3°7° = 7(mod'8) if and only if a is even and b is odd, we can 
deduce, from Lemma 2 and (1), the following result. 


CoROLLARY 1. For a given positive integer n, n! is an element of S if and only if 
G) of) = n(mod 2), Gi) 05,441) = Gig (MGmo0d 2) and (iii) oi9(n) = o44.(n) 
+ 1(mod 2). 


In practice, Corollary 1 provides an efficient algorithm for determining whether 
n! is an element of S: Simply write n in its binary notation, count the frequency of 
occurrences of the various digit patterns 1, 011, 100, 101 and 110, and then check 
whether (i), (ii), and (ii) are satisfied. 

We can also deduce from Lemma 2: 
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CoROLLARY 2. For any positive integer n, w(2n + 1) = w(Q2Qn) = w(n) + 
n(mod 2) and g(2n + 1)/g(n) and g(2n)/g(n) are congruent to 1 and 1,3 and 1,7 
and 3, and 1 and 7(mod 8) as n = 0, 1,2, 3(mod 4), respectively. 


Proof. w(2n + 1) — wn) = vQn + 1) = 0, by definition, and w(2n) — w(n) 
= (2n — 0,(2n)) — (n — o(n)) = nGmod 2) by Lemma 2(a). Also g(2n + 1)/g(2n) 
= 2n + 1(mod 8) and g(2n)/g(n) = 1, 1, 3 and 7(mod 8) as n = 0, 1, 2, 3(mod 4) 
by Lemma 2(b). 

From this, we can immediately deduce 


CoROLLARY 3. Suppose that m is divisible by 4 with 0 < m <n. Then 


(Con) =e((2%)) =n) 
(on) Alsi) lr) 


Proposition 1 follows immediately from (1) and Corollary 3. 


3. Proof of Theorem 3. Define g(0) = 1, w(O) = 0 and let 
T(x,a;2,w) = #{0 <n <x:n =a(mod 4), g(n) =g andw(n) = w} 


and 


for 
a=0,1,2,0r3, g=1,3,50r7 and w=0Oorl.... (3) 
Let p(x, a; g,w) = T(x, a; g, w)/x, which can be thought of as the “probability” 


that an integer n <x has the properties that n = a(mod 4), g(n) = g and w(n) = 
w. By (1), Theorem 3 is evidently implied by 


THEOREM 4. For each a, g and w in the range (3), we have 


p(x,a;g,w)~1/32 as x>~. 


We shall prove, in Section 4, 


PROPOSITION 3. Fix € > 0. If n is sufficiently large (> n, say), then 
1 


J 43n —l< 
32 


p( x,a3gZ,w) — é 


for any a, g and w in the range (3) and integer x > 1. 
From this, we can immediately give the 


Proof of Theorem 4. Fix ¢ > O'and choose n > n, ;. (where n, is as in Proposi- 
tion 3). For any z > 2*°"(1 + 2/e) choose x to be that integer for which 2%”"(x — 1) 
<z< 2*"x. Then 


J 43ny 


4 7 43n 
32 


T(2*"x,a;g,w) — 


Zz 
T(z,a;g,w) i 32 < 


EX 
< 24n( + i < €Z. 
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4. Stochastic matrices. Let p(x) be the 32 by 1 vector with entries p(x, a, g,w). 
Let Q be the 32 by 32 matrix indexed in both directions by (a, g,w); where the 
entry in the (a’, g’, w’)th row and (a, g,w)th column is 


0 0 1 0 
0 2 3 0 
1 0 1. 0 
1/2 if a= ; a= a, g'/g= | (mod 8) w' —-we (mod 2) 
2 3 7 1 
3 1 3 1 
3 3 1 1 
0 otherwise. 


Note that Q is a doubly stochastic matrix all of whose entries are non-negative. 
(A matrix M is said to be stochastic if the sum of the entries in each of its columns 
is 1, doubly stochastic if the same is true for each of its rows). In Section 5 we shall 
prove 


PRoposiITION 4. A = Q* is a doubly stochastic matrix all of whose entries are 
positive. 


Now Corollary 2 implies that p(2x) = Qp(x) for any positive integer x; by 
iterating this times we get 


Lemma 3. For any positive integers n and x, p(2"x) = Q"p(x). 


For a given matrix M and vector x we say that lim, _,,, "x exists and equals y 
if the ith component of M”x tends to the ith component of y as n tends to ©, for 
each 7. An important result on stochastic matrices has been given by Perron (see 
[1, p. 216] for a nice proof). 


LEMMA 4. Suppose that M is a stochastic matrix all of whose entries are positive. 
There exists a vector a (=a(M)) such that, for any stochastic vector v with 
components all nonnegative, we have lim, _,,. M"v =a. 


We can now give a proof of Proposition 3 by using the three results directly 
above: 


Proof of Proposition 3. M = Q*® is a doubly stochastic matrix, all of whose 
entries are positive, by Proposition 4. Let v be the 32 x 1 vector with each entry 
equal to 1/32. As each row sum of M is 1 we see that Mv = v, and so the value of 
a in Lemma 4 is given by 


a= LimM"”v = Limv =v. 


Therefore, 
Lim p(2""x) = Lim M"p(x) by Lemma 3, 
=y by Lemma 4, 


and the result follows immediately, by definitions. 
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5. Graphs of Stochastic matrices. Suppose that M is an n Xn matrix, all of 
whose entries are non-negative. The directed graph G(M) of such a matrix is 
obtained by taking n vertices v,,...,v, and putting a directed edge from 1, to v, if 
and only if the (7, j)th entry of M is non-zero. A path of length k from v to w isa 
sequence of k (not necessarily distinct) directed edges, starting at v and ending at 
W; €.8., 

V = ao to Qi; a, to a4, a4 to A35+++5Ap_] to a, = W. 
From this definition we have 


LEMMA 5. Suppose that M is ann X n matrix all of whose entries are non-negative 
and k is a positive integer. Every entry of M* is positive if and only if there is a directed 
path of length k, in both directions, between every pair of (not necessarily distinct ) 
vertices in G(M). 


Proof of Proposition 4. It is not difficult to show that the product of any two 
doubly stochastic matrices is itself doubly stochastic; therefore, as Q is doubly 
stochastic, so are O”,Q7,...,QO°% =A. 

In FicureE | we give a subgraph of G(Q). By visual inspection one can see that 
there is a directed path of length a,(< 17) from v to 071, and one of length 
b,(< 26) back from 071 to v, for each vertex v in G(Q). Therefore, for any pair of 
(not necessarily distinct) vertices v and w in G(Q) we let P,,, be the path given by 
joining the path of length a, from v to 071, to 43 — a, — b, circuits of the edge 
from 071 to itself, to the path of length b, from 071 to w. As P,,, has length 43 in 
each case, we know that every entry of A = Q® is positive, by Lemma 5. 


Fic. 1. The vertices are indexed by three numbers which, in order, correspond to a, g and w. 


6. Some final comments. A number of related questions can be asked: 

1. If we fix k what is the density of integers n for which (’) € S? It seems 
possible to derive a complicated general formula that gives a variety of different 
values as we vary over values of k. 

2. In the proof of Theorem 1 the values of m — © as n > ~., Is it true that 
there exists an integer my, such that for all n € E, we have (") © S for some 
m <m,? The answer to this is yes with m, = 74; however our proof is extremely 
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complicated and uses the idea of an infinite system of congruences that cover the 
integers (see [2] for a review). 

3. Let P(n) be the number of integers m in row n of Pascal’s triangle for which 
(") eS. Let ICN) = P(1) + P(Q2) + --: +PCN). Does Lim,_,,, H(N)/(N 7/2) 
exist? If so, what is it?, 1.e., is there a density of integers in Pascal’s triangle that 
are not representable as a sum of three squares? 
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LETTERS TO THE EDITOR 
To the Editor: 


In their Teaching of Mathematics note [2] M. E. Larsen and B. S. Jensen offer 
what they call a “simple, direct solution” for the initial-value problem of the 
system 
(1) 


X, = ax, + bx, 


X,=CX,+ dx, 


which avoids transformation to coordinates along the eigenvectors of the coeffi- 
cient matrix A and the use of complex eigenvalues. In my opinion, their solution is 
neither simple nor direct and, worst of all, from a pedagogical point of view, in 
spite of its elaborate methodology, the’ solution applies only to 2 X 2 systems, 
where the characteristic equation for A is simply A* = —det(A)J if tr(A) = 0. 

The Gaussian elimination method, which applies to the most general systems 
(see, e.g. [1, Chap. 6]) gives, even for the special system (1), a simpler and more 
direct solution, without the use of eigenvalues and other results from linear 
algebra. Write (1) as 


(D—a)x,—bx,=0, —-cx,+(D-—d)x,=0 
and eliminate x, by applying the operators (D — d), b, obtaining 
(2) |(D — @)’ — A]x, = 0, 


where we have used the notations of [2]: @ = (a + d)/2A = (a — d)/2)’ + be. 
Using the shifting rule, according to which p(D + h)x =e7"p(D)e™x, (2) is 
equivalent to 


(3) (D? — A)e~%x, = 0. 


From here on, or even immediately after (2), one could refer to the universally 
taught method for solving second-order linear equations with constant coefficients; 
or proceeding as in (2), let y = 6 be the solution of (D? — A)y = 0 for which 
y(0) = 0, y(0) = 1. Then y = 6 is the solution for which y(0) = 1, y(0) = 0, and 
the general solution of (3) is 


(4) x,(t) = e®'(c,8(t) + c,6(t)), 


hence x,(0) = c,, ¥,(0) = c, + Qc,. The constants c,,c, are determined from the 
given values x,(0) = x°, x,(0) = x9, thus by (1) %,(0) = ax? + bx$. Then (4) gives 


at+d 


a-—d , 
x(t) =e * {( 5 x? + bx’) + x98), 


which is the solution of [2]. By interchanging a and d, b and c, x° and x, one 
obtains x5. 


MICHAEL GOLOMB 
Purdue University 
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To the Editor: 


The result in the note, “Counting the Rationals” by Y. Sagher in the November 
MonrtTHLY appeared in your journal 29 years ago. See K. McCrimmon, Enumera- 
tion of the positive rationals, Amer. Math. Monthly, 67 (1960) 868. 


Yours sincerely, 


Harvey I. BLAu 

Professor of Mathematical Sciences 
Northern Illinois University 
DeKalb, Illinois 60115 


To the Editor: 


In my review of Complex Analysis by Bak and Newman and Invitation to 
Complex Analysis by Boas (March, 1990), I noted my suprise not to find the 
symmetric version of Rouché’s theorem (“if f and g are analytic on and inside the 
simple closed curve [ and |f + g| <|f| + |g| on I, then f and g have the same 
number of zeroes inside T’’’) in either book and added “true, no other text of my 
acquaintance includes it either; but somehow one comes to expect more from Boas 
and Newman.” In fact, this result does appear as Theorem 18.5 on page 156 of the 
text [E]. An undoubtedly more accessible reference for readers of the Monthly is [G]. 


Sincerely yours, 
LAWRENCE ZALCMAN 


Bar-Ilan University 


[E] T. Estermann, Complex Numbers and Functions, Athlone Press, London, 1952. 
[G] Irving Glicksberg, A remark on Rouché’s theorem, this Montuty, 83 (1976) 186-187. 
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A Geometric Regularity From 
Order-Restricted Statistical Inference* 


TIM ROBERTSON and CAROLYN PILLERS 
Department of Statistics and Actuarial Science, University of Iowa, Iowa City, lowa 52242 


In the course of our investigations in statistical inference, we discovered that the 
following identity is true for all integers k > 3: 


~1k-2 k-1 (hk —j 
(" 2 ‘ EY cos! y/ 4=P D _ eq», 
2 i=1 j=i+1 (k —i)j 

where cos” is the principal value of the inverse cosine function. This is an 
unusual geometric identity because it deals with averages of angles. We will first 
prove the identity, and then sketch its geometric interpretation and connection to 
statistical inference. 


1 


Proof. First note that (‘ 5 ‘| is one more than a multiple of three when &k is a 


multiple of three, and is a multiple of three otherwise. Let S be the set of (‘ 5 ‘| 


indices, (i, j), involved in the double sum appearing in the identity. A graph of S, 
which is a triangular-shaped lattice in R’, is given in Figure 1 for k = 9. Partition 
S into groups of three, with possibly one element remaining, and enumerate the 
points in the following manner: The corners of the triangle are the first three 
points. The next set of three points consists of the point just below the upper left 
corner together with the highest and lowest points of the vertical line just to the 
left of the upper right corner. Each succeeding set of three points consists of one 


Penvrno Wo fF AN HA a DO LO 


*This research was sponsored by the Office of Naval Research under ONR contract N00014-80- 
C0321. 
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point from each edge of the triangle as in Figure 1. When the outside triangle is 
exhausted by this process, the enumeration is continued on the next, inner, 
triangle. The proof consists of showing that the sum of the angles corresponding to 
each set of three points is 180°. If k is divisible by three so that there is a point left 
over at the end, then the corresponding angle is exactly 60°. 

A triple which corresponds to the corners of one of the triangles is of the form 
(m,k — m),(m,2m),(k — 2m,k —m) for some positive integer m. Thus, the 
corresponding angles are 


cos~! ~m?/[(k —m)(k —m)], cos~' y¥[m(k — 2m)]/[2m(k -— m)] , 


and 
cos7' /[(k — 2m)m]|/[2m(k — m)] . 


By combining the last two angles using the inverse version of the double angle 
formula, 2cos~! x = cos~1(2x* — 1), it follows that these three angles add up to 
180°. 

Any other triple must be of the form (m,k — m — n),(k — 2m —n,k — m), 
(k — 2m —n,k —m — n) for positive integers m and n. By combining two of the 
associated angles using the inverse version of the angle sum formula, cos”! x + 
cos! y = cos~'[xy — (1 — x?)'71 — y?)'/7], it follows that these three angles 
add up to 180°. 

If there is a point remaining, then k = 3a for some positive integer a and the 
remaining point is (a,2a). The associated angle has cosine equal to 


[a(k — 2a)]/[2a(k — a)] = Va2/(2a) and must be 60°. 
Thus, the double sum appearing in the identity is 180° x(*;1)/3 if (*5) is a 
multiple of 3, and is 180° x (* >} /3| + 60° if (*; *) is one more than a multiple 


of 3. Since there are ( k 5 ‘| terms in the double sum, the average measure must be 
exactly 60°. 

A referee suggested an alternative proof using the following lemma: Let a, b,c 
be nonnegative with a+b>0,b+c>0,a+c> 0. Let 6,,0,,0, be acute an- 
gles defined by cos* 6, = bc/[(a + ba +c)], cos* 0, =ac/[(a + bXb + c)], 
cos* 6, = ab/[(a + c)(b + c)]. Then 6, + 6, + 03; = 180°. The theorem then fol- 
lows by setting a=k—-2m—n,b=m+hn,c =m. 


Remarks. This identity states that the average of the angles between the 
generators of a particular polyhedral cone in Euclidean k-space is 60° (see Stoer 
and Witzgall [4] for a thorough treatment of polyhedral cones). We discovered this 
identity as part of our research in an area of statistics called order restricted 
inference. An account of the theory in this area can be found in [3] and more 
concise summaries in [1], [2], and [5]. 

Polyhedral cones play important roles in order restricted inference as restricted 
estimates are found by projecting the usual unrestricted estimates onto the cone 
and test statistics are proportional to the distances between the various estimates. 
One cone that is of particular interest is L, = {x; x, <x, < +: <-x,}. This cone 
is the direct sum of the pointed cone, L, = L, M {x; U*_,x, = 0} and the linear 
subspace, {x; x, =x, = ‘*: =x,} and it is the pointed cone L, that has the 
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regularity expressed in the identity. The generators of L, are given by 
F = (—k + 1,1,1,...,1) 
l,=(-k+2,-—kA+2,2,2,...,2) 


I,_, =(-1,-1,...,-1,k4 -1). 
For i<j, the cosine of the angle between 1; and 1, is equal to 
[i(k —j)]/[(k —i)j] . There are (‘ 5 ‘| distinct pairs of generators and the left 


side of the identity represents the average of the angles between the generators. 
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The Solution of Certain Integral Equations 
by Means of Operators of Arbitrary Order 


BERTRAM Ross and BALDEv K. SACHDEVA 
University of New Haven, West Haven, CT 06516 


An integral of the form 


1 x aoe | 
To fr f(t) dt,  Re(v)>0 (1) 


is known as the Riemann-Liouville integral. It defines integration and differentia- 
tion to an arbitrary order. The R-L integral can be denoted by .D,’f(x). The 
subscripts on the operator D denote the terminals of integration. The fractional 
calculus is not limit derived; hence, the words “terminals of integration” are more 
appropriate than “limits of integration”. When v =n an integer, the operators 
oD,” and > Df are, respectively, ordinary integration and differentiation. 

The idea of differentiation to an arbitrary order started in 1695 when L’H6pital 
asked Leibniz what would happen with d”y/dx” when n = 1/2. Subsequently, the 
topic started with the misnomer fractional calculus because v can be rational, 
irrational or complex. After a half century of controversy due to the lack of a 
precise definition of fractional derivatives, the matter was finally settled in 1886 by 
Laurent who developed (1) starting with Cauchy’s integral formula in the complex 
plane. 
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One of the applications of the fractional calculus is the simplification of the 
solution of certain integral equations as shown in [1]. Abel, in 1823, was the first to 
apply the fractional calculus in the solution of the integral equation 


ike —1)’"'f(1) dt =k, 


where k is a constant, which arises in the formulation of the tautochrone 
(isochrone) problem and other physical problems [2]. 
The solution of the integral equation 


fla) +af "K(x, t) f(t) dt = g(x) 


was originally attained by Volterra in 1896. When K(x, ft) is a difference kernel of 
the form K(x — t), the standard technique for solving the above when c = 0 is the 
use of Laplace transforms provided that the Laplace transform (one-sided) of the 
functions f(x), K(x), and g(x) exist. Volterra’s method which stemmed from 
the work of J. C. F. Sturm (1836) was attained as the limiting form of a set of 
algebraic equations in which differences of equally spaced points on (c, x) were 
used. 

This paper has a dual purpose. One purpose is to exemplify the power and 
elegance of the Riemann-Liouville operators in the solution of a Volterra type 
equation of the form 


1 x 
f(x) + Ti — 1)’ 'f(t) dt = g(x), (2) 


where g(x) is a known function and where f(0) and f’(0) are assumed to be finite. 
The exponent on the kernel can be arbitrary with the exception that vy —1#a 
negative integer. 

A second purpose of this note is to introduce a method of solving (2) which is 
conceptually different than that of Volterra, Fredholm, and Laplace transforms. 
This method consists of repeatedly applying R-L operators of appropriate order 
until (2) is transformed into an ordinary differential equation. This transformed 
equation is called the rational equivalent. The nonhomogeneous term g(x), as a 
result of repeated operations, gives rise to generalized integrals and derivatives. 
Thus, g(x) is in a wide class of functions such that D*%g(x) is calculable. A table 
of integrals and derivatives to an arbitrary order is given in [3]. As an example to 
illustrate the procedure, we will specify v = 2/3 and g(x) = x’. 

Eg. (2) is written in operator form: 


f(x) +oDeef(x) =x’. (3) 
Operate on (3) with D7™'”? and D!/°. Omitting subscripts on D for conve- 
nience, we get the two equations 


D7'7f(x) + D7'f(x) = D'x? (4) 
and 
D'°f(x) + DN? f(x) = D'3x?. (5) 
Operate on (4) with D*/? which yields 
Df(x) + D'/°f(x) = Dx?. (6) 
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In (6) substitute for D'/°f(x) which is obtained from (5) getting 
Df(x) — D7'f(x) + D!°x? = Dx?. (7) 
In (7) substitute for D~'/*f(x) from Eq. (4) and we will have 


Df(x) — (—D7'f(x) + D7'x?) + Dx? = Dx?, (8) 
or 
Df(x) + D7'f(x) = Dx? — D'°x? + D'NAx?. (9) 
Finally, operate on each term above with D: 
D*f(x) + f(x) = D?x* — D*x? + D*/?x? 
which we write as 


f"(x) + f(x) =2 — D*7x? + D?Px?. (10) 


In developing Eq. (10) which is the rational equivalent of (2), it is important to 
note the use of the index laws for integration and differentiation to an arbitrary 
order. In the Riemann-Liouville fractional calculus the index law for integration to 
an arbitrary order 


oDy?(oDy f(x)) =D, ?* F(x) 


generally holds true; however, the index law for differentiation to an arbitrary 
order 


oD? (oDif(*)) =oDP*if(x) 


holds only for certain p, g, and f(x), [2]. This is one of the subtle perils in dealing 
with fractional operators. It is worth examining the operation we did earlier with 
D*/. We assumed D*/7(.D7!/3x7) = Dx? = 2x. 

The above result can be readily verified by first computing 5D, '/°x*. From the 
list of formulas for integration and differentiation to an arbitrary order [3], we have 
the formula for integration to an arbitrary order 


D-’x?4 Ma + 1) atv R 0 
ove * “Tlatve+l) e(v) = 0, az>Q0. 
l'(3) 
D-'3y2 = 1/3 
ove * T1073) ~ 
l'(3) 


D712 = 7/3 


ove  * F073)” 

For differentiation to an arbitrary order we have the formula 
(a +1) 

[(a—-v+1)" 

| r(3) | r(3) (10/3) 


o Dix? an Re(v) = 0, a> 0. 


o”"\FG0/3), | Fd0/3) (7/3) — (4/3) 41)" (11) 
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We compute the second and third terms of the right side of (10) with the above 
formula (11). 


4/3,2 _ x23 = K,x27 12 
D2/3x2 = 4/3 — K y4/3. 13 

The differential equation (10) can now be written as 
y ty =2-—K,x772 4+ Kyx”. (14) 


The solution to (14) is the complementary solution 


C, sin x + Cc, COs x 


plus the particular solutions. The particular solution for 2 is clearly 2. To obtain y,, 
for —K,x’/° the method of variation of parameters is used. The Wronskian 


sin x COS Xx 


= —1. 
cosx —sinx 


W(sin x,cos x) = 


Identifying y, = sin x and y, = cos x, we have 


—y, f(x —(cos x)(—K,x?”° 
2 ( ) _ ( )( 1 ) _ —K,x’/> cos x, 


Wy =] 
and 
u, = -K, fx? cos x dx. 
us = Tt) = (noe) = K,x’” sin x, 
and 


Xx 
U5 = K,f x?/? sin x dx. 
0 


Thus, the particular solution for —K,x°/? is 


Xx Xx 
—K, sin.x [ x*/> cos xdx + K, cos x [ x°*/? sin x dx. 
0 0 


Similarly, we find the particular solution for K,x*/° to be 


Xx Xx 
K, sin x [ x*/? cos xdx — K, cos x | x4/> sin x dx. 
0 0 


The solution to (14) is thus 


Xx 
y =f(x) =c,sinx +c, cosx +2 +4 sin xf (K,x47 — K,x?/3)cos x dx 


+ COs xf (K, x? — K,x4/9)sin x dex. (15) 
0 
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We can determine the constants c, and c, from the original integral equation 
which we rewrite for convenience: 


f(x) + fe f(t) dt = 2? (16) 
P'(2/3) Jo ) 
We note in the above that f(0) = 0. Thus, we take c, = —2 to meet this condition. 
Thus, the solution is 
f(x) =c,sinx —2cosx+2+ sinx foo +cosx fo , (17) 
0 0 


To determine c, we take the derivative of the original integral equation (16) 
and determine f’(0): 


| d 1 4, 
FO) + ETay I-09 f(t) dt = 2x. (18) 


Leibniz’s rule for differentiating an integral with respect to a parameter is 


b(x) b(x) of(t, x) dt 
ain Ts a= fo Ox 


+ f[b(x), x]b\(x) — fla(x), xJa(x). 
The derivative of the integral in (18) is then 
1 px 
— a (x —1) °° f(x) dx + f[x,x](1) — 0. 
For f'(0) Eq. (18) is 


f'(0) = 0+ | lim fy) — 0. (19) 
tx (xXx —t)'” x=0 


Passing to the limit and then letting x = 0 yields the indeterminate form 0/0 
because f(0) = 0. The application of L’Hépital’s rule yields 


lim tim ~3(x — ako) 


x>OLt 
obtaining 
f'(0) =0- (0) =0. (20) 


To determine the constant c, in (17), we take the derivative of (17) and set 
x = 0 which gives 


f'(0) =, cos0. 


Thus c, = 0 and the solution is 
Xx 
f(x) =2 -—2cos x + sin x | (K,x4/3 — K,x?/3)cos x dx 
0 
x ° 
+ cos x | (K,x? — K,x*/3)sin x dx, (21) 
0 


where K, = 2/T(5/3) and K, = 2/T(7/3). 
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To get a good feel for the efficiency of the method of fractional operators, we 
can compare it to the method of Laplace transforms. Our original integral 
equation is 


1 x 
—-1/3 2 
+= | (x -1 t) dt =x. 
FX) + Tay LOO 
From the theory of convolution integrals we can write 


| —-1/3 __ 2 
L{f(x)} + Taya)" /*« f(x)} = L{x*}, 


~1/3y _2 
F(s) + Taya) o* OY L{f(x)} = 3: 
Then 
1 1(2/3) 2 
F(s) + (2/3) ; as FS) = 33° 
2 sr 2 


MS) = SFB P+ 


The inverse Laplace transform of F(s) is f(x). However, the inverse of the 
function on the right above requires a substantial effort to obtain. This effort 
supports the assertion that the method of fractional operators is indeed an 


efficient one. 
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Relations Between w(n) and O(n) 


ARMEL MERCIER 
Département de mathématiques, Université du Québec a Chicoutimi, 
Chicoutimi, QC, Canada, G7H 2B1 


For a positive integer n, let w(n) and O(n) denote, respectively, the number of 
distinct prime factors and the total number of prime factors of n. That is, 
w(1) = O(1) = 0 and if n = pf! --- p*& then w(n) =r and O(n) =e, + --: +8,. 
Relations between w(n) and Q(n) are not easy to obtain directly from their 
definitions. However, many relations can be deduced with the help of Euler 
products. For example, we will show that for every real t we have the identity 


potmn)—w(m) _ > pe" /O(1 — tO (1) 
d\(m, n) 


where (m,n) is the gcd of m and n. 
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To prove (1), let m = pf! --: pf, a; > 1, the p, being distinct prime numbers. 
Then for each positive integer n and each j (1 <j <r) there exists a unique 
integer 7; such that 

n=p'--: pM, (M,m) =1. 


If f is multiplicative and if /f(n) is absolutely convergent, then by Theorem 
11.6 of [1], we have 


~ f(mn) = Yo f(peth--+ patio) 
n=1 i,>0 
(m.M)=1 
= II (f(p) +f(p')+-°°) Ye fw) 
p ym M= 


1 
(M,m)=1 


fp) + f(pe eo) 2 
tim +f | 


where p*||m means p“|m and p**' +m, provided no factor 1 + f(p) + f(p”) 
+--+ vanishes. 


Consequently, when f(n) = t°°?n~* and &s > 1 (As is the real part of s), we 
can write 


1 1 
ore) poumn) ve) potk) 1 + s + Fs + 
—_— > rem [] Pp P 
~, Ww 7, k* im t t 
n= k=l P 1+ a + sy + 
P p~ 
_ poem) ee 
a ee a a 
1+— 
P 
k=1 k* d|m d* 


Now we put kd = n and equate coefficients of n~* to obtain 


pocmn) — por) y pon / AY _ ty 
d\(m,n) 
which is equivalent to (1). 


One can obtain other interesting nonobvious relations from (1). For example if 
we put m = n, we obtain 


1= ¥ 10/494 — 1), for all t, 
d|n 


and differentiating termwise we get another relation. 
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Using the Euler product we can get many other identities. For example, for any 
positive integer m and for |t| < 2°, Bs > 1, we easily obtain for all ¢, 


d|n d\(m,n) 
if we consider the expansion of the product 


(1 _ oom 


(ore) pormn) 


yy 
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Gear Trains and Continued Fractions 


JEROLD MATHEWS 
Department of Mathematics, Iowa State University, Ames, IA 50011 


Introduction. Gears have been used to change the rate or direction of rotary 
motion for-at least 5,000 years. In the 27th century B.c. the Chinese used a gear 
train to connect the steering mechanism of a chariot to the base of a mounted 
statuette with outstretched arm, so that the arm would remain south-pointing as 
the chariot moved. A common gear train is shown in the figure, where an input 
axle with angular speed of w revolutions per minute (rpm) drives a gear with n, 
teeth, which drives a second gear with n, teeth, ..., which, finally, drives an output 
axle at W rpm. 
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The input axle drives the first gear at n,w teeth/minute, which is transmitted to 
the second gear. Letting the angular speed of the intermediate axle be s, we have 
n,s =n,w. The third gear is driven at n,s teeth/minute and the fourth at 
n,s =n,W. From these relations we have n,jW =n 3s =n,(n,w/n,), or 

VW nyjn; 


= (1) 


Ww Aan, 


If the number vn of teeth on a gear must be between 20 and 120, then to find a 
gear train that transforms an input rpm w to an output rpm W as closely as 
possible we must 


Find 1,,n,,3,n, © {20,...,120} 


Wo Ayns] . _ (2) 
so that |— — | is a minimum. 
wo N, 
If nj,n,13,n4 € {20,...,120}, then x = (n,n3)/(n,n,) is a feasible solution. 


A solution of (2) is called optimal. Since there are at most 101* feasible solutions, 
(2) has an optimal solution. The bounds 20 and 120 give practicable limits on the 
number of teeth that can be cut on a gear blank. 

Piper [6] gave in 1953 an algorithm for solving (2). His method is based upon 
elementary properties of continued fractions. More recently Orthwein [4] pre- 
sented a somewhat different algorithm, also based on continued fractions. His 
algorithm does not appear to give an optimum ratio in all cases. We describe 
Piper’s algorithm here, using notation and results given in Hardy and Wright’s 
book An Introduction to the Theory of Numbers (1). 


Simple Continued Fractions. If we divide 826 by 773 we find that 
826 53 1 


m3) 73 1 TTB (3) 
53 
Continuing the division, 773/53 = 14 + 31/53 and hence 
826 1 
773 =1+ se (4) 
53 
31 


Repeating this process until a remainder of 0 is found, we obtain 


826 ; 1 (5) 
—_ = | + — ——_—_—__. 
773 1 


14 + 
ns 
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If at each step we compute the value of the fraction up through the most recently 
obtained quotient we find 


1,14+1/14,1+1/(14 + 1/1),14+1/(14+1/(1 +1/1)),... 


Or 


1 15 16 31 78 187 826 
Du is? a0 WS (6) 


These numbers are called convergents to the continued fraction (5). The quotients 
1,14,1,1,2,2,4 are called partial quotients. 

More generally, if x is any positive rational we may obtain a continued fraction 
equal to x through the division algorithm: 


1 
a, +¢§ 
a, + a, +> 
1 
+———7- 
ay-~+— 
NI 
The partial quotients are ay, a,,..., @,; the convergents are py/qy) = @o, P;/q, = 
(a,;a) + 1)/a,,..., Pn/Gn =X. The continued fraction (7) is often written as 
[Ao A,,45,..., ay]. 
THEOREM A. The convergents Po/qo, P1/1>-++>Pn/Gn are related to the partial 
quotients Ay, a,,...,@y through the relations 
Po = 45 Pi = 449 + 1, Pa = 8nPn—-1 + Pa-2> 2<n<n, (8) 
do = I, qi — 41, Gn = 4ndn-1 + q,-2> 2<n <N, (9) 
and 
[ 49, 41,47,...,4,] = Py/dy, O<n<N (10) 
THEOREM B. The convergents and partial quotients satisfy 
x = [9,4,,4,,..., Ay] = [49, 4,,49,---5 An 1,1 nr Anais---r@y]] (11) 
and 
n—-1 
Pn9n-1 — Pna-19n = (-1) for 1 SNS N, (12) 
and : 
[Gps On sio++ +s An] Dn_-1 + Pr 
yw fom Batty Ont Pn-1 * Pn-2 for 2<n<N. (13) 
[a,, Ant irers an \4n-1 + Gn-2 
Letting x, = p,/qG, denote the nth convergent, then 
Xp SXQ St SKS ttt SG Ky. (14) 


All of the inequalities in (14) are strict except the one between x and the last 
convergent, which is an equality. The convergents to x have no common factors. 
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THEOREM C. If N>1and1<n<WN —1, then 
5 1 1 
x= < <>, (15) 
Gn Gn In+1 Gn 


and, ifn >1,0<q<q,, and p/q # P,,/q,, then 


(16) 


n 


which shows that p,,/q,, is, among all fractions whose denominator does not exceed q,,, 
the fraction which provides the best approximation to x. 


PIPER’s ALGORITHM. Piper’s algorithm is based on (13). Using the example given 
in (3)-(6) we have 
826 [a4,45,a4g])p3 +p.  (22/9)31 + 16 


= St 8 17 
* 773 [a4.45,ag]q3+q, (22/9)29 + 15 (17) 


where [a4, 45, 4g] = [2,2,4] = 22/9. The value of the expression |a4, a5, a¢] is 2 + 
1/(2 + 1/4), one of the denominators of the continued fraction (5). Piper replaces 
[a4, 4s, Ag] = 22/9 by fractions c/d “close” to 22/9, calculates the value X of the 
new continued fraction through the relation 


_ (c/d) ps + Po 
(c/d)q3 + qo 
factors the numerator and denominator of X into their prime factors, and, if these 
factors can be arranged as in (2), calculates the error |X — x|. He considers only 


fractions c/d satisfying the rule cq, + dq, < 120+ 120 = 14,400. Piper states that this 
procedure will find the optimal solution of (2). 


We give a few preliminary results before describing Piper’s algorithm more 
carefully. Let x be a positive rational and x), x,,..., x, the convergents of x. Let 
I, be the open interval with endpoints x,_, and x,_,, 2 <n < N, and consider 
the function 

(1D, —1 + Pn-2 


f(t) = —————_ t> 0. (18) 


1G, —1 + Gn—2 


If we compute f/(t) and use (12) we obtain 
fit) — (Pn-19n-2 = Py—24n-1)/(Gn-1 + Qn—2)” — (-1)""-*/(1q,- + Qn-2) » 


from which it follows that f, is'either strictly increasing or decreasing on (0, ). 
Hence the range of f, is equal to J,. If an optimal solution x* lies in J, then 
there is a unique rational ¢* € (0,0) for which f,(t*) = x*. Writing t* = c* /d*, 
where c* and d* are relatively prime, we show that c*p,_,+d*p,_, and 
c*q,-, + a*q,,_> are relatively prime. Setting 


u* = C™Dn-1 + a*D,—2 ye = C"An-1 + a*q,_2 
and using (12) again we obtain 


zc = UNG,» — V"Pn—2 +d* = UG, 1 — V"Py-1- 
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From the last two equations it follows that if an integer d divides u* and v*, then 
d divides c* and d*. Hence u* and v* are relatively prime. 

The optimal x* = u* /v* can be written as (n,n3)/(n,n,). Since u* and v* are 
relatively prime and n, < 120, it follows that u*, v* < 1207 = 14,400. 

Let x* denote an optimal solution. We have shown that if x* € J,, where 
2<n<N, then there are unique, relatively prime integers c* and d* for which 
* * 

y* = —Pn=! 4 Pn 2 (19) 
C’On-1 + a" an -2 
where c*p,,_, + d*p,_> and c*q,_, + d*q,,_> are relatively prime and not greater 
than 1207. If neither J, contains a feasible solution nor x,_, is feasible, then we 
seek a feasible solution in [,,_,. Since it follows from (8) and (9) that 
f(t) = fr-l4n-1 + 1/t) (20) 
and, moreover, x,_, =f,,_,(a,_,), the search for c* /d* in the domain of f,_, can 
be restricted to the interval (0, a, _,). 

We can now describe Piper’s algorithm in detail. It is sufficient to assume that 
W/w <1. Uf W/w = 1, then (20 - 20)/(20- 20) is optimal. If W/w > 1, we 
interchange input and output.) 

Piper’s algorithm begins by computing the convergents of W/w. If the last 
convergent x = x, is feasible, then it is optimal. If x, is not feasible, consider the 
function f,, and the interval J,,. We search J, for feasible solutions by computing 
ratios fy(t) = f,y(c/d), where c and d are relatively prime integers, for which (21) 
holds. Since p, <q, for 0 <k <N, both constraints in (21) 


cp,-1 + dp, _» < 14,400 cq, _-1 + dd,_2 < 14,400 (21) 


are Satisfied if the second alone is satisfied. Let F,, be the set of feasible solutions 
in J, together with x,,_,, if the latter is feasible. Next consider the function fy_ 4. 
We see from (20) that the search of the domain of f,,_, may be restricted to the 
interval (0,a,,_,). Let E,_, be the set of feasible solutions in J,_, together with 
Xy_, if it is feasible. From (14) it follows that if there is a feasible solution x, in 
Fy at least as close to x as any solution in Ey_, — Fy, then x, is optimal. 
Otherwise, let Fy_, = Ey_, — Fy. 


Xn—-2 Xn-1 Ii 0 00 
<—_— tO 
\ 
r . 
Xn-2 Xn-1  Xy-3 fa-1 QO 4,_4 00 


This procedure is repeated until an optimal solution is found or, if F, = ©, an 
optimal solution is chosen by finding in (x,, %) the feasible ratio closest to x. 

Finally, we discuss three examples given by Piper [6], in which the number n of 
teeth on a gear satisfies 20 <n < 120, and one example from [4], in which 
18 <n < 200. The computations are based on a program using a computer algebra 
system. Piper used a table of ratios (contained in [5]) in his computation. 
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Example 1. For W/w = 8,639/10,000, the convergents are 0/1, 1/1, (3 - 2)/7, 
13/(5 - 3), 19/(11 - 2), (73 - 2)/(137), (179 - 5)/(37- 7+ 27), (11? - 24) /(83 - 33), 
and (163 - 53)/(5* - 24). In this example N = 8 and x, = (11? - 2*)/(83 - 33) = 
(44 - 44) /(83 - 27) is optimal. The error, |W/w — x,|, is 4.4 x 107°. The program 
found two other possible feasible ratios in (x,, x;). These correspond to c/d = 5/2 
and c/d = 2/3. The ratio for 2/3 is feasible but x, is closer to W/w. The high 
accuracy of these results is reasonable in view of Theorem C. 


Example 2. For W/w = 100,000 /348,592, the convergents are 0/1, 1/3, 2/7, 
(7 - 5)/(61 - 2), 107/373, (71 - 2)/(11 - 5 - 37), (509 - 3)/5,323, and (5° - 2) /21,787. 
Here N =7 and in (x,,x,;) the program found one possible feasible ratio, the 
optimal solution (63 - 31)/(92 - 74). The error is 7.4 x 107°. 


Example 3. For W/w = 7,348/10,000, the convergents are 0/1, 1/1, 2/3, 
3/(27), 11/(5 - 3), (5 - 5)/(7 - 2), (37 - 27) /(77), (19 - 7/181, (71 - 23)/773, and 
(167 - 11)/(5* - 27). Here N=9 and the number of feasible solutions to be 
factored and compared is, as Piper notes, larger than in Example 2. The program 
found 30 possible feasible solutions between x, and x7. The optimal solution is 
(79 - 41)/(76 - 58). The error is 3.6 x 107’. 


Example 4. For W/w = 100,000/517,220, the convergents are 0/1, 1/5, 5/ 
(13 - 2), (3 - 2)/31, 29/(5* - 3 - 2), 151/(71 + 11), 331/(107 - 2*), (241 - 2)/(277 - 
37), (251 - 37)/(127 - 23: 27), and (54 - 2°)/(2,351 - 11). Here N=9 and the 
optimal solution is (101 - 51)/(173 - 154) with error 1.6 x 107°. One other feasible 
solution was found in (x,, x7). This solution was not given in [4]. 


The author would like to thank the referee for his help. 
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Overdetermined Systems of Linear Equations 


GARETH WILLIAMS 
Department of Mathematics, Stetson University, DeLand, FL 32720 


It is well known that the coefficients in a least squares fit for a given set of data 
points are found by “solving” an overdetermined system of linear equations. The 
vector of coefficients can be found by applying the pseudoinverse of the matrix of 
coefficients. In this paper we explore an overdetermined system of linear equations 
to find an appropriate “best solution.” A geometrical interpretation of this 
solution is given. Much of the general theory of least squares fit is arrived at from 
a different point of view. 

Consider a system of linear equations AX = Y, where A is an m Xn matrix of 
rank n, with m > n. Such a system is said to be overdetermined. In general such a 
system will have no solution. Let us solve this system as “best we can.” A is not a 
square matrix. There is thus no possibility of multiplying both sides of the equation 
AX = Y by the inverse of A to get X. However we can modify the equation so 
that X can be isolated. Multiply both sides of the equation by A’ to get 


A'AX = A'Y. 
Note that A‘4 is a square n X n matrix. The rank of A’A is known to be the same 


as the rank of A, namely, n [1]. Thus (4‘A)7' exists. Multiply both sides of this 
equation by (A‘A)~' to isolate X. We get 


-1 
X=(A'A) AY. 
It is instructive to ask a class at this time whether we have arrived at a contradic- 
tion. We started with a system that might not have a solution and have apparently 
arrived at a solution! The discussion drives home the point that extraneous 


solutions can arise by multiplying both sides of a matrix equation by a matrix. 
Denote this specific vector X. 


¥ = |(4'4)"‘A'/y. 
The matrix (A‘A)~'A’ was arrived at by seeking to isolate X in the equation 
AX = Y. (A'A)'A! is called the pseudoinverse [2] of A, and is denoted A”. It is 
also known as the Moore-Penrose inverse [3] of A. X is called the least squares 
solution [2] of the system AX = Y. 

X is not a solution to the system AX = Y in the standard sense that it satisfies 
each equation of the system. Let us determine its significance. Consider the 
equation which defines X, namely, 

A'AX = A'Y. 
Use the properties of matrices to rewrite this equation as follows. 
A‘AX — A'Y = 0 
A'(AX — Y) =0. 
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This equation implies that the vector (AX — Y) is orthogonal to the column space 
of A, which is also the range of A. AX is thus the projection of Y onto the range 
of A and X is the value of X such that ||AX — Y]| is a minimum. The system 
AX = Y has no solution in general; that is Y is not in the range of A. X is the 
best we can get; AX is the vector in the range of A that is closest to Y. The vector 
(AX — Y), is called the error vector since it gives the difference between AX and 
Y, and its length, || 4¥ — Yl, is called the error. 

We now derive a further geometric interpretation of X. Let AX = Y. We can 


write 
— _ 2 
IY -— Y|| = VLC, — y;) . 


X is a solution to the system AX = Y such that ||Y — Y]|, that is L(y — y)’, is a 
minimum. . 

Consider the parallel subsets corresponding to the ith equations in the two 
systems AX = Y and AX = Y. The distance d, between two such subsets is [4] 


iyi 
rn a ae 
V 2,4; 


X is a point in this subset of AX = Y, so d, is the distance from X to the subset 
corresponding to the ith equation of AX = Y. 
We now show that X is the vector, or point, which minimizes /L; d? . Normal- 
2 


ize the system AX = Y by dividing both sides of the ith equation by 2 ,a;,. The 


J 


d 


modified system is equivalent to the original system. In the new system //X ja; =] 
for each i. Note that this can be accomplished by multiplying both sides of 
AX = Y by the diagonal matrix D, where 


1 


d ;; , 
il az 


jt 


The system is now in the equivalent form DAX = DY. As one would expect, and 
want, X is invariant under this transformation since 


((DA)'DA) '(DA)'DY = (A'DDA) | A'DDY 
= (D*4'A) 'D*A'Y 
= (A'‘A) '(D?) 'D?A'Y 
= (A'A) AY. 
In the normalized system the expression for d, is simpler: 


d; = \y; — y,| 


WY — Y|| = /Xd?. 


Since the least squares solution, X, is the vector or point which minimizes ||Y — Y|| 
we get the following result. 


and yields 
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The least squares solution to an overdetermined system of linear equations is the point 
such that the sum of the squares of the distances from the point to each of the subsets 
defined by the linear equations is a minimum. 


Thus the least squares solution to an overdetermined system is not a solution in 
the sense that it necessarily satisfies every equation of the system. It is a solution in 
the sense that it is the value of X that comes closest to satisfying all the equations 
in the above geometrical way. | 

We illustrate this result for an overdetermined system of three equations in two 
variables in the figure below. The least squares solution X to the system corre- 
sponding to lines 1, 2, and 3 is the point such that d? + dj + d? is a minimum. 


It is interesting to note the duality between the standard least squares curve 
discussion and this least squares point discussion. The points in the standard least 
Squares curve discussion correspond to the subsets here and the least squares 
curve corresponds to the least squares point of this discussion. 
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When Quasinormal Implies Normal 


DEAN HICKERSON, SHERMAN STEIN, and KENYA YAMAOKA 
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In [7] Ore introduced the concept of a quasinormal subgroup of a group, a 
generalization of a normal subgroup. We will develop a condition, expressed in 
terms of the index, that ensures that a quasinormal subgroup is normal. The 
arguments suggest a variety of exercises for a course in group theory or Galois 
theory. 


0. Definition of quasinormal subgroups. Let H be a subgroup of a group G. If 
gH = Hg for all g € G, then H is a normal subgroup of G. It follows that for all 
subgroups J of G, 

JH = HJ. (*) 


(For subsets A, B C G, AB = {ab:a € A,b © B}.) We will call a subgroup H of 
G that satisfies (*) for all subgroups J of G a quasinormal subgroup of G. If H is 
a subgroup of G, then the following conditions are equivalent. 


(0) A is quasinormal in G. 

(1) JH = HJ for every cyclic subgroup J of G. 

(2) For every g © G and h EH, there exist r€ Z and h' €H such that 
hg = g'h'. 

(3) For every subgroup J of G, HJ = (H,J), the subgroup spanned by H 
and J. 


If G is finite, then these are also equivalent to 
(4) For every subgroup J of G,[(H, J): H) =[J: HJ]. 


That (0) and (1) are equivalent follows from the fact that every subgroup of G is 
a union of cyclic groups. Condition (2) is a translation of (1) in terms of elements; 
(3) depends on the fact that HJ = JH if and only if HJ is a group. Condition (4) 
follows from the equation |HJ| = |A||J|//H OJ|, which is valid for finite sub- 
groups Hand J. 

Quasinormal subgroups have been the subject of several investigations (see [3, 7, 
9, 11], and the Deskins and Venzke article in [12]). 


1. When quasinormal implies normal. In [7] Ore proved that a quasinormal 
subgroup of a symmetric group is normal. In a similar spirit we show that, if H is a 
quasinormal subgroup of G and [G:H] is a squarefree integer or twice a square- 
free integer, then H is a normal,subgroup of G. Incidentally, the density of such 
integers is 7/m* = 0.71. 


LEMMA 1.1. A quasinormal subgroup H of a group G such that |G:H_] is prime is 
a normal subgroup of G. 


Proof. Assume that H is not normal in G and let H’ = a~'Ha be a conjugate 
subgroup distinct from H. Let K = HH’ =H'H. Since [G:H] is prime and 
HcCKCG, K=G. In particular, a~' =h’h for some h' € H',h € H. That is, 
a~' =a 'h*ah for some h*,h € H. From this it follows that a © H, contradicting 
the assumption that a~'Ha # H. Thus H is normal in G. O 
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For the next lemma, we use the fact that, if G is a group and N CH CG are 
subgroups with N normal in G, then AH is quasinormal in G if and only if H/N is 
quasinormal in G/N. The proof is left to the reader. 


LemMMA 1.2. A quasinormal subgroup H of a group G such that [G:H] = 4 is a 
normal subgroup of G. 


Proof. Assume that H is not normal in G and define H’, a, and K as in the 
proof of Lemma 1.1. Since H C K CG and [G:H] = 4, it follows that K is H or 
G or else [K:H] = 2. As in the proof of Lemma 1.1, the first two cases cannot 
occur. Thus [K:H] = 2, and A is normal in K. It follows that K is the normalizer 
of H, and consequently there are exactly two conjugates of H, namely H and 4H’. 

Let N =H 8H". Then N = (\{g7'He: g € G}, so N is a normal subgroup of 
G. Moreover, since H is quasinormal in G, [K:H]=[H':N]=[H:N] and 
[H:N] is, therefore, 2. (See Fic. 1.) 


N=HOH 


Fic. 1 


We have G > HON, [G:H] = 4, [H:N] = 2, and N normal in G. Thus the 
group G/N has order 8, H/N is quasinormal in G/N and has index 4 in G/N. 
Examination of the five groups of order 8 shows that a quasinormal subgroup of 
index 4 in a group of order 8 is normal. (Three of these groups are abelian. The 
other two can be checked directly or with the aid of [10], which displays the lattices 
of subgroups of small-order groups.) 

Thus H/N is normal in G/N so H is normal in G, contradicting the initial 
assumption. From this it follows that H is normal in G. 

Or, as the referee pointed out, we may argue as follows. Let G = G/N and 
H = H/N = (hy. For each element @ € G consider (@, h). If this group is not G, 
then it is a group of order 2 or 4, hence abelian, and @ and h commute. Thus H is 
normal in G, hence H is normal in G. Assume, on the other hand, that 
(a, hy = G. Now, the order of @ is 2, 4, or 8. If @ has order 8, G is abelian. If 2 
has order 2, condition (2) implies that ha = ah, so that G is abelian. If @ has order 
4, (@) is a normal subgroup of G. Thus h~'ah = @ or @°. The first case implies 


516 DEAN HICKERSON, SHERMAN STEIN, AND KENYA YAMAOKA _ [June-July 


that G is abelian. In the second case we have ah = ha’, hence haha = @* = 1. 
Condition (2) applied to A and ha yields hha = hah. Hence ha = ah = ha? and 
a = a>, which is a contradiction. oO 


THEOREM 1.3. If H is a quasinormal subgroup of a group G and [G:H]=n isa 
squarefree integer or twice a squarefree integer, then H is a normal subgroup of G. 


Proof. We will argue by induction on n of the given type. If n = 1, the result is 
obvious. For n = 4 it follows from Lemma 1.2. So suppose that n is some other 
integer of the stated form, and assume that the claim holds for all smaller integers 
of the stated form. Write nm = pm, where p is prime and does not divide m. 

Suppose that H is quasinormal in G and [G:H] =n. Consider any element 
g<G. Since H is quasinormal in G,H(g) is a subgroup of G and H is 
quasinormal in H(g). If Hig) #G, then by the induction hypothesis, H is 
normal in H¢g), so Hg = gH. | 

If Hig) =G, then [H(g):H]=n. This implies that n is the least positive 
integer k such that g* @ H. Let x =g’? and y =g”™. Then the least positive 
integer k such that x* EH is n/p=m, so [H(x):H] =m. Similarly 
[H<y):H] = p. Since H is quasinormal in both H{x) and Hy), the inductive 
hypothesis shows that Hx = xH and Hy = yH. The fact that (p,m) = 1 implies 
that g € (x, y); hence Hg = gH. 0 


2. When quasinormal does not imply normal. For any positive integer m that is 
divisible by 8 or by the square of an odd prime, we will exhibit a finite group G and 
a quasinormal subgroup H such that [G:H] = m and H is not normal in G. The 
next lemma uses a group described in [2, p. 134; 4, p. 187]. 

In its proof we will make use of two properties of commutators. Given a group 
G and a,b &G, let [a, b] denote a~'b~'ab, the commutator of a and b. Then we 
have ([5: p. 253; 1, p. 26; 8, p. 177): 

(i) If [a, b] commutes with a, then [a”, b] = [a, b]" for any n € Z. 

(i) If [a, b] commutes with a and with b, then for any integer n > 0, 

(ab)" = a"b"[b, a. 

LEMMA 2.1. For an odd prime p and n > 3 or for p = 2 and n > 4 let G be the 
group of order p" with generators a and b and relations a?" ' =1, b? = 1, and 
ba = a'*?"’b. Let H = (b). Then [G:H] = p"~'! and H is quasinormal in G but 
not normal in G. 


Proof. Every element in G has a unique representation in the form a‘b/ with 
0<i<p"',0 <j <p. Since a*'ba =a” “b €H,H is not normal in G. 
To show H is quasinormal in G, we first note the following. Since 


P n—l 
—_ + —_ 
= gPtP’ = @P, 


ba?b~' = (bab ')" = (a'*P"”’) 
we have a”? € Z(G) and a” © Z(G). Since 
[b,a] =b-'(a7'ba) = b= 'a”" *b =a", 


we have [b, a], [a, b] € Z(G) (since [a, b] = [b, a] ’). (See [1, pp. 105, 107, and 
Exercise 8.2, p. 115].) So, for any i,j € Z, [b/,a'] =[b, a]? € Z(G) by (i) and 
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similarly [a‘, b’] € Z(G). Also 
[b,a]” = (ary = QP" =], 


Let g © G and h EH. Then g=a'b/ and h = b* for some i, j, k > 0. Let 
r= 1+ p"~’k. By Condition (2) in Sec. 0, it suffices to show that 


hg = gth. 
By (ii), 
re © 
g’ =(a'b!) =a"b"[bi,a'|. 
Note that 
| er ae | | 
a =a'(a”"’) =a'[b, a] =a'[b*, a‘] 
and that 


bit = pite’ 7ik = pi. 


Also the restrictions on p and n imply that p divides (5); and 


[b/, ay) = [b, ais) =] 

since [b, a]? = 1. Thus, g” = a‘[b*, a‘]b/. Consequently, 
gth = "bk = a'[b*, a'|bi** = aib*[ b*, a‘ |b! 
= a'b*(b-*a~‘b*a')b/ = b*a'b/ =hg. O 


LEMMA 2.2. Let H be a quasinormal subgroup of a finite group G. If (n, |G|) = 1, 
then H is quasinormal in the group G X C(n) where C(n) denotes the cyclic group of 
order n. 


Proof. Let k € G X C(n) and h € H. We will show that hk = k"h' for some 
integer r’ and fh’ € H. 

We have k = ga* for some g € G and integer s, where (a) = C(n). Since H is 
quasinormal in G, hg = g’h' for some integer r and h’ € H. Because (n, |G|) = 1, 
there is an integer r’ such that r’=r (mod|G]|) and r’ = 1 (modn). Hence 
hk = hga’ = g’h'a’ = g"h'ad’ = g’a’n' = g’a’h’ = (gay h’ = kh’, and His 
quasinormal in G x C(n). O 


The two lemmas imply the following theorem. 


THEOREM 2.3. Let m be a positive integer that is neither a squarefree number nor 
twice a squarefree number. Then there is a finite group G and a quasinormal subgroup 
H such that [G:H] = m and H is not a normal subgroup of G. 


3. Quasinormal field extensions. The notion of a quasinormal field extension 
can be introduced in a Galois-theory course as follows. 

Let F be a field and let S be a finite-dimensional, normal extension of F. Calla 
field E where F CE CS a quasinormal extension of F if for all fields K with 
FCKCS, . 

[EK: K])=[E: EK]. (* *) 


(EK denotes the field spanned by E and K.) It can be shown that the definition is 
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independent of the normal extension S. By the “natural irrationalities theorem” 
(see [6, p. 196]), if E is a normal extension of F, it is a quasinormal extension of F. 

Let G = G(S, F), the Galois group of S over F. Let H = G(S, E), the Galois 
group of S over E and let J=G(S,K). Then G(S,EK)=HQOQJ and 
G(S, EK) = (H, J). From (* *) we then have |J|/|H OA J| = |( H, J)|/|A|, for 
all subgroups J in G. By Condition (4) in Sec. 0, H is a quasinormal subgroup of 
G. Conversely, if H is a quasinormal subgroup of G, its fixed field is a quasinor- 
mal subfield of S. 

In conclusion, we wish to thank the referee for calling our attention to the 
earlier work on quasinormal subgroups and for simplifying some of the arguments. 


REFERENCES 


M. Aschbacher, Finite Group Theory, Cambridge, N.Y., 1986. 

W. Burnside, Theory of Groups of Finite Order, 2nd Edition, Dover, N.Y., 1911 or 1955. 
W. E. Deskins, Quasinormal subgroups of finite groups, Math Z., 82 (1963) 125-132. 

M. Hall, The Theory of Groups, Macmillan, N.Y., 1959. 

B. Huppert, Endliche Gruppen I, Springer, N.Y., 1967. 

S. Lang, Algebra, Addison-Wesley, Reading, Mass., 1965. 

O. Ore, Structures and group theory I, Duke Math. J., 3 (1937) 149-174. 

E. Schenkman, Group Theory, D. Van Nostrand, N.Y., 1965. 

S. E. Stonehewer, Permutable subgroups of infinite groups, Math Z., 125 (1972) 1-16. 

A. D. Thomas and G. V. Wood, Group Tables, Shiva Publishing, Orpington, England, 1980. 
J. G. Thompson, An example of core-free quasinormal subgroups of p-groups, Math Z., 96 (1967) 
225-227. 

M. Weinstein, editor, Between Nilpotent and Solvable, Polygonal Pub., Passaic, N.J., 1982. 


a 
POW ANNMAFWNHE 


— 
NS 


A Geometric Approach to l’Hopital’s Rule 


GIANLUCA GORNI 
Dipartimento di Matematica e Informatica, Universita di Udine, 
via Zanon 6, I-33100 Udine UD, Italy 


1. Introduction. Calculus textbooks usually give a few versions of L’Hopital’s 
rule for the evaluation of limits. The simplest case calls for hardly any proof at all: 
if both f and g vanish at 0 and are differentiable, with 2'(0) # 0, then 


f(x) - FO) 
f(x) x — 0 f'(0) 
Oe 
g(x) g(x)—8(0) — g'(0) 
x—0 
The matter becomes complicated when we assume that the functions are differen- 
tiable in a neighborhood of the limit point x, but not necessarily in the point itself, 
as in the case when x, = ». The standard proofs then become technical and fail to 


provide the reader with any geometrical insight. The approach we are going to 
present tries to shed some light onto this side of the subject. 


as x— 0. 
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The idea of considering the function f° g~' isn’t new, but we are not aware of 
any previous use of its geometrical meaning. We will also provide a hint to a 
parallel treatment of a “rule” for sequences. 


2. A first survey. Unlike the easy instances of f + g and f - g, the knowledge of 
lim, _,,,f(«) and lim, _, , g(x) doesn’t always per se determine the existence (let 
alone the value) of lim, _, ,, f(x)/g(x). The two nasty cases are when the two limits 
are either both zero or both © as x > xp. 

Why does this happen? In a cartesian t-u plane let us pick the point (g(x), f(x) 
for a value of x (FiGure 1). When g(x) # 0, the ratio f(x)/g(x) is the slope of 
the straight line joining the origin with (g(x), f(x)). If we only know that f(x) and 
g(x) approach zero as x — xy, then the point (g(x), f(x)) gets as close as we want 
to the origin but total freedom is granted to the slope of the straight line. Given 
any fancy behavior of the ratio function x — f(x)/g(x), we can easily figure out 
geometrically how to define f and g, both infinitesimal or both infinite, with 
exactly that ratio f/g. 


f(x) (g(x), f(x)) 


g(x) 


Fic. 1 


So, to know what f/g does in the limit, we may need to collect more 
information than the mere value of the limits of f and g. 

What kind of restriction on f and g might ensure that the ratio f/g tends to a 
limit A € R? Let us for a moment think of x as time and consider the “trajectory” 
of the moving point (g(x), f(x)). In the case 0/0, as a picture suggests, “f/g > A” 
seems to mean, roughly, that the straight line u = At is tangent at the origin to the 
trajectory (FiGuRE 2). The interpretation in the case »/© is less obvious. 


(g(x), f(x)) 


Fic. 2 


520 GIANLUCA GORNI [June-July 


Our problem is now to make precise the idea of “‘tangency” for a trajectory. 
The task is easy if the trajectory is itself the graph of a function t — u, because 
then the familiar machinery of ordinary calculus may be used. This happens, for 
example, whenever g is one-to-one. (We are not going to use the necessary and 
sufficient condition g(x,) = g(x,) = f(x,) = f(x,).) The trajectory is then simply 
the graph of h = f° g~'. But one-to-one is not enough. If we allow x > g(x) to 
shuffle the order, then the origin (case 0/0) may turn out to be surrounded by a 
swarm of points (g(x), f(x)) originated from x’s well apart from x,. In the passage 
from (g, f) to h, the notion of “x — x,” would be forsaken. This is not the case if 
g is monotone (lim, _,,)g° ‘(t) = x, would already suffice). 

Let us sum up the ideas given so far: if g is one-to-one and monotone, then the 
limit of f/g (in the case 0/0) is akin to h’(0), where the function h is defined as 
h =fog'. In the case /o we are led to the equivalence of lim , -, », f(4)/g() 
with lim, _,,,.2(t)/t. | 

A nitpicking detail (case 0/0): From the very start we may have supposed that 
g(x) # 0 if x is close enough to xp, just for (g(x), f(x)) to make sense. Then, 
strictly speaking, h is not defined at t = 0. However, when x > x,, f(x) goes to 
zero, and so we can set A(0) = 0 = lim, , h(t) = lim, _,)f(g7"()) = lim, _, ,, f(x). 

Before stating L’H6pital’s rule, we ask: What conditions can ensure the exis- 
tence of h(t) at least for t # 0? Since h = fo g™', it is natural to assume f and 
g ' differentiable; (g~')’ exists whenever g is defined in an interval, monotone, 
differentiable and g’(x) # 0. If all the conditions are met, we can compute 


1 / 
h=(feg') =(feg'")- (gy) =(fes')- — eo 7 : og t, 
of corey — £ 
h § ~~ g' ? 1.€., h (2( )) ~~ g'(x) ° (1) 


The last formula is not at all surprising if we just think of what happens to the 
point (g(x), f(x)) when x is given the increment dx (FiGurE 3). 


g(x) dx 


f(x) dx 


Fic. 3 


3. L’HOpital’s rule. Let J be a nonempty open interval in R, let x, © RU 
{—, +o} be one of its endpoints, and let f,g: J - R be two differentiable 
functions, with g strictly monotone and g'(x) #0 for all x €J. Suppose that 
either of the next two conditions is satisfied: 

Gi) f(x) > 0, g(x) ~ 0 as x > x;; 

(ii) g(x) ~ +2 as x > x, (no assumptions on f). 
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Then the following implication holds: 


, 


lim —(x) exists and equals A € RU {+o} > lim en exists and equals J. 
XX Z x>XxX9 § 
(2) 


Proof. There is no restriction in assuming g(x) > 0, by changing the sign of 
both f and g if necessary. Since g is continuous, J := g(J) is an interval, too, and 
since g is monotone, one of its endpoints is 0 in case (i) and + © in case (ii). The 
function h := fog! is defined on J and differentiable, with derivative h' = 
(f’/e'de gt. 

Let us first consider the easier case (i). We can extend by continuity the 
definition of h to h(0) = 0. For any x € J we can apply Lagrange’s mean value 
theorem to / on the interval [0, g(x)] (Ficure 4), 


h(g(x)) — h(0) 


ayn HCE). with € € (0, 8), (3) 
that is to say, f f 
gh? = PACs (€)). (4) 
g g(x) 
Fic. 4 


Moreover, g  ‘(€) lies between x and x,, because g is monotone (decreasing, in 
our assumption, but this doesn’t matter any more). So, if we know that f’(x)/g’(x) 
tends to A as x ~ X,), we have that the common value of the two sides of (4) is as 
close as we wish to A whenever x (and hence g™ ‘(€) too) is close enough to Xp. 
Remark: A in formula (2) comes out to be h’(0). 

Case (ii) needs more tricks, because we lack a notion of “derivative at infinity” 
h'(+0) = lim, _,,,.A()/t. 

Claiming that f(x)/g(x) ~ A © R (the case A = + is easier) means that for 
any choice of A, < A < A, we can be sure that the point (g(x), f(x)) lies between 
the two half-lines u = A,t and u = A,t, (t > 0) whenever x is sufficiently close 
tO Xo. 

How can we get to such a conclusion if we only know that f’/g’ — A? Well, let 
us choose A, < pw, <A <p, <A, and M © J such that 


(x between M and x) > uw, < E(x) < p>. (5) 
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g(M)m 2 


Fic. 5 


If we apply the mean value theorem to h on [g(M), g(x)] we see that the graph of 
h lies in the shaded region in Ficure 5‘for t > g(M). Because of the relations 
between the various half-lines, we can easily check visually (and/or formally) that 
for t > t := max{n,, n>}, the graph of h (ie., the point (g(x), f(x) for x > M) falls 
between u = A,t and u = A,¢ and that is what we aimed at. 


4. Extensions and counterexamples. We can somewhat relax the assumption 
g(x) #0. Suppose that g is still strictly monotone, g’(x) = 0 only at points 
isolated from each other (they may cluster against x,), f’(x) vanishes at those 
points too, and, finally, the function x > f’(x)/g'(x) has finite limits at all these 
points. Then f’/g’ can be extended by continuity to a function m: J — R, and it 
turns out that / is still differentiable with h' = mo g~'. What happens, in terms of 
trajectories, is that h is differentiable on /, but as the “time” x elapses, the point 
(g(x), f(x) may slow down, stop, and start smoothly again at certain instants. 
L’H6pital’s rule remains true if we replace f’/g’ with m in formula (2). 

We can apply such a generalization if f’(x) and g’(x) vanish at certain points 
because of a common factor that cancels out in f’/g’, leaving a harmless function 
m, which is continuous at least at those points. But we must check that g’(x) 
doesn’t change sign (i.e., g is monotone). 

Our proof breaks down miserably if we eliminate the hypothesis that g is strictly 
monotone, because there is no longer a reasonable h. But we may be relieved to 
know that formula (2) is hopeless without monotonicity. Consider FiGURE 6. We 
have drawn a trajectory that zigzags between two different half-lines, with horizon- 
tal cusps at the intersections (where the moving point is supposed to stop and start 
again smoothly in the reverse direction). The vertical span of the oscillation is 


FIG. 6 
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supposed to shrink both when the point approaches (0, 0) and infinity, in such a 
way that the slope of the trajectory tends to zero in both directions (things can be 
arranged in this fashion without much trouble). What happens analytically is that 
g(x) changes sign infinitely many times, f’(x) = g(x) = 0 at the cusps, but the 
zeros cancel out in the ratio f'(x)/g'(x), leaving a function that vanishes at the 
cusps and tends to 0 at both extremes of J. But still f/g has no limit, because it 
eternally oscillates between the slopes of the two straight lines. For explicit 
analytical examples see [1]. 


5. A rule for sequences. There is a theorem about the ratio of two sequences, 
sometimes referred to as Cesaro’s rule, that closely resembles L’H6pital’s rule. Let 
a, and b, be two sequences of real numbers, 5, being strictly monotone, and 
suppose that either (i) they are both infinitesimal, or (ii) b, is infinite. Then 


. Gn+1— 4&y . 
lim ————— exists and equals A € R U {4+} 


n> Biv — Uy 


ay, . 
= lim — exists and equals A. (6) 


7c 
n n 


The proof can be constructed along the same lines as for L’H6pital’s rule. The 
mean value theorem will be replaced by an argument involving induction. We leave 
the details to the reader. See also FiGurE 7. 


(by 415 On41) 


(b,, a, 


FIG. 7 
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ELEMENTARY PROBLEMS 


FE 3391. Proposed by David Doster, Choate Rosemary Hall, Wallingford, CT. 


Define a function on the positive integers by putting f(1) = 1 and setting 


flint =f(n)+2 f f(f(n)—nt lan 


f(n +1) =f(n) +1 otherwise. 


(i) Find and prove a simple explicit formula for f() involving the greatest 
integer function. 
(ii) Show that if f(f(m) —n + 1) #n, then f(f(m)-n4+ 1) =n +1. 


and 


E 3392. Proposed by Antal Bege, Miercurea-Ciuc, Romania. 


Given an acute-angled triangle ABC with orthocenter H, let A,, B,,C, be the 
feet of the altitudes from A, B,C, respectively, and let A,, B,,C, be the feet of 
the perpendiculars from H onto B,C,,C,A,, A,B,, respectively. Prove that 


area (A ABC) = 16 area (A A,B,C), 


and determine when equality holds. 


E 3393. Proposed by Bruce C. Berndt, University of Illinois at Urbana-Champaign. 


Define a function ¢ on (—1/e, ©) as follows. If —1/e <x < », determine the 
unique number ¢ in (1/e,~) such that x =tlogt and then put (x) = 
1/1 + log t). 

Show that 60) = (—k)* for k = 1,2,3,.... 


528 


1990] PROBLEMS AND SOLUTIONS 529 


FE 3394. Proposed by Mo Song-Qing, Institute of Applied Physics and Computa- 

tional Mathematics, Beijing, People’s Republic of China. 

Suppose 0 <a, <a, < ::: <a,. For 2 <k <n — 2 consider the inequalities 
Pe ee py ee Fk (1) 
j=l k j= k+1 

and | 


n k n 
» (4,4), ua Bysn—1)/ 2 » (4;4,4, i aj.) **”, (2) 
j=l j=1 
where we adopt the convention that a,,, = a ; 
(i) Prove that (1) and (2) hold for k = 2. 
(ii)* Prove or disprove (1) and (2) for 3 < k <n — 2. 


E 3395. Proposed by Hillel Gauchman, Eastern Illinois University, Charleston. 
Let A be the first orthant in n-dimensional Euclidean space E”, i.e., 
A = {x © E": x = (X,,X),..-,%X,),X, > 0,...,x, > O}. 


Let S be a k-dimensional subspace through the origin in FE”, where 1 <k < 
n — 1, and let S~ be the orthogonal complement to S through the origin. Prove 
that either S or S~ contains a point of A other than the origin. 


E 3396. Proposed by Bruno Haible (student), Karlsruhe, West Germany. 
Suppose we define a sequence {x,},-9 by putting x, = 1, x, = 1/2, and 
(n? + 1)x, = (2n*-—n)x,_,-(n’?-—n)x,_, forn =2,3,.... 


Prove that the sequence {x,} is bounded and determine lim inf x, and lim sup x,,. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Companion Incircles of a Triangle 


E 3236 [1987, 877]. Proposed by Noam D., Elkies, Harvard University, Cambridge, 
MA. 


For a plane triangle A,A,A3, call two circles within the triangle companion 
incircles if they are the incircles of two triangles formed by dividing A, A, A, into 
two smaller triangles by passing a line through one of the vertices and some point 
on the opposite side. 

(a) Show that any chain of circles C,,C,,... such that C,,C,,, are companion 
incircles for every n consists of at most six distinct circles. 

(b) Give a ruler and compass construction for the unique chain which has only 
three distinct circles. 


(c) For such a chain of three circles, show that the three subdividing lines are 
concurrent. 
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Solution by Jiro Fukuta, Aichi University, Aichi, Japan, expanded by the editors. 
Let r be the radius of the incircle of triangle A, A, A3, 5 be the semi-perimeter of 
A,A,A;, and r, be the radius of the nth circle. We may choose indices so that 
C,,C, are companion incircles generated by a line through A,, with C, in the 
triangle containing A,A, and C, in the triangle containing A,A,. Then, succes- 
sively, the circle C, is the incircle of two triangles containing the vertex A, .oq3- 
Thus we define A, by A, = A,, whenever n = m (mod 3). Let a, be the angle at 
A,, let a, be the length of the side opposite A,, and let h, be the altitude from 
A, We have a, =a, a, =a,,, and h, =h,. The area of triangle A,A,A, can 
be given by rs or by any of 5a,h,. 

In Theorem 2 of [1], Demir shows that successive r,s satisfy 

1 1 r 2 
— + — = . 
lh Piel Tne Ay +2 


Multiplying this by r and subtracting it from 1, we obtain 


| r | r | 1 2r 
ln n+l Aner 


Multiplying the corresponding equations for n and n+ 2 and dividing by the 
equation for n + 1 yields 


EY) P-as)h-acl/b- a} 


Since rs = a,h,,/2, we have 1 — 2r/h, = (s — a,)/s, so this equation becomes 


[" _ i}{ r _ 7 _ (s — Anyi )(S 7 An+2) _ tan2 a,,/2, (1) 
lh Pa+3 


s(s —a,) 
where the last equality is well known to students of elementary trigonometry. 


Proof of (a). Replace n by n+3 in (1). Since a,,,=a,, we have 
(r/r, — D/r,43 —- D= (1/143 — DUY/r,4, — 1). Because r,,; <r, we may 
cancel the common factor and obtain r, = r,,,, which suffices. 


Proof of (b). The hypothesis is equivalent to r,,, =7,, so that (1) reduces to 
r/r, = 1+ tana,/2. We can construct the desired circles if we can construct their 
centers C,, C,, C,, which lie on the respective bisectors of the angles at A,, A, A3. 
These angle bisectors are of course concurrent at J, the incenter of triangle 
A, A, A3-. 

Using n = 1, for example, we proceed to construct C, (see Fic. 1). From J, 
construct the perpendiculars to the three sides, denoting the point of intersection 
opposite A; by 7;. To construct C,, we construct the bisector of right angle A,T,]/; 
its intersection with A,J is C,. 

To verify that this is correct, let D, be constructed on 7; A, so that T,D, = IT; = 
r, and thus /7T,D, is an isosceles right triangle. Then drop a perpendicular from 
C, to A,T, at P,. Using the similarity of A,/JT, and A,C,P3, we have A,I/A,C, = 
IT;/C,P, =r/r,. We want r/r, =1+4+ tana,/2; the latter is 1+ /7T,/A,T; = 
(A,T, + T,1)/A,T,; = A,D3/A,T3, and triangles A,C,T; and A,ID, are similar 
because C,T, and I/D, are parallel. 
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P; T, B, Q; D, 7 


Fic. 1. Construction of unique 3 circles by ruler and compass. 


Proof of (c). Let p, = A;,,P,, and let the pairwise common tangent of the circles 
around C, and C,,, (which passes through A,,,) meet the side A,A;,, at B;. Let 
Q, denote the intersection of the circle about C,;_, with A;_,A;,,, and let R; 
denote the intersection of this circle with A,B,. 

The semiperimeter of triangle A,A,B, is s — p,, because A,B, = A,R, + 
R,B, = A,P, + B,Q,, and thus the perimeter of A, A, A, equals that of A,A,B, 
plus two segments of length p,, namely A, P, and A,Q,. By taking the ratio of the 
areas of A,A,B, and A,B,A, (and similarly for the rotations of the indices), we 
obtain 


A,B; _ ri(s — py) A,B, _ r(S — pp) A3B, _ r3(s — ps) 
B3A, r(s — pz) B,A; r3(s — pi) - B,A, ri(s — po) 
Multiplying the three equations yields A,B,-A,B,:°A,B, = B,A,°B,A,°B, Aj, 
whence Ceva’s Theorem implies that A,B,, A,B,, and A,B, are concurrent—at 

a “notable” point for a triangle, namely, the center of the inner soddy circle. 


Editorial comment. Readers may note the related but significantly different 
result known as Malfatti’s problem, which is discussed in [2] and [3]; at least one 
reader confused the two problems. The common tangents constructed in (c) are 
not the radical axes, and their point of concurrence is not the radical center. 


1. Hiiseyin Demir, Incircles within, Math. Magazine 59 (1986) 77-83. 

2. A. S. Winsor, Modern Higher Plane Geometry, Boston, 1941, pp. 206-208. 

3. C. J. A. Evelyn, G. B. Money-Coutts and J. A. Tyrrell, The Seven Circles 
Theorem, Section 3.3, Stacey International, London, 1974. 


Solved also by H. Demir (Turkey) and the proposer. 
A Vector Identity 


E 3239 [1987, 996]. Proposed by M. S. Klamkin, University of Alberta, Edmonton. 


Show that if A is any three-dimensional vector and B, C are unit vectors, then 
[(A + B) x (A+ C)| x (B x C)- (B+ C) =0. 
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Interpret the result as a property of spherical triangles. 
Solution by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. 
Starting from the formula 
(D x E) X F=(F- D)E - (F- E)D, 
we get 
[(A + B) x (A +C)] x [B x C] 
= |(B x C) - (A+ B)][A+ C] — |[(B x C)- (A+ C)|[A +B] 
= |(B x C) - A|[A + C] — [(B x C) - AJ[A + B] 
= |[(B x C) - A][C — B], 
whence the result follows, since 
(C—B):-(B+C)=C:-C-—-B:-B=1-1=0. 


Editorial comment. No two solvers gave the same “interpretation of the result as 
a property of spherical triangles.” The proposer showed that the result is equiva- 
lent to the assertion that a great circle which bisects two sides of a spherical 
triangle intersects the third side 90° from its midpoint. His argument follows. 

First, if we have two great circles on a unit sphere determined by the two pairs 
of points B’,C’ and B,C, then the two antipodal points of intersection of these 
circles lie on the line / of intersection of the planes of B’,C’,O and B,C, O, where 
O is the center of the sphere. Then a vector V on / is given by V = (B’ x C’) X 
(B x C). (Here B denotes the vector from O to the point B, etc.) 

Second, let the respective midpoints of the sides of the spherical triangle ABC 
be A’, B’,C’. Then 


A’ = (B+ C)/B+C), Bo=(A+C)/JA+C, C=(A+B)/JA+B 
and V - A’ = 0. The equivalence now follows. 


Solved also by A. Brown (Australia), J. Ferrer (Spain), R.-A. Galella, R. H. Garstang, G. N. Lewis, 
J.-M. Monier (France), R. A. Simon (Chile), J. H. Steelman, and the proposer. 


Turning a Simple Ring into a Division Ring 


E 3285 [1988, 763]. Proposed by W. K. Nicholson, University of Calgary. 


Let R be a simple ring with unity element, i.e., a ring with unity element having 
no two-sided ideals other than itself and {0}. Let G be a multiplicative subgroup of 
the set of invertible elements of R. 

(a) Under the assumption that either x € G or 1 —x © G for every x ER, 
show that R is a division ring. 

(b) If, in addition to the assumption of (a), we assume that —1 © G, show that 
G =R — {0}. 


Solution by Virginia Commonwealth University Algebra Students, Richmond, VA. 

Presumably it is intended that we may assume 1 + 0. 

An element & of R is said to be invertible if there exists an element h of R 
such that hk = 1. 
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Lemma. Under the assumptions of (a), if h and k are elements of R such that 
hk = 1, then kh = 1. 


Proof. Let e = kh. Then e” = (kh\(kh) = k(hk)h = kh = e, so that (1 — ee = 
0 and e(1 — e) = 0. By assumption either e € G or 1 —e € G. If 1 —e €G, the 
equation (1 — e)e = 0 and the invertibility of 1 — e give e = 0. But e = 0 implies 
1 = hk = hkhk = hek = 0, which is impossible. Thus we must have e € G. But 
then the equality e(1 — e) = 0 and the invertibility of e give e = 1. Thus the 
lemma is proved (so that one-sided invertibility implies two-sided invertibility). 

(a) Let F denote the set of noninvertible elements of R; we show that F is an 
ideal. Since it must be a proper ideal, the hypothesis of simplicity implies that 
F = {0} and all nonzero elements are invertible (which makes R a division ring). 

We first show that F is an additive group. Clearly F contains 0 and is disjoint 
from G. Since the negative of an invertible element is invertible, F is closed under 
negation. It remains to show additive closure. If a,b © F but a+b é¢€F, then 
there exists c such that cla +b)=1, or ca=1-—cb. By the lemma, c is 
invertible. This implies ca, cb © F, for if cx is invertible, then x is also invertible. 
Now cb € F implies 1 — cb € G, but then ca = 1 — cb contradicts FN G = ©. 

Finally, we show RF C F. We have already seen that an invertible element times 
a noninvertible element is not invertible, so assume a,b © F. By assumption, we 
have 1 —a © G and hence 1 — a is invertible, so b — ab = (1 — a)b € F. This 
expresses ab as the difference of two elements of F, which by the preceding 
paragraph implies ab < F. Similarly FR C F. 

(b) It suffices to show that G contains every invertible element. Let a be an 
invertible element not in G; we also have a~' ¢ G. The assumption of (a) then 
implies that 1 — a,1 —a~',(1 — a7~')"' &G. Since 
a=a(1—a')(1—-a—') =(a-1)(1-a™') =(-1)(1-a)(1-a"!) , 
the assumption —1 € G now implies a € G, which is a contradiction. Thus G 
does contain every invertible element. 


Editorial comment. Several readers submitted solutions to (a) using the Jacobson 
radical. They noted that the assumption implies that x or 1 — x is invertible, for all 
x € R. By Proposition 15.15 of Rings and Categories of Modules, by F. W. Anderson 
and K. R. Fuller, this implies that R/J(R) is a division ring, where J(R) denotes 
the Jacobson radical of R. Since R is simple, this implies that R itself is a division 
ring. J. L. Ansorena and S. Kao noted that it is sufficient to assume only that R is 
semi-simple, in the sense of Jacobson. J. McCarron submitted a proof using the 
Chevalley-Jacobson Density Thedrem. R. J. Chapman and J. H. Steelman reduced 
(a) to (b) by showing that if the’ assumption of (a) holds for any multiplicative 
subgroup, then it holds for any larger subgroup, such as the group of all units. 
J. D. O’Neill observed that if R is the ring of rational numbers and G is the set of 
positive rationals, then the hypothesis of (a) holds but the additional hypothesis of 
(b) does not; however, he showed that if char (R) > 0, then the hypothesis of (a) 
implies the additional hypothesis of (b). 

Solved also by J. L. Ansorena (student, Spain), S. Beslin, C. Brueningsen & M. Leeney, D. Callan, 
R. Carroll (student), R. J. Chapman (United Kingdom), F. J. Flanigan, A. A. Jagers (The Netherlands), 


S. Kao, H. K. Krishnapriyan, C. Lanski, O. P. Lossers (The Netherlands), R. Martin (West Germany), J. 
McCarron (Canada), P. Morandi, J. D. O’Neill, J. H. Steelman, G. P. Wene, and the proposer. 
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A Quadratic Diophantine Equation in Six Unknowns 


E 3304 [1989, 55]. Proposed by Dante Vialetto, Olgiate Olona, Italy. 


Prove that there are infinitely many solutions of 


ax? + by* = cz? 


in integers a, b,c, x, y, Zz with 
a<x<b<c<x<y<z, 


and obtain a family of solutions with as many independent parameters as possible. 


Solution by C. C. Oursler, Southern Illinois University at Edwardsville. With 
integer parameters r,s,t,m,n,l,w,v, we obtain a solution a =/1*w, b = m’w, 
c =n’w, x =mnn, y = Insv, z = lmty, by imposing the following constraints: 

(a) r<s<t is a primitive Pythagorean triple, (b) m > max{s/(t — s), 
s/(s — r)}, (c) n and / are chosen so that rm/s <1 <m<n <tm/s, (d) w > 0, 
and (e) v > nw/(mr). 

From r2 + s2 = ¢? follows /2(mnr)* + m?(Ins)? = n?(Imt)*, which has the re- 
quired form with a = 1°, b =m’, c =n’, x = mnr, y = Ins, z = Imt. To satisfy the 
required inequalities, we need /<m <n and also mnr < Ins < Imt. The last 
inequality is equivalent to n < mt/s, and we require m <n. Thus, once the 
integer m is chosen, a suitable integer n can be found if the length of the interval 
[m, mt/s] exceeds 1, that is, if m >s/(t —s). Similarly, after the integer m is 
chosen, a suitable integer / can be found with mr/s <1<m,if m>s/(s —r). 

We can incorporate a factor of w into each of a, b,c and a factor of v into each 
of x, y, z. The requirement c < x is satisfied if v > nw/(mr). 

The above family of solutions has five positive integral parameters, which 
appear to be independent. Thus, using the familiar parameterization of 
Pythagorean Triples, we can express r,s,t in terms of two positive integral 
parameters p and gq, where p is odd and (q, p) = 1, by putting r = min{(p + q)” 
— 97,4 p+q)q}, s = max{(p + gq) —q?,Apt+aqq}, t=(p+q)+q?. The 
only restriction on the parameter m is a lower bound and thus m can be expressed 
in terms of a positive integral parameter. The parameter w is an arbitrary positive 
integer, while the only restriction on the parameter v is a lower bound and so v 
can be expressed in terms of a positive integral parameter. 


Editorial comment. A referee provided the one-parameter family of solutions 
a=u2, b=(u+1)*, c=(ut 2), x = 3(u + Iu + 2), y = 4u(u + 2), z= 
5u(u + 1), where u is an integer greater than 3. This is easily seen to be a special 
case of the above solution. 


Solved also by Serge Gaignoux. 
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ADVANCED PROBLEMS 


6634. Proposed by R. W. Zeamer, University College, London, England. 


Let S be the set of real numbers of the form 
> Ee, /n}, 
n=2 
where ¢, = 0 or 1. Let K be the field generated by S. Is it true that K = R? 


6635. Proposed by J. Michael Steele, Princeton University, Princeton, NJ. 


Suppose 6,,0,,03,...are independent random variables each uniformly dis- 
tributed on [0,27]. Let 0,.,,6,.5,..-., 9,., denote the order statistics for {0,:1 < 
i <n}, ie. 0,.1,0,..,---,9,., are the numbers 0,,6,,..., 6, arranged so that 
6.1 <6,..5< *+' <6,.,. Adopting the convention that 6,.,,, = 6,.,, we put 

n 


Ss, = > {cos | + COS 6,,.;} {sin On i+] — sin 0, i} 


x il 


» sin( 6, . 544 7 On ids 
i=] 


so that S, is twice the area of the polygon with vertices 


(cos @,.;,8in@,.;), 1 <i <n. 


Show that 
(0 <S,<S,<S,<S, << 27, and 
Gi) S$, > 27 as n — ©, with probability one. 


6636. Proposed by F. Schmidt, Bryn Mawr College, Bryn Mawr, PA. 


If G is a finite group, let f(G) be the average order of an element of G, i.e., 
f(G) = » ord(g)/ord(G), 
geG 


where ord(g) is the order of the group element g and ord(G) is the number of 
elements in G. 

(i) Show that if G is a nilpotent group of order n, then f(G) < f(Z,,), where Z,, 
is the cyclic group of order n. 

(ii)* Is it true that if G is any group of order n, then f(G) < f(Z,,)? 


SOLUTIONS OF ADVANCED PROBLEMS 
Points on the Euler Line 


6557 [1987, 884]. Proposed by Clark Kimberling, University of Evansville. 


Let I’ denote the circumcircle of a triangle ABC. Let A’ be the point, other 
than A, where the A-median of ABC meets I. Let A” be the point, other than A, 
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where the A-altitude of ABC meets I. Similarly define B’,C’, and B",C". Let 
DEF be the tangential triangle of ABC (e.g., D is the point where the line tangent 
to T at B meets the line tangent to I at C). Define (as usual) the Euler line of 
ABC to be the line containing the orthocenter, the circumcenter, and the centroid 
of ABC. Prove that the lines DA’, EB’, FC’ and the lines DA", EB", FC" concur in 
points that lie on the Euler line of ABC. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Let O be the center and R the radius of I. Let M denote the circumcenter of 
A DEF and QQ the circumcircle of this triangle. We now introduce some further 
notation, and some facts of advanced Euclidean geometry. 


1. If U, V and W are in this order the midpoints of BC, CA and AB we have 
OU X OD = OV X OE = OW x OF = R?. 


It follows that the inversion (O, R) transforms 2 into the nine-point-circle 
v of A ABC. The center of v is therefore on OM. Hence the Euler line of 
A ABC coincides with OM. 

2. Let G and N be the points of Gergonne and Nagel, respectively, of a 
triangle 7. A well-known theorem states that the isogonal conjugates of G 
and N with respect to 7 are the internal and the external center of 
similitude of the incircle and the circumcircle of 7. 


We are now ready to prove the result. Since the symmedians of a triangle pass 
through the poles of the sides with respect to the circumcircle, the lines AD, BE 
and CF have the point L of Lemoine in A ABC in common. With respect to 
A ABC these lines are isogonal conjugates of AA’, BB’, and CC’ respectively. 
Hence they are isogonal conjugates of DA’, EB' and FC’ in a DEF. It follows that 
DA', EB' and FC" are concurrent in a point S, the isogonal conjugate of L with 
respect to A DEF. 

Since L is the Gergonne-point of A DEF, the point S is (by 2) the internal 
center of similitude of [ and 2 and as such on OM, that is (by 1) on the Euler-line 
of A ABC. 

Let A,, B, and C, be the diametral points on [ of A, B and C respectively. 
Let X be the point of contact of EF with the excircle [, of A DEF having its 
center on DO. Similarly define Y and Z. Now AA” and AA, are isogonal 
conjugates in A ABC. It follows that DA” and DA, are isogonal conjugates in 
A DEF. Also, D is the external center of similitude of [ and I’,. Thus D, A, and 
X are collinear. Hence DA” and DX are isogonal conjugates with respect to 
A DEF. This is also true of EB" and EY as well as FC” and FZ. The lines DX, 
EY and FZ are concurrent, and’their common point N is Nagel’s point in A DEF. 
We conclude (by 2) that DA”, EB" and FC” meet at the external center of 
similitude 7 of I and © and thus on the Euler line of A ABC. 


Editorial Comment. L. Kuipers (Switzerland) and the proposer both provided 
somewhat less erudite solutions based on the mechanism of “homogeneous trilin- 
ear coordinates.” For example, the centroid has coordinates (a~', b~', c~') where 
a, b, and c designate the sides of ABC. It turns out that DA’, EB’, and FC’ concur 
at 


(a{—a* + b* + c*}, b{a* — b* +. c4}, c{a* + b4 — c*}) 
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while DA”, EB", FC” concur at 
(a{-—a'+b' +c}, b{a’ — b'+c},cla' + b' —c'}) 
cos2A cos2B cos2C 


cos A’ cosB’ cosC 


rw 


where a’ = sin’ 2A, etc. 


Solved also by M. Orlowski and M. Pachter (South Africa), Martine Pages (France), Admiral Jose 
Yusty Pita (Spain), Raul A. Simon (Chile), an anonymous contributor, and the proposer. 


The Rediscovery of Susan N. Brown’s Integral 


6572* [1988, 461]. Proposed by David L. Book, Naval Research Laboratory, 
Washington, DC. | 


Show that 


oO 


—q(t) dt 
i {l—e } aya = 7 


where 


] AC] ds 
t)=— + st)——.. 
q(t) = =f In(1 + st) 


This result appeared in an article in Zhurnal Eksperimental’noi i Teoretecheskoi 
Fiziki, 90 (1986) 330 by S. E. Esipov and I. B. Levinson, who stated that they had 
obtained it numerically and verified it to fourteen significant figures. (At the 
request of the editors, the Symbolic Computation Group of the University of 
Waterloo has verified the result to over forty significant figures.) 


Solution I by Joseph Oesterlé, Université de Paris VI, France. The function g has a 
holomorphic continuation to the domain U = C — R ; it is defined by the integral 
formula 

inc ia U (1) 
f)=—- n( li + st )——> a 
q(t) = =f In(1 + st)--—5 (4 U) 


where In(re’’) = Inr + i0 if —7 <6 <7. By differentiation under the integral 
sign one shows that the derivative of q is given by 


] t Int y 5 
(t) =— {|= - — t . 
W)=Fral5-—) ew (2) 
The function q satisfies the functional equation 
1 1 
a(t) =a{ 7] +5 Ine (teU). (3) 


This follows from the fact that both sides of (3) have the same derivative on U and 
the same value at t = 1. 

It is clear from (1) that lim, _, 9 ,. q(t) = 0. Combining this result with (2), one 
deduces that 


lg(t)| < —|t|In |t| for t © U, |t| small enough. (4) 
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From (3) and (4) it follows that 
et ~ Vt (tf EU,|t| > &) (5) 


where Vre® = yre®” if -7 <6 <7. 
The function gq satisfies a second functional equation: 


q(t) + q(—-t) =InQi - it) Cmr> 0), (6) 


valid only for t with positive imaginary part. This follows from the fact that both 
sides of (6) have the same derivative on the upper half plane, and the same limit at 
t= 0. 

To solve the problem, we shall integrate ¢-3/*{1 — e%} dt on the contour 
represented in the figure (which depends on parameters 7, <¢, R), then take the 


imaginary part of the result and successively let 7 — 0, e ~ 0, R— ©. Let us 
denote by A,, for 1 <i < 4, the limit of the contribution made by the part of the 
contour denoted by (i) in the figure. These values are given by: 

A, = 0, by formula (4); 

A, = 0, because we have taken the imaginary part of the integral; 

A, = —17, by formula (5); 


1 — e9@+i0") © [1—(1+it)e™@ 
" (¢+i0*)” r= | 'm —it?/? 


00 1 — eat) 


= | ya 


by formula (6). Cauchy’s theorem implies A, + A, + A; + A, = 0, which yields 
the result. 
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Solution II by J. Boersma, Eindhoven University of Technology, The Netherlands. 
From the derivative 


00 sds log t t 
q(t) = = [ ooo st 
(1 + st)(1 4+ s*) w(1+t-) 2(1 + t°) 
we deduce for g(t) the alternative representation 
; log s 1 
t)=-— q los + t* 
q(t) = — fs a og(1 + #7). 
By introducing the notation 
1 ~ logs 
t)=—-— d 
m(t) a, 1+s° ” 


the integral under consideration can be rewritten as 
00 em) dt 
r= [\1- ———, |-S 
1/4 3/2° 
0 (1422) | 07 


The evaluation of the latter integral is contained in the solution of the integral 
equation 


f(x) = Qr)7! f (losix —i)f(t)dt +x, (0<x<«) 


as presented by Brown [2] and Boersma [3]. Indeed, by combining formulas (36a) 
and (3) of [3] and by using the integral equation with x — 0, one has (in Boersma’s 
notation) 


I= —(27)'7iF(0) = —(27) 71” f (og) f(0) dt 
= —(2ar)'/ lim | f(+) — 7/2], 


The value of the latter limit is found from formulas (7.1) and (7.5a) of [2] to be 
—(a/2)'/*. Thus I = 7. 


Editorial comments. Brown’s original evaluation uses the Weiner-Hopf tech- 
nique, and was done in response to a challenge by Prof. K. Stewartson. The 
problem arose from the study of a particular case of the Navier-Stokes equation 
(see [1]). 

The solutions provided by Paul R. Chernoff, J. Koekoek (The Netherlands), 
William A. Newcomb, and Tomas Schonbek were similar in nature to Oesterlé’s 
H. K. Kuiken (The Netherlands) gave a solution quite similar to Boersma’s. Kim 
MclInturff observed that the integral is lim,,_,,, Sy, where 


and 
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MclInturff showed that J, = V2 7 = 4.4429 and (by a much lengthier calculation) 
that 
2 


1, = 4V2G + y27in2 - ~ 4.7716 
where G is Catalan’s constant. The average of S, and S, is 3.24997..., which is 
slightly closer to 7 than is 3. This does not ‘solve the problem, but leads us to 
wonder whether S,, converges to 7 as rapidly as the Nth partial sum of the 
Leibniz series, and whether it is amenable to various convergence acceleration 
techniques. 


REFERENCES 


1. A.I. Van de Vooren and A. E. P. Veldman, Incompressible viscous flow near the leading edge of a 
flat plate admitting slip, J. Engrg. Math. 9 (1975) 235-249. 

2. S. N. Brown, On an integral equation of viscous flow theory, J. Engrg. Math., 11 (1977) 219-226. 

3. J. Boersma, Note on an integral equation of viscous flow theory, J. Engrg. Math., 12 (1978) 
237-243. 


Functions Leaving a Distribution Unchanged 


6573 [1988, 461]. Proposed by Gérard Letac, Université Paul Sabatier, Toulouse, 
France. 


Suppose X is a real random variable which 1s normally distributed with mean 0 
and variance 1. Suppose f and g are functions from R to R such that, for each b in 
R, the random variable f_X + b) — g(b) is also normally distributed with mean 0 
and variance 1. Prove that there exists « in {—1,1} and c in R such that 
f(x) = ex + c almost everywhere. 


Editorial remark. The published version of this problem erroneously had FCX + 
b) in place of f(X + b). 


Solution by Stephen Portnoy, University of Illinois, Urbana. It is given that 
X ~ Normal(0, 1), 


where “~ ” means “is distributed as.” If there exist real functions f and g such 
that 


f(X + b) ~X + g(b) 
for all real b, we can conclude that f(x) = «x + c. In fact, this conclusion holds 
whenever X has a distribution with a density f, which is sufficiently smooth. 


THEOREM. Suppose X has a distribution with a finite variance and a density fy 
which is strictly positive, continuous, and bounded. Suppose that there are real 
functions f and g such that for all b, 


f(X +b) ~X+4+g(b). (1) 
Then f(x) = + x + c almost everywhere (for some c). 


Remarks. (1) If the density function fy is not smooth, the conclusion need not 
hold. For example, let X ~ Uniform(0, 1) and let f be periodic with period 1 and 
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with f(x) =x for x € [0,1). Then f(X¥ + b) ~ X for any b. In fact, this will also 
hold if f is periodic with period + and with f(x) = 2x for x € [0, 5). 

(2) The proof below is elementary for the most part and based on a straight-for- 
ward “change of variable” calculation. There are a few measure theoretic techni- 
calities, but the basic idea given in parts (i) and (ii) below is quite simple. 


Proof. (i) First assume that f is continuously differentiable. The first step is to 
compute the density of f(X + 5) and show that f is in fact monotonic. To 
compute the cumulative distribution function for f(X + 5), let U = {x: f’(x) # O}. 
Since f(X) has a density (from (1)), US has measure zero. Also, U is a countable 
union of intervals which may be taken to be maximal: if (a,b) C U define 
a* = inf{a’ < a: (a’, b) C U} and similarly for b*. Then f'(a*) = f’(b*) = 0 and f 
is monotonic on (a*, b*). Given real y, let x; be such that either f(x;) = y or x; is 
one of the endpoints of a maximal interval. Let f;' be a local inverse for f on an 
interval containing y. Now compute the cumulative distribution function for 
f(X + b) (that is, P{f(X + b) < y}) by considering the contribution from each 
maximal interval (a*,b*) CU. If x, =f; Cy) € (a*, b*), the contribution is 
Fy (x; — b)— Fy(a* — b) if f(x + 5) is increasing on (a*, b*), and is Fy(b* — b) 
— Fy(x; — b) if f(x + b) is decreasing on (a*, b*). Here Fy is the distribution 
function of X. Otherwise, the contribution is zero if f(x + b) > y on (a*, b*), and 
is F,(b* — b) — Fy(a* — b) if f(x + b) <y on (a*, b*). Therefore, by (1) 


LeFy (x; — 5) = Fy(y — g(b)) (2) 


where x, =f, ‘(y), and «, is zero, one, or minus one as described above. Upon 
differentiating (2), we find that the density satisfies 


d fx(x; — 6) 
~4 — b)|—f7! = ) ——— = —g(b)). 
Lill (y) 6) al | » F(x ely 8(4)) 
Now if f were not monotonic, there would be y* which would be a local maximum 
for f; that is, for some i, f’(x*) = 0. Letting y > y* (and x; > x**), the left-hand 
side above would tend to + while the right-hand side remains bounded. This 
contradiction shows that f must be monotonic. 
(ii) It will now be shown that the conclusion of the result holds if f is 
monotonic. From (2), for any x and b, 


Fy(x —b) = Fy(f(x) —g(b)) or Fy(z) = Fy( f(z + 5) — 8(4)) 
for any z and b. Differentiating with respect to b, we obtain 
O=fy(f(z + b) — g(b) {f(z + b) — 8'(b)}. 


Hence, f(z + b) = g’(b) for all z and b. Thus (choose b = 0) f’ is constant and f 
is linear. 

(iii) Finally, we show that f can be taken continuously differentiable. Since 
E|X| < +0, formula (1) with b = 0 yields 


IFC) f(x) de < +0. 


Thus, f is integrable on any interval. Now, for any u and b, f(X +u+b)~X + 
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g(u + b) (from (1)). Hence, 
[f(X + u + b)d(u) du~X+ [g(u + b)d(u) du, (3) 


where @ is any density. In particular, take @ to have compact support and 
uniformly bounded, continuous second derivative. Define 


f(x) = [f(xt+u)d(u) du, R(x) = fg(x+u)d(u)du. (4) 


Then, by dominated convergence, 


f(x) = [f(v)o"(v - x) dy, 


which is finite. Thus f is continuously differentiable. Also, from (3), f(X + b) ~ 
X + 2(b). Therefore, from parts (i) and (ii), f is linear. It remains to show that this 
implies f is also linear. 

Choose $(u) = 6,(u) = u(t — u)’ for 0 <u <t (and zero otherwise). By (4) 
(and dominated convergence), we have for all x and ¢ that 


(x) = [Oflu 6"(u ~ x) du 


_ fr 
= i f(u){20? — 12t(u — x) + 12(u —x)"} du 
Differentiation with respect to t yields 
(20?) f(x +t) = f° ‘f(u){4t — 12(u —x)} du ace. (5) 


Since f is integrable on intervals, the right-hand side of (5) is continuous. So f can 
be taken continuous everywhere (by perhaps redefining it on a set of measure 
zero). Since f is continuous, the right-hand side of (5) is continuously differen- 
tiable. Thus, differentiation of (5) with respect to x shows that f is twice 
continuously differentiable. Therefore, for any density @ with compact support 
and uniformly bounded, continuous second derivative, integration by parts gives 


0=f"(x) = ff'(x + u)o(u) du. 


As a consequence, f"(x + u) = 0 for all u. Hence, f is linear; that is, f(x) = ax +c. 
Finally, to show that a = +1, note that (1) implies Var fLX + b) = Var X, or 
Var X = Var(aX +c +b) = a’*Var X. Thus, a* = 1 and the result follows. 


Paul R. Chernoff provided a nice solution of the problem, under the original hypotheses, by 
complex variable techniques. The proposer gave a proof using moment generating functions and 
covariance matrices. Two incomplete solutions were received. 


Fluid Flowing on a Sphere 


6578 [1988, 666]. Proposed by S. T. Stefanov, Sofia, Bulgaria. 


A fluid flows on a 2-sphere S$ with continuous velocity vector v(x, t). Let T(x, t) 
be the temperature of the fluid at the point x © S at the time ¢t. Assume that 
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T(x,0) < T, and T(x, 1) > T, for all x € S and some 7,. Prove that there exist 
Xy € S and t, € (0, 1) such that T(x, ty) = Ty and V(x, ty) = 0. 


Solution by John A. Frohliger, St. Norbert College, De Pere, Wisconsin. We 
assume, of course, that the temperature function T(x, ft) is continuous. Without 
loss of generality, S is the sphere in R° with center 0 and radius 1. Assume that 
for every ¢ in (0, 1) and every x in S either T(x, t) # Ty or V(x, t) # 0. Identifying 
a point x € § with its position vector, we observe that whenever v(x, t) # 0 the 
vectors x and V(x,t) are orthogonal and hence linearly independent. Define a 
function F such that 


F:S x [0,1] -S 
by 
(T, — T(x, t))x + (t — t*)¥(x, ft) 
I(T) — T(x, t))x + (t — t7)0(x, tf) 


Then F is a homotopy from F(x, 0) =x, the identity map on S, to F(x, 1) = —x, 
the antipodal map. This is impossible, of course, since the degree of the identity 
map is 1, the degree of the antipodal map is —1, and homotopic maps from S$ to $ 
must have the same degree. Therefore, there must exist some x, in S and some fy 
in (0, 1) such that V(x), ty) = 0 and T(x, ty) = Th. 


Editorial comment. For the terminology and basic results used in the above 
solution, see E. H. Spanier, Algebraic Topology, McGraw-Hill, New York, 1966, 
pp. 175, 195-196. Peter Landweber remarks that the argument of the above 
solution “applies to any even-dimensional compact smooth hypersurface M*” ¢ 
R*"*! with nonzero Euler characteristic y(M), since y(M) is twice the degree of 
the Gauss map M2" > $?".” 


Solved also by John Bryant, W. Hensgen (Germany), Peter Landweber, John Henry Steelman, and 
the proposer. 


F(x,t) = 
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Perspectives of Nonlinear Dynamics, vol. 1. By E E. Atlee Jackson, Cambridge Univer- 
sity Press, 1989, xix + 495 pp. 


JOHN M. GUCKENHEIMER 
Department of Mathematics, Cornell University, Ithaca, NY 14853 


“Chaos” has been a trendy subject almost from the time that Li and Yorke used 
the word to describe complicated behavior in deterministic dynamical systems. In 
addition to books that try to explain the subject in non-technical terms, there have 
been a growing list that seek to introduce a scientific audience to the subject of 
nonlinear dynamics (as physicists call it) or dynamical systems theory (the mathe- 
matician’s term) that provides the context within which chaotic phenomena are 
studied. This book is one of these. Its title gives away the background of its author. 
As a mathematician, I find the book frustrating. Most mathematicians will feel 
much more comfortable approaching the subject through ordinary differential 
equations texts with a modern flavor or through Devaney’s book, An Introduction 
to Chaotic Dynamical Systems. Despite this attitude on my part, I think that the 
book deserves a closer examination. This is a time when the mathematical 
community seems to be examining closely what we teach to undergraduates and 
why. There is something about our own perspectives to be gained from taking a 
look at this book and comparing our scientific attitudes with those of our 
colleagues in physics. 

The book begins with a “brief historical outline” that is a listing of scientific and 
mathematical achievements that the author finds notable, but the dates are 
imprecise and the choice is truly eclectic. When I ask myself about the purpose of 
this outline and the discussion of the term “‘nonlinear” that follows it, I come away 
baffled about the intended audience for this book. Clearly, it is not intended as a 
book for an audience lacking technical mathematical skills, and the discussion 
assumes too much knowledge and uses too much terminology to be accessible to 
students. My conclusion is that the author wrote the book for himself as a way to 
organize his thoughts. He invested substantial effort in learning about the subject 
and has given us a record of his understanding. 

Technically, the book leaves, a lot to be desired. It is not at all a mathematics 
text or systematic treatise. Subjects are discussed but not fully developed. Defini- 
tions are often incomplete, either leaving the reader to ponder the meaning of 
unfamiliar concepts that are used but not discussed or glossing over important 
points. The definitions of dimension on page 59 and structural stability on page 
102 are examples. The book is maddening with its emphasis upon placing things in 
a historical perspective and then not getting the history quite right. The bibliogra- 
phy is over thirty pages, but my attempts to find the entries associated to 
references in the book had a poor success rate. 

Despite its flaws, this book prompts difficult questions. The author states his 
objective in the preface as giving an introduction to the subject that is accessible to 


544 


1990] REVIEWS 545 


the nonmathematician without losing much precision. Why does he feel compelled 
to do so? This is the question on which I would like to dwell. Mathematics strives 
for complete consistency and total rigor. In the process, it has created a world that 
is difficult for people without mathematical training to approach. Moreover, the 
attempts of the mathematical community to explain ideas and phenomena to a 
broader audience seem half-hearted and not very effective. It is left to others to 
write books like this one or another half dozen books with similar goals. As a 
result, substantial mathematical issues get either distorted or omitted in the 
presentation of the subject matter. Must it be this way? Are mathematicians 
singularly incapable of bringing mathematics to a broader public (including under- 
graduate students)? 

By raising these questions in a rhetorical manner, it is evident that I think that 
we can do a better job of presenting mathematics to a wider audience. How should 
we begin? One step that we can take is to recognize the importance to mathemat- 
ics of clear exposition that explains mathematical problems and ideas. There are 
historical examples (Courant and Robbins, What is Mathematics and Hilbert and 
Cohn-Vossen, Geometry and the Imagination are two I grew up with), but there are 
few recent examples of projects that have been undertaken with the intent of 
introducing the excitement and wonder of current mathematical discoveries to 
others. Gleick’s book Chaos is written from a decidely physicist’s viewpoint, yet a 
prominent representative of the mathematics community in Washington com- 
mented to me that Gleick’s book has been one of the best things that has 
happened to promote mathematics to the government in decades. It seems to me 
that the simplest explanation as to why little effort is made to develop such 
exposition is that we give little recognition or reward to those who spend their time 
in such endeavors. In our attempts to reform calculus and the parts of the 
undergraduate curriculum that are not directed primarily at mathematics majors, 
we might also seek to motivate ourselves to bring the spirit of modern mathematics 
to our students in the process. In adjusting elementary courses to the background 
of students while standards in elementary and high school mathematics have 
fallen, we have failed to recognize that a single course or two will seldom overcome 
a decade of inadequate preparation. 

Dynamical systems theory is a good example to examine in this regard. It is a 
subject that has always grown within an environment in which input from the 
physical world has played a large role. The development of many ideas has been 
prompted by phenomena first observed in experiments and computational studies. 
These phenomena can be described along with conceptual approaches to place 
them in a theoretical context. There is an appealing philosophical aspect to the 
subject in the way in which it illuminates our understanding of randomness and 
determinism. Discussions of these matters take us out of the realm of proving 
theorems, but working with these theories in a reliable fashion does require 
sensitivity to their mathematical structure. For example, predictions based upon 
the behavior of “typical” systems sometimes encounter the question as to whether 
behavior that occurs for values of a variable in a Cantor set of positive measure is 
to be regarded as typical. We have not seized the opportunity to bring this material 
before the public in an imaginative manner. Somehow, it seems to me that the 
prevailing attitudes of the mathematical community see such efforts as slightly 
demeaning. 
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What is our role as mathematicians in promoting the use of dynamical systems 
theory? If we insist upon narrowly circumscribed boundaries for our professional 
activities, then other scientists fill the gap that is left between mathematics and its 
use in the sciences. Their approach may clash with our mathematical values and 
intellectual taste. In particular, many scientists have the attitude that “truth” 
emerges more from successive refinements of ideas with some degree of validity 
than from logical reasoning. The corollary of this attitude appears to be a casual 
regard for accuracy and completeness in presenting mathematical concepts. One of 
the most compelling aspects of mathematics for me is the sense of inevitability that 
comes when I understand something deeply. I find satisfaction in being able to 
look at a phenomenon and say to myself that ‘“‘it must be this way” because I can 
follow a chain of reasoning that starts at first principles and ends with a descrip- 
tion of the phenomenon. I don’t think that our colleagues in other sciences share 
these attitudes fully. This difference of values is evident to me when I approach 
the literature on chaos. The subject is one in which there has been a real interplay 
between closely reasoned mathematical theory and physical experiments, but the 
thrill of scientific discovery is usually given the dominant role and the underlying 
mathematics fades into the background. The beauty and value of the deeper 
understanding that comes from rigorous mathematical argument are largely forgot- 
ten. 

In my opinion, mathematics would benefit if more attention were directed to 
the connections of mathematics with the real world and if we recognized and 
rewarded work that plays a significant role in making mathematical ideas available 
to broad audiences. We might then find books that present enlightened perspec- 
tives on modern mathematics that don’t make the subject look like an impenetra- 
ble forest of jargon to outsiders. 
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General, S*, P**, L**. The Emperor’s New 
Mind: Concerning Computers, Minds, and The Laws 
of Physics. Roger Penrose. Oxford University Pr, 
1989, xiii + 466 pp, $24.95. [ISBN: 0-19-851973-7] 
A tour de force of modern science (quantum theory, 
computability, dynamical systems, brain physiology, 
quasicrystals) designed to demolish the foundation 
of “strong” artificial intelligence—that the brain is 
merely a computer—by an avalanche of evidence 
that consciousness is intrinsically non-algorithmic. 
Penrose bases much of his argument on a modern 
mathematical Platonism—relying in part on his own 
rich experience as an inventive mathematician—from 
which he deduces that mathematicians’ judgment of 
mathematical truths are also non-algorithmic. In- 
novative, unconventional, iconoclastic, imaginative, 
and irresistible. LAS 


General, S, P, L*. Boojums Allthe Way Through: 
Communicating Science in a Prosaic Age. N. David 
Mermin. Cambridge University Pr, 1990, xxi + 
308 pp, $16.95 (P). (ISBN: 0-521-38880-5] A col- 
lection of witty and lucid essays for lay persons 
about physics and mathematics, mostly reprinted 
from Physics Today. “All the essays ...are informed 
by my desire to cut through the atmosphere of ver- 
bal dreariness with which scientists seem unneces- 
sarily to have surrounded themselves.” Essays range 
over quantum and relativity theories, the practice of 
physics, and “mathematical musings.” A warm, hu- 
man, and humane view of physics. (A “boojum” is a 
term from Lewis Carroll that the author deliberately 
introduced into the technical vocabulary of physics. ) 
LAS , 


General, S*, P*, L**. Does God Play Dice? The 
Mathematics of Chaos. Ian Stewart. Penguin Books, 
1989, 317 pp, £6.99 (P). [ISBN: 0-14-012501-9] A 
mathematically vigorous, popular treatment of chaos 
theory written with relentless energy and pun-filled 
wit. Using rich historical anecdotes and informative 
mathematical connections, Stewart traces a tale from 
Laplace’s celestial determinism to Einstein’s scepti- 
cism about quantum theory, suggesting strongly that 
some day we may see the unpredictability of quan- 


tum events as mere quantum chaos. LAS 


General, P. Quels Mathematiciens Pour L’an 
2000? |’Association Mathématiques A Venir (26 rue 
Dumeril, 75013 Paris), 1989, (P). A glossy four-color 
promotional brochure intended to highlight the im- 
portance of mathematics to France and the chal- 
lenge “pour garantir un flux sufficant de scientifiques 
compétents.” Based on data from 50 lycées gathered 
as a sequel to a December 1987 colloquium on this 
subject at l’Ecole Polytechnique. LAS 


General, P*, L. A Challenge of Numbers: People 
in the Mathematical Sciences. Bernard L. Madison, 
Therese A. Hart. National Academy Pr, 1990, xv + 
119 pp, $9.95 (P). [ISBN: 0-309-04190-2] The first 
report of the National Academy of Sciences Project 
MS 2000 containing a detailed analysis of human re- 
sources in the mathematical sciences. 45 graphs and 
68 data tables document the role of mathematics in 
higher education and in the work place. Sample find- 
ings: the rate of growth in mathematically based oc- 
cupations is about twice that for all occupations; en- 
rollments in mathematical science courses have dou- 
bled in the last twenty years, but the increases have 
all been at the lower levels; attrition from the math- 
ematical sciences is approximately half per year after 
the ninth grade; and the expected future supply of 
faculty will be insufficient to replace retirees. LAS 


Elementary, S(13). Theory and Problems of 
Trigonometry, Second Edition with Calculator-Based 
Solutions. Frank Ayres, Robert E. Moyer. Schaum’s 
Outline Series. McGraw-Hill, 1990, 224 pp, $9.95 
(P). (ISBN: 0-07-002659-9] Revision of a 1954 re- 
view manual adding calculator use to complement 
logarithmic calculations. “The work with logarithms 
is entirely optional.” Major focus is on solving trian- 
gles to calculate specific numerical values of angles, 
area, and lengths. LAS 


Mathematics Appreciation, T(13-16), S*, 
L**, Journey Through Genius: The Great The- 
orems of Mathematics. William Dunham. Wiley, 
1990, xiii + 300 pp, $19.95. [ISBN: 0-471-50030- 
5] Twelve masterpieces of mathematics from Hip- 
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pocrates’ quadrature of the lune to Cantor’s “great 
theorem” on non-denumerability, each set in histori- 
cal context and explained in modern language (“like 
playing Mozart on modern instruments”). Intended 
for popular audiences, it offers superb source ma- 
terial for mathematics appreciation courses. Em- 
phasizes biography, history, creativity, and proofs in 
carefully crafted vignettes—what Isaac Asimov calls 
“mathematics by lightning flash.” LAS 


Mathematics Appreciation, S* L**. Game, Set, 
and Math: Enigmas and Conundrums. Ian Stewart. 
Basil Blackwell, 1989, viii + 191 pp, $19.95. [ISBN: 
0-631-17114-2] Twelve columns reprinted (in En- 
glish) from the “Mathematical Visions” column in 
Pour La Science, the French translation of Scientific 
American. Stewart’s column, written for the French 
edition, continues Gardner’s “Mathematical Games” 
featured in Sctentific American. Topics range widely 
from “Pascal’s Fractals” to the probability of win- 
ning in tennis; the mathematical level is more de- 
manding than Gardner’s columns, but the chatty, 
imaginative, playful style is similar. A worthy sequel 
to an unsurpassed master of popular mathematical 
writing. Too bad it only appears in the French edi- 
tion. LAS 


Mathematics Appreciation, S*, L*. Mathemat- 
tcal Magic Show. Martin Gardner. MAA, 1989, 302 
pp, $17.50 (P). (ISBN: 0-88385] An up-dated reprint 
of Gardner’s 1977 collection (TR, November 1977) 
of Scientific American columns, originally published 
by Alfred A. Knopf. Begins with more than you ever 
wanted to know about “Nothing” and ends with a 
complementary essay on “Everything.” In between 
are acrostics, card tricks, dragon curves, polyomi- 
noes, dice, and numerous other mathematical odd- 
ities. Includes a new introduction by Gardner fans 
Ronald Graham and Perci Diaconis. LAS 


Mathematics Appreciation, T(13). Mathemati- 
cal Ideas, Sizth Edition. Charles D. Miller, Vern E. 
Heeren, E. John Hornsby, Jr. Scott Foresman, 1990, 
xx + 844 pp, $29. [ISBN: 0-673-38829-8] Text for 
liberal arts mathematics course. In this edition a 
more complete introduction to calculators appears 
in the front of the book; introduction to the metric 
system is relegated to the appendix, although met- 
ric units are used in many examples and exercises; 
and many chapters and sections are expanded. Lots 
of nice topics, but few exercises go beyond mim- 
icking the examples. (Second Edition, \TR, March 
1974; Third Edition, TR, June-July 1978; Fifth Edi- 
tion, TR, October 1986, Extended Review, August- 
September 1988.) SB 


Precalculus, T(13: 1). Precalculus with Applica- 
tions. Stanley I. Grossman. Saunders College, 1990, 
xvii + 762 pp, $45.25. [ISBN: 0-03-007117-8] A 
well-written text whose topics include a review of ba- 
sic algebra, functions and graphs, polynomials, ratio- 
nal functions, exponential and logarithmic functions, 
trigonometry, conic sections, an introduction to lin- 
ear algebra, and an introduction to discrete math- 
ematics. It has numerous examples with all steps 
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included. Also contains many applications and brief 
historical readings. Each section has many exercises 
with answers to most odd-numbered exercises in the 


back. RH 


Education, P*, L. Academic Preparation In Sci- 
ence: Teaching for Transition From High School 
To College, Second Edition. Jose P. Mestre, Jack 
Lochhead. Ed: Robert Orrill. College Entrance Ex- 
amination Board, 1990, ix + 130 pp, (P). [ISBN: 0- 
87447-352-7] A major revision of the 1986 original 
edition, rewritten to reflect new foundations of sci- 
ence education provided by recent cognitive research. 
Focuses on the role of structured inquiry as a means 
of fostering higher order skills. Much of the volume 
relates well to mathematics—partly as a supporting 
skill for learning science, but primarily in terms of 
similarity of analyses and strategies of learning. LAS 


Education, L*. The Mathematics Curriculum: To- 
wards the Year 2000. Eds: John Malone, Hugh 
Burkhardt, Christine Keitel. Curtin U of Technology 
(POB U1987, GPO Perth, Western Australia 6000), 
1989, 469 pp, (P). (ISBN: 0-909848-95-5] Proceed- 
ings of a Theme Group from the 1988 International 
Congress on Mathematical Education in Budapest, 
Hungary. 75 papers from over a dozen different coun- 
tries arranged according to Congress sub-themes into 
four categories: curriculum in the year 2000, impact 
of computers, teachers as agents for change, and the 
dynamics of planned curricular change. The papers 
display world-wide concern for curricular renewal, 
but the diagnoses and prescriptions differ greatly 
from country to country. LAS 


Education, P. The Coordinated Implementation 
of National and State-by-State Reform in School 
Mathematics. Association of State Supervisors of 
Mathematics, 1990, 212 pp, (P). Documents from 
a September 1989 Chicago conference of ASSM that 
launched state-based plans to implement the NCTM 
Curriculum and Evaluation Standards for School 
Mathematics (TR, June-July 1989). Contains texts 
of ASSM position statements, ASSM action items, 
and—the major part of the volume—copies of each 
state’s action plan. LAS 


Education, T(13-16: 1). <A Problem Solving 
Approach to Mathematics for Elementary School 
Teachers, Fourth Edition. Rick Billstein, Shlomo 
Libeskind, Johnny W. Lott. Benjamin/Cummings, 
1990, xx + 945 pp, $37.95. [ISBN: 0-8053-0390-1] 
Changes from the previous editions include refer- 
ences to the NCTM Standards in most chapters, a 
new chapter on motion geometry and tessellations, 
geometry activities keyed to von Hiele levels, and ex- 
tensive Logo activities. Problem solving and calcula- 
tor usage is expanded. Excellent text incorporating 
laboratory activities. (First Edition, TR, November 
1981; Extended Review, January 1982; Second Edi- 
tion, TR, February 1985.) MW 

Education, P. A Curriculum in Fluz: Mathematics 
at Two-Year Colleges. Ed: Ronald M. Davis. MAA, 
1989, 48 pp, $7.50 (P). [ISBN: 0-88385-065-6] Is- 
sues and recommendations for mathematics curricula 
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at two-year colleges, with chapters on baccalaure- 
ate transfer programs, statistics, remediation, occu- 
pational career programs, mathematics for the com- 
puting sciences, and computing in the mathematics 
curriculum. A joint undertaking of AMATYC and 
MAA to help focus attention on the special needs 
of two-year colleges, where one-third of all first-year 
college students study mathematics. LAS 

Logic, T(17: 1), L. Logik fur Informatiker. Uwe 
Schoning. Reihe Informatik, Band 56. Bibliographis- 
ches Institut, 1989, 172 pp, (P). [ISBN: 3-411-14012- 
7| An introductory textbook on logic for computer 
scientists. Chapters on propositional logic, predicate 
logic, and logic programming. Exercises, short bibli- 
ography. DFA 

Foundations, P, L*. Pursuit of Truth. W.V. 
Quine. Harvard U Pr, 1990, x + 113 pp, $17.50. 
[ISBN: 0-674-73950-7] A concise recapitulation of 
ideas on evidence, reference, meaning, intension, 
and truth by one of America’s leading philosophers. 
Quine focuses on certain classes of sentences for 
which truth is “disquotation:” ‘Snow is white’ is true 
iff snow is white. Truth in mathematics, for Quine, 
inclines one towards the propositions of predicative 
set theory; such a view “inactivates the more gratu- 
itous flights of higher set theory.” LAS 


Foundations, T**(15-16: 1), S, P, L. Founda- 
tions and Fundamental Concepts of Mathematics, 
Third Edition. Howard Eves. PWS-Kent, 1990, xvii 
+ 344 pp, $33. [ISBN: 0-534-92163-9] A welcome 
reappearance of the classic An Introduction to the 
Foundations and Fundamental Concepts of Mathe- 
matics by Howard Eves and Carroll V. Newsom in a 
slightly revised form. JNC 


Graph Theory, S(17-18), P. Graphentheorie, 
Band 1: Anwendungen auf Topologie, Gruppentheo- 
rte und Verbandstheorie. Klaus Wagner, Rainer Bo- 
dendiek. Bibliographisches Institut, 1989, xv + 252 
pp, (P). [ISBN: 3-411-03226-X] 


Combinatorics, P. Sequences: Combinatorics, 
Compression, Security, and Transmission. Ed: Re- 
nato M. Capocelli. Springer-Verlag, 1990, xii + 549 
pp, $49.50. [ISBN: 0-387-97186-6] Forty-two pa- 
pers on combinatorial algorithms, data compression, 
security, and transmission. Includes a Paul Erdos pa- 
per on probabilistic number theory with $2100 worth 
of conjectures. BC 

Combinatorics, T(16-18: 1, 2), S, P, L. Ma- 
troid Theory and tts Applications in Electric Network 
Theory and in Statics. Andras Recski. Algorithms 
& Combinatorics, V. 6. Springer-Verlag, 1989, xiii + 
531 pp, $89.50. [ISBN: 0-387-15285-7] An introduc- 
tion to matroid theory (including the necessary back- 
ground about graphs and matrices), with special em- 
phasis on its algorithmic aspects and applications in 
electrical engineering and statics. The breadth of the 
mathematical portion is between the level of Welsh’s 
book (1977) and Lawler’s work (1976). About 40% 
of the material is on applications (which may be 
skipped over if desired); most of these have never 
before been given in book form. Nearly 800 exercises 
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and problems with full solutions. LCL 


Number Theory, T(14-15: 1), L. Elementare 
Zahlentheorte. Friedhelm Padberg. Bibliographis- 
ches Institut, 1989, 166 pp, (P). [ISBN: 3-411-00639- 
0] Intended chiefly for teachers and prospective 
teachers of school mathematics. Exercises, solu- 
tions. JD-B 


Number Theory, P, L**. A Dictionary of 
Real Numbers. Jonathan Borwein, Peter Borwein. 
Wadsworth, 1990, viii + 424 pp. [ISBN: 0-534-12840- 
8] <A reverse handbook, of special functions, listing 
over 100,000 eight-digit decimal numbers between 0 
and 1 with a crisply encoded clue as to how that 
particular number arises as the decimal part of the 
value or root of some common functions. Includes 
elementary and special functions, simple algebraic 
and transcendental constants, rational combinations 
of constants, common physical constants, real roots 
of cubic polynomials with small integer coefficients, 
and much more. Generated by Maple and typeset 
in TeX to avoid human error. An invaluable aid to 
mathematicians who do computer-assisted research; 
also makes a great coffee-table gift for those with a 
sense of humor. LAS 


Algebra, T**(15-16: 1), L. Galois Theory, Second 
Edition. Ian Stewart. Chapman & Hall, 1989, xxx + 
202 pp, $52.50; $22.50 (P). [ISBN: 0-412-34540-4; 0- 
412-34550-1] Changes from the First Edition (TR, 
August-September 1973; Extended Review, Octo- 
ber 1976) include the addition of an introductory 
overview, a new chapter on the calculation of Ga- 
lois groups, a modification of the historical section, 
some changes in the proofs, more examples, and some 
modification of the exercises. This is a very nicely 
written book which would serve as an excellent un- 
dergraduate text. RH 


Algebra, T(15-16). Numbers, Groups and Codes. 
J.F. Humphreys, M.Y. Prest. Cambridge University 
Pr, 1989, xvi + 288 pp, $19.95 (P); $59.50. [ISBN: 
0-521-35938-4; 0-521-35084-0] An interesting intro- 
duction to group theory and algebra written with an 
eye toward applications in computer science. The 
chapters cover number theory (congruences, factor- 
ization, public key ciphers), sets and relations (func- 
tions, finite state machines), logic (Boolean algebras, 
switching circuits), group theory, and elementary 
coding theory. Well-written with many exercises, SG 


Algebra, P. Methods of Representation Theory with 
Applications to Finite Groups and Orders, Volume 1. 
Charles W. Curtis, Irving Reiner. Classics Library. 
Wiley, 1990, xxi + 819 pp, $34.95 (P). [ISBN: 0-471- 
52367-4] A paperback version of the 1981 hardcover 
edition (TR, March 1982). Still an (if not the) out- 
standing reference book in the field. SG 


Algebra, P. Arithmetical Completely Simple Semi- 
groups. E.-A. Behrens. Research & Exposition in 
Math., V. 17. Heldermann Verlag, 1989, vii + 135 
pp, DM 48 (P). [ISBN: 3-88538-217-2] Develops 
a non-commutative arithmetic for partially ordered 
semigroups. Defines arithmetical completely simple 
semigroups and applies them to the integral normal 
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ideals in the central simple algebras over the quotient 
field of a Dedekind ring under a generalized multipli- 
cation. Only a basic knowledge of algebra is required 
as background. RH 


Calculus, T(13: 2), S. Applied Calculus, Second 
Edition. S.T. Tan. PWS-Kent, 1990, xv + 775 
pp, $37. (ISBN: 0-534-92135-3] Intends to “in- 
clude at least one real-life application in each section 
(whenever feasible).” Lots of good problems; most 
come from business and economics, some from so- 
cial and life sciences, very few from physical sciences. 
Changes in this edition (First Edition, TR, June- 
July 1987) include addition of a preliminary precal- 
culus review; “self-check exercises” (with worked out 
solutions) at the end of each section in addition to 
regular exercises; additional examples and exercises 
(those requiring calculators are labeled); section on 
limits now includes discussion of one-side limits; top- 
ics of integration section has been reorganized with 
an earlier introduction of the definite integral and 
fundamental theorem; numerical integration section 
now includes error analysis of trapezoidal and Simp- 
son’s Rules; expanded series section includes error 
analysis of integral, comparison and ratio tests. As 
in the previous edition, there is no mention of the 
mean value theorem. SB 


Calculus, T(13: 1). Essential Calculus with Ap- 
plications. Richard A. Silverman. Dover, 1989, 
xii + 292 pp, $8.95 (P). [ISBN: 0-486-66097-4] A 
“corrected and slightly enlarged republication” of 
the original 1977 Saunders publication (TR, Octo- 
ber 1977); now includes the section Supplementary 
Hints and Answers formerly issued as a separate In- 
structor’s Manual. JNC 


Calculus, T(13: 2-3). Calculus. Robert Seeley. 
Harcourt Brace Jovanovich, 1990, xv + 1048 pp, $44 
(ISBN: 0-15-505681-6] What distinguishes this from 
many calculus texts is the genuine effort towards pre- 
senting motivational material in advance of the rigor- 
ous calculus. This comes in the form of applications 
set in their historical context. For example, opti- 
mization problems and motion precede the rigorous 
discussion of limits, and Taylor polynomials begin the 
chapter on sequences and series. KS 


Real Analysis, T(16-17: 1). An Introduction to 
the Theory of Real Functions. Stanistaw Lojasiewicz. 
Transl: G.H. Lawden. Wiley, 1988, ix + 230 pp, 
$57.95. [ISBN: 0-471-91414-2] A terse theorem- 
proof English version of a Polish text first published 
in 1964. Set in the context of metric sp&ces; covers 
Borel sets and Baire functions, Reisz’s theorem, and 
the Radon-Nikodym theorem. Elegant, crisp, pris- 
tine; will require a well-motivated reader. LAS 


Real Analysis, P, L. Lecture Notes in Mathemat- 
tcs-1419: New Integrals. Eds: P.S. Bullen, et al. 
Springer-Verlag, 1990, 200 pp, $21.80 (P). [ISBN: 
0-387-52322-7] Proceedings of an August 1988 con- 
ference held at the University of Ulster in honor of 
Ralph Henstock, and focused on generalized integrals 
related to Henstock’s work on the generalized Rie- 
mannian integral “That was as good as, and indeed, 
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in many ways, far superior to Lebesgue’s.” LAS 


Real Analysis, T(16-17: 1, 2), S. Problems in 
Real Analysis: A Workbook with Solutions. Char- 
alambos D. Aliprantis, Owen Burkinshaw. Academic 
Pr, 1990, vii + 285 pp, $29.95. [ISBN: 0-12-050256- 
9] A collection of 456 exercises, with complete so- 
lutions, on topics in real analysis: set theory, gen- 
eral topology, measure theory, normed spaces, topics 
in Lebesgue integration, L, spaces. Best used as a 
companion to the authors’ book Principles of Real 
Analysis (TR, November 1982), in which all these 
problems appear, and to which many references (e.g., 
theorem numbers) are made. Could be a valuable 
study aid for readers who resist temptation (per- 
haps considerable—solutions appear hard on heels 
of problems) to peek. PZ 


Real Analysis, P. Almost Everywhere Conver- 
gence. Eds: Gerald A. Edgar, Louis Sucheston. Aca- 
demic Pr, 1989, xv + 416 pp, $49.95. [ISBN: 0-12- 
231050-0] The proceedings of a conference held in 
Columbus, Ohio, June 1988. Contains 25 papers in 
this expanding area of ergodic theory; some of a sur- 
vey nature, others quite technical. TAV 


Numerical Analysis, T(16-17: 1), L. Numeri- 
cal Analysis: A Second Course. James M. Ortega. 
Classics in Appl. Math., V. 3. SIAM, 1990, xiii 
+ 201 pp, $25.50 (P). [ISBN: 0-89871-250-5] An 
unabridged, corrected republication of the original 
1972 edition (TR, August-September 1972; Extended 
Review, October 1974). Presents theoretical results 
pertaining to rounding error, discretization error, 
and convergence error. Applications of these results 
are given to the study of systems of linear equations 
and differential equations. AO 


Numerical Analysis, P. Iterative Methods for 
Large Linear Systems. Eds: David R. Kincaid, 
Linda J. Hayes. Academic Pr, 1990, xxx + 319 
pp, $49.95. [ISBN: 0-12-407475-8] Seventeen pa- 
pers from a conference held October 19-21, 1988 at 
the University of Texas at Austin. These papers pro- 
vide an overview of the state-of-the-art in the use 
of iterative methods for solving sparse linear sys- 
tems. AO 


Operator Theory, T(18: 1, 2), S, P. Lecture 
Notes in Mathematics-1413: Uniqueness of the In- 
jective III, Factor. Steve Wright. Springer-Verlag, 
1989, 108 pp, $13.50 (P). [ISBN: 0-387-52130-5] 
Based on a lecture course by the author, this volume 
contains an exposition of work of A. Connes (Jour- 
nal of Operator Theory, 14 (1985)) and U. Haagerup 
(Acta Math., 158 (1987)). Together these constitute 
a proof of what’s promised in the title. KS 


Operator Theory, P. Matriz and Operator Ezten- 
sions. H.J. Woerdeman. CWI Tract, V. 68. Mathe- 
matisch Centrum, 1989, 158 pp, Dfl. 48 (P). [ISBN: 
90-6196-379-6] Taken from the author’s Ph.D. the- 
sis, written at Vrije Universiteit in Amsterdam, it 
concerns the problem of extending a partially defined 
operator to a full operator having certain prescribed 
properties. LCL 


Functional Analysis, T(16-17: 1, 2).  Anal- 
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ysis Now. Gert K. Pedersen. Grad. Texts in 
Math., V. 118. Springer-Verlag, 1989, xiv + 277 pp, 
$49.80. [ISBN: 0-387-96788-5] Key propositions of 
functional analysis (topology, Banach and Hilbert 
spaces, spectral theory, unbounded operators, inte- 
gration theory) embedded in a context to motivate 
the necessary abstractions that “every young ana- 
lyst” should know: Norms, Operators, and Weak 
topologies—the “Now” of the title. “Our critics, es- 
pecially those well-meaning pedagogues, should come 
to realize that mathematics becomes simpler only 
through abstraction.” LAS 


Functional Analysis, P. Inverse und schlecht 
gestellte Probleme. Alfred Karl Louis. BG Teub- 
ner Stuttgart, 1989, 205 pp, DM 26,80 (P). [ISBN: 
3-519-02084-X] 

Functional Analysis, P. Approzimation Problems 
in Analysis and Probability. M.P. Heble. Math. 
Stud., V. 159. North-Holland (US Distr: Elsevier 
Science), 1989, xi + 245 pp, $92. [ISBN: 0-444- 
88021-6] Recent research on approximation theory 
with particular emphasis on probabilistic methods 
and applications. Introductory chapter on general- 
izations of the Weierstrass-Stone theorem. Appen- 
dices give brief review of theoretical background. JO 


Analysis, P. Representations of Lie Groups, Kyoto, 
Hiroshima, 1986. Eds: K. Okamoto, T. Otshima. 
Adv. Stud. in Pure Math., V. 14. Academic Pr, 
1988, 660 pp, $99.50. [ISBN: 0-12-525100-9] Pro- 
ceedings of a symposium on Analysis on Homoge- 
neous Spaces and Representations of Lie Groups in 
September 1986. Contains 22 papers. MLR 


Analysis, T(18: 1), P. Stochastic Calculus in Man- 
ifolds. Michel Emery. Universitext. Springer-Verlag, 
1989, x + 151 pp, $29 (P). (ISBN: 0-387-51664-6] 
An introduction to the Schwartz-Meyer second-order 
geometry and its use in stochastic calculus. Contains 
basics of the stochastic calculus and differential ge- 
ometry, so is reasonably self-contained. In spite of 
this, the work is quite technical and demanding. TAV 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1417: Algebraic Geometry. Eds: A.J. 
Sommese, A. Biancofiore, E.L. Livorni. Springer- 
Verlag, 1990, 320 pp, $31.20 (P). [ISBN: 0-387-52217- 
4] A collection of eighteen papers presented at a 
1988 conference in Italy. The papers are devoted 
primarily to the study of hyperplane sections of pro- 
jective varieties, and the role played by such sections 
in the classification of projective varieties by projec- 
tive invariants. SG \ 


Differential Geometry, T(16-18: 1), S»L. Dif- 
ferential Forms with Applications to the Physical Sci- 
ences. Harley Flanders. Dover, 1989, xv + 205 
pp, $5.95 (P). [ISBN: 0-486-66169-5] Introduction 
to differential forms, aimed at weaning physicists and 
engineers away from tensors. A slightly revised ver- 
sion of the 1963 original. BC 

Differential Geometry, P. Contact Geometry and 
Wave Propagation. V.I. Arnold. L’Enseignement 
Mathematique, 1989, 56 pp, 27 Frs. (P). Contact 
geometry is the odd-dimensional counterpart of sym- 
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plectic geometry and is, according to the author, the 
natural basis for optics and the theory of wave prop- 
agation. This monograph presents, briefly, the basic 
results of contact geometry. JO 


Differential Geometry, P. Geometry of Manifolds. 
Ed: K. Shiohama. Perspectives in Math., V. 8. Aca- 
demic Pr, 1989, xv + 517 pp, $49.95. [ISBN: 0-12- 
640170-5| Thirty-two papers contributed by partic- 
ipants in the 35th Symposium on Differential Geom- 
etry held in Matsumoto, Japan in July 1988. Sec- 
tions on dynamical systems, geometry of subman- 
ifolds, Lie sphere geometry, Riemannian geometry, 
the geometry of Laplace operators, and Yang-Mills 
connections. JO 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1418: Homotopy Theory and Related Top- 
tics. Ed: M. Mimura. Springer-Verlag, 1990, 241 
pp, $21.80 (P). [ISBN: 0-387-52246-8] Proceedings 
of an August 1988 international conference held in 
honor of Hirosi Toda on the occasion of his sixtieth 
birthday. Opening lecture on Toda’s work is by J.F. 
Adams; 18 other papers extend various aspects of 
Toda’s work. LAS 


Dynamical Systems, S(16-18), P, L*. Ezploring 
Complezity, An Introduction. Grégoire Nicolis, Ilya 
Prigogine. WH Freeman, 1989, xi + 313 pp, $24.95. 
(ISBN: 0-7167-1859-6] One goal for this book is to 
provide a short introduction to methods devised over 
recent decades to explore complexity. From a de- 
scriptive beginning on the vocabulary of complexity 
(e.g., nonequilibrium, stability, bifurcation and sym- 
metry breaking, long-range order, chaos, attractors, 
and fractals), the authors continue with more de- 
tailed descriptions (e.g., nonequilibrium physics and 
modern dynamical systems). One of the most impor- 
tant goals is to show that concepts of complexity are 
quite universal, and break through interdisciplinary 
barriers. The purpose is nothing less than to pro- 
mote a whole new world view in which evolution and 
plurality (rather than stability and periodicity) are 
the key words. The authors (one a 1977 Nobel lau- 
reate for chemistry, the other a distinguished the- 
oretical physicist and chemist) take every occasion 
to stress the importance of these concepts for better 
understanding the global environment in which we 
live. LCL 


Dynamical Systems, T(18: 2). Infinite-Dimen- 
stonal Dynamical Systems in Mechanics and Physics. 
Roger Temam. Appl. Math. Sci., V. 68. Springer- 
Verlag, 1988, xvi + 500 pp, $64. [ISBN: 0-387-96638- 
2] Typical of most books in this series, this volume 
is a reasonably self-contained, readable, graduate- 
level text on the subject. Includes applications of 
the ideas of invariant sets and attractors to a large 
number of physically interesting systems. First chap- 
ter has a “User’s Guide” for readers interested only 
in learning enough for physical applications. JO 


Probability, T*(14: 1). Introduction to Probability 
and Its Applications. Richard L. Scheaffer. Duxbury 
Adv. Ser. in Stat. & Decision Sci. PWS-Kent, 1990, 
x + 363 pp. (ISBN: 0-534-91970-7] A well-written 
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text for a one-semester, post-calculus course. In- 
cludes discrete and continuous multivariables, and 
examples of stochastic processes. Extensive, well- 
chosen exercises, clear graphics, and careful organi- 
zation make this an attractive choice. TAV 


Stochastic Processes. Lecture Notes in Mathe- 
matics-1412: Stability Problems for Stochastic Mod- 
els. Eds: V.V. Kalashnikov, V.M. Zolotarev. 
Springer-Verlag, 1989, x + 380 pp, $33 (P). [ISBN: 
0-387-51948-3] The proceedings of an international 
conference in Sukhumi (USSR) in the fall of 1987. 
The scope of the 26 papers is quite broad, as is the 
whole field of stability. TAV 


Stochastic Processes, P. Queueing Networks— 
Ezact Computational Algorithms: A Unified Theory 
Based on Decomposition and Aggregation. Adrian E. 
Conway, Nicolas D. Georganas. Computer Systems 
Ser. MIT Pr, 1989, xiv + 234 pp, $25. [ISBN: 0-262- 
03145-0] A state-of-the-art view of exact computa- 
tional algorithms to analyze product-form queueing 
networks in equilibrium. Uses the techniques of de- 
composition and aggregation. Attempts to lay the 
ground work for future development of exact algo- 
rithms. TAV 


Stochastic Processes, T?7(16: 1), S(15-16), L. 
Spontaneous Phenomena: A Mathematical Analysis. 
Flemming Tops¢ge. Transl: John Stillwell. Academic 
Pr, 1990, xi + 182 pp, $29.95. [ISBN: 0-12-695175- 
6] Translation of a 1983 Danish book. Introduces 
the stochastic modeling of spontaneous phenomena 
by a detailed development, including the background 
physics, of a Poisson process to model radioactive 
decay. Includes a section of 46 problems of widely 
varying difficulty which can serve as the basis for a 
course, and acollection of 18 examples from a variety 
of disciplines “for further investigation.” RSK 


Statistics, T(14-16: 2). Probability and Statistics 
for Modern Engineering, Second Edition. Lawrence 
L. Lapin. Duxbury Adv. Ser. in Stat. & Decision 
Sci. PWS-Kent, 1990, xxii + 810 pp. [ISBN: 0-534- 
91654-6] Beginning (not elementary) level statistics 
text designed specifically for engineering students. 
Exercises and examples refer to engineering appli- 
cations. Text assumes students have strong quan- 
titative background, and calculus is used. A notch 
above most elementary statistics texts in terms of 
theory covered, but below most theoretical proba- 
bility and statistics texts for undergraduate math- 
ematics majors. Many theoretical results are pre- 
sented (e.g., joint probability distributions, moment 
generating functions, gamma distribution’ etc.), but 
most proofs are left out. Covers descriptive statis- 
tics, basic probability, random variables and distri- 
butions, sampling and sampling distributions, esti- 
mation, hypothesis testing, regression, multiple re- 
gression, analysis of variance, nonparametrics, and a 
nice section on special topics (control charts, accep- 
tance sampling, reliability and life testing, Weibull 
distributions). Good text with new approach for en- 
gineers. JJ 


Statistics, S(15-18), L. Analyzing Quantitative 
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Behavioral Observation Data. Hoi K. Suen, Donald 
Ary. Lawrence Erlbaum Associates, 1989, xi + 359 
pp, $34.50. [ISBN: 0-89859-945-8] A conceptual in- 
troduction to sampling, reliability, and validity issues 
in the context of analyzing human and animal behav- 
ior. Also presents ARIMA time series and Markov 
chain models. RWJ 


Statistics, S(17), P. Robust Regression: Analy- 
sis and Applications. Eds: Kenneth D. Lawrence, 
Jeffrey L. Arthur. Statistics: Textbooks & Mono- 
graphs, V. 108. Marcel Dekker, 1990, xiii + 287 pp, 
$89.75. [ISBN: 0-8247-8129-5] Fourteen papers on 
robust regression including five on robust time se- 
ries analysis and two on ridge regression. Includes 
examples with real data. TH 


Statistics, P. Nonlinear Regression, Functional Re- 
lations and Robust Methods: Volume II, Statistical 
Methods of Model Building. Eds: Helga Bunke, 
Olaf Bunke. Prob. & Math. Stat. Wiley, 1989, 
449 pp, $69.96. [ISBN: 0-471-91239-5] Chapters 
on nonlinear regression models, robust linear regres- 
sion, and errors-in-variables regression. Little com- 
bination of the three topics. Encyclopedic character 
(many topics presented quickly). Translation from 
German. TH 

Statistics, S, P, L**. The History of Statis- 
tics: The Measurement of Uncertainty before 1900. 
Stephen M. Stigler. Harvard University Pr, 1986, 
xvi + 410 pp, (P). [ISBN: 0-674-40341-X] Paperback 
reprint of 1986 original edition (TR, August 1987). 
A readable, fact-filled survey focused on variability 
in astronomy and geodesics, probability in social sci- 
ence, and studies of heredity. LAS 

Programming, T(14: 1, 2), S. Accessing C: Tips 
from the Ezperts, Second Edition. Ed: Strawberry 
Software. Van Nostrand Reinhold, 1989, xiii + 
306 pp, $29.95 (P). [ISBN: 0-442-20662-3] Text de- 
signed to bridge the gap “between the syntax of the 
language and the knowledge to complete real-world 
applications in C.” Surprisingly even quality despite 
a large number of contributors. Exposition is to the 
point and includes many helpful code examples. JO 
Computer Systems, S(13-14), P, L. Maple for 
the Calculus Student: A Tutorial. Wade Ellis, Jr., 
Ed Lodi. Brooks/Cole, 1989, xii + 67 pp, $10.25 
(P). [ISBN: 0-534-11874-7] A clear, leisurely, and 
friendly tutorial introduction to the rudiments of 
using the algebra system Maple for graphical, nu- 
merical, and symbolic computing. Calculus-level 
operations (curve plotting, symbolic differentiation, 
and antidifferentiation, solving differential equations, 
etc.) are emphasized, but not only calculus students 
will benefit. The booklet can serve anyone as a first 
quick overview of the syntax, capabilities, and “feel” 
of a modern computer algebra system, and as an en- 
ticement to look further. PZ 

Computer Systems, P, L. IATpX for Scientists 
and Engineers. David J. Buerger. McGraw-Hill, 
1990, xvi + 198 pp, $29.95 (P). [ISBN: 0-07-008845- 
4] A practical guide to IATpX for those who have 
never used TpX . Provides examples of what’s needed 
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to do standard things without going into full details 
on how IATpX works. Covers formatting environ- 
ments, mathematical typesetting, tables, footnotes, 
bibliography, indices, and document styles. Con- 
cludes with a glossary and a display of various Com- 
puter Modern Fonts at different magnifications. LAS 


Applications, P. Transactions of the Seventh Army 
Conference on Applied Mathematics and Comput- 
ing. US Army Research Office (POB 12211, Research 
Triangle Park, NC 27709), 1990, xx + 704 pp, (P). 
Nearly 50 papers on a wide variety of topics in ap- 
plied mathematics from a June 1989 conference at 
West Point. Image algorithms, inverse problems, 
game theory, wave propagation, tiling, splines, and 
much more. LAS 

Applications, T, S, P, L*. Mathematical Bio- 
economics: The Optimal Management of Renewable 
Resources, Second Edition. Colin W. Clark. Wiley, 
1990, xiii + 386 pp, $49.95. [ISBN: 0-471-50883- 
7| A timely revision of the 1976 First Edition (TR, 
February 1977) emphasizing the “deliberate over- 
exploitation” of biological resources possessing low 
rates of biological growth. “If forests, marine mam- 
mals, or grazing lands are incapable of replenishing 
themselves at sufficiently high rates, economically 
rational owners will tend to over-exploit these re- 
sources.” Chapters feature economic models of re- 
source management; optimal control; nonlinear, dy- 
namical, stochastic, and discrete models; growth, ag- 
ing, and multi-species models; and resource regula- 
tion. LAS 


Applications (Biological Science), P, L. Lecture 
Notes in Btomathematics-83: Mathematical and Sta- 
tistical Approaches to AIDS Epidemiology. Ed: C. 
Castillo-Chavez. Springer-Verlag, 1989, ix + 405 pp, 
$47.10 (P). [ISBN: 0-387-52174-7] Nineteen papers 
on modelling HIV infectivity, transmission dynam- 
ics, social dynamics of sexually transmitted diseases, 
and interactions with the immune system. BC 


Applications (Biological Science), P. Some 
Mathematical Questions in Biology: The Dynamics 
of Ezcitable Media. Ed: Hans G. Othmer. Lect. on 
Math. in the Life Sci., V. 21. AMS, 1989, villi + 181 
pp, $34 (P). [ISBN: 0-8218-1171-1] Six papers on 
excitable (as opposed to oscillatory) systems, includ- 
ing synchronization of cardiac pacemakers, collective 
behavior in the hippocampal, and wave propagation 
during aggregation in slime mold. BC 


Applications (Communication Theory), P. Sig- 
nal Processing, Part I: Signal Processing ‘ Theory. 
Eds: L. Auslander, T. Kailath, S. Mitter. IMA, V. 
22. Springer-Verlag, 1990, xiii + 251 pp, $29. [ISBN: 
0-387-97215-3] Seventeen papers from a 1988 work- 
shop in signal processing. BC 

Applications (Fluid Dynamics), P. Computa- 
tional Fluid Mechanics, Selected Papers. Alexandre 
Joel Chorin. Academic Pr, 1989, xv + 223 pp, $35. 
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(ISBN: 0-12-174070-6] Fourteen of the author’s pa- 
pers on the general themes of turbulence and the 
numerical solution of fluid mechanics equations with 
small viscosity. Originally published between 1967 
and 1982, these papers are collected now because of 
the author’s refreshing and reasonable belief that the 
ideas of the subject “are easier to learn if they are 
presented in the simpler garb that preceded the de- 
velopment of sophisticated implementations and of a 
general mathematical theory.” JO 


Applications (Fluid Dynamics), P. Boundary 
Value Problems in Mechanics of Nonhomogeneous 
Fluids. S.N. Antontsev, A.V. Kazhikhov, V.N. Mon- 
akhov. Stud. in Math. & Its Applic., V. 22. North- 
Holland (US Distr: Elsevier Science), 1990, xii + 309 
pp, $85.25. [ISBN: 0-444-88382-7] An understand- 
able but choppy English translation of the original 
1983 Russian edition, this volume reports investiga- 
tions into boundary value problems for systems of 
partial differential equations describing viscous gas 
flows and the filtration of multi-phase mixtures in 
porous media. Each chapter includes a section de- 
scribing relevant open problems. JO 


Applications (Physics), 8(18), P. Analysis and 
Continuum Mechanics. Eds: S.S. Antman, et al. 
Springer-Verlag, 1989, x + 829 pp, $80.40 (P). 
(ISBN: 0-387-50917-8] A collection of papers ded- 
icated to J. Serrin. MU 


Applications (Physics), T(18), S, P. Scattering 
Theory for Hyperbolic Operators. Vesselin Petkov. 
Stud. in Math. & Its Applic., V. 21. Elsevier Science, 
1989, xiv + 373 pp, $92.75. [ISBN: 0-444-88056-9] 
Applicable to spaces of either even or odd dimen- 
sion, the approach used here involves the RAGE type 
theorem. Topics include contraction semigroups and 
power bounded operators, the Cauchy problem for 
the wave equations, scattering theory for symmetric 
systems with dissipative boundary conditions. MU 


Applications (Simulation), P. The Monte Carlo 
Method for Semiconductor Device Simulation. C. Ja- 
coboni, P. Lugli. Springer-Verlag, 1989, x + 356 pp, 
$98. [ISBN: 0-387-82110-4] It is getting to the (sub- 
micron) point where you cannot design a computer 
chip without first simulating the physics of charge 
transport. Monte Carlo (casino) methods are likely 
to help. BC 


Reviewers 
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matical competition for high school stu- 
dents in the United States and Canada 
that begins with the AHSME taken by over 
400,000 students, continues with the Amer- 
ican Invitational Mathematics Exam involv- 
ing 2,000 students, and culminates in the 
100-contestant USAMO. 
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S. Klamkin brings together 75 original USA 
Mathematical Olympiad (USAMO) problems 
for years 1972-1986, with many improve- 
ments, extensions, finger exercises, open 
problems, references and solutions, often 
showing alternative approaches. The prob- 
lems are coded by subject and solutions 
are arranged by subject as an aid to those 
interested in a particular field. Contains 
a glossary of frequently used terms and 
theorems, and a comprehensive bibliogra- 
phy with items numbered and referred to 
in brackets in the text. The problems are 
intriguing and the solutions elegant and in- 
formative. Students and teachers will en- 
joy working these challenging problems. In- 
deed all those those who are mathemat- 
ically inclined will find many delights and 
pleasant challenges in this book. 
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Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put into 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
the opportunities for better and more 


forceful writing. It does this both by pre- 
cept and by example. 

A book to be read for its sharpness 
and wit as well as for enlightenment, Writ- 
ing Mathematics Well should be on the 
shelf of anyone who writes or intends to 
write mathematics. It will amuse and 
delight the already careful writer and it will 
help reform and refine the sensibilities of 
those who may be somewhat careless 
about their writing. 
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culus and linear algebra) until the last page. Mathematicians will be intrigued by the 
development which places Galois Theory in an historical perspective and allows it to 
unfold from the emergence of the classical construction problems to the discovery of 
their solutions. The book presents some of history’s greatest and most elegant mathematics. 
Read what some reviewers say about this monograph: 

‘‘Hadlock has produced a pedagogic masterpiece . . . His idea of beginning 

with the three Greek problems and Ietting them fire the reader’s spark plugs 

Is brillant. . . (his) ability to inject vitality and enthusiasm into mathematical 

text 1s remarkable. (George Piranian, University of Michigan) 


“In the preface, the author claims to have written the book for himself. . . 
Fortunately for us, he chose to share his work with the mathematical commu- 
nity. I suggested the book as collateral reading in a one-semester course in Galois 
Theory and the students found it very readable and most helpful in establish- 
ing a motivation and direction . . . This is a delightful book for both student 
and teacher.’’ (John D. Leadley, Mathematical Reviews) 


Table of Contents 
Chapter 1—The Three Greek Problems 
Constructible Lengths; Doubling the Cube; Trisecting the Angle; Squaring the Circle; 
Polynomials and Their Roots; Symmetric Functions; the Transcendence of T. 
Chapter 2—Field Extensions 
Arithmetic of Polynomials; Simple, Multiple, and Finite Extensions; Geometric 
Constructions Revisited; Roots of Complex Numbers; Constructibility of Regular 
Polygons I; Congruences; Constructibility of Regular Polygons II. 
Chapter 3—Solution by Radicals , 
Statement of the Problem; Automorphisms and Groups; The Group of an Exten- 
sion; Two Fundamental Theorems, Galois’ Theorem; Abel’s Theorem; Some 
Solvable Equations. 
Chapter 4—Polynomials With Symmetric Groups 
Background Information; Hilbert’s Irreducibility Theorem; Existence of Poly- 
nomials over Q with Group S.. 


4 aw a 
/ os’ Order from: 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


Books thaf make mathematics fun for 
students fo learn, and easy for you to 


feach.. . 


Investigations in Algebra 
Albert A. Cuoco 


Investigations in Algebra departs from a preoccupation with calculus as the 
ultimate goal of, and the universal introduction to, advanced mathematics 
by using Logo to explore combinatorics, number-theory, the study of dis- 
crete functions, and other topics that are not on the traditional path to 
calculus. This approach encourages students to participate actively in 
exciting mathematics, developing their facility for abstraction as well as an 
appreciation for the power of mathematical methods. 

“Back-to-basics enthusiasts would have us believe that experiential learning 
is incompatible with hard facts and measurable achievement. Al Cuoco, a 
gifted mathematician as well as a master teacher, gives his students a 
remarkable fund of solid, sophisticated mathematical knowledge, while 
respecting the exploratory process through which that knowledge was 
originally discovered. The book's use of computer programming as a me- 
dium for mathematical experiments helps make this work possible, without 
allowing the programming to displace the mathematics as the focus of 
attention.” — Dr. Brian Harvey, Lecturer, Computer Science Division, Univer- 
sity of California, Berkeley 
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for ways to integrate computers into the mathematics curriculum. 
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STUDIES IN NUMERICAL ANALYSIS 
Gene H. Golub, Editor 


Golub set high standards in the lead arti- 
cle on numerical solution of polynomials by 
J.H. Wilkinson which won the Chauvenet 
Prize as the best piece of mathematical ex- 
position to appear in 1985. These papers 


This book ts not just for numerical an- 
alysts, it is accessible to anyone with 


some mathematical foundation who 
wants to taste selections from a nu- 
merical analysis menu. The papers do 
a good job in motivating and introduc- 
ing their subjects, and some provide 
a nice historical background. | recom- 
mend the articles in this book for any- 
one interested in learning more about 
the current research interests in numer- 
ical analysis. They may excite their 
readers to investigate further this rel- 
evant field of mathematics. 


David C. Arney in Mathematics and Com- 
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Has Progress in Mathematics Slowed Down?* 


PAUL R. Hatmos, Santa Clara University 


PauL Hatmos has three degrees from the University of Illinois; soon after 
getting the last one he became, for a couple of years, assistant to John 
von Neumann. Since then he has taught at many universities (including 
Chicago, Michigan, and Indiana) and has visited many others (including 
Miami, Montevideo, Hawaii, Edinburgh, and Western Australia); he has 
been on the faculty of Santa Clara University since 1985. His mathematical 
interests include ergodic theory, algebraic logic, and operators on Hilbert 
space. 


Prologue. Do we know anything that Dedekind didn’t know? We should. 
Dedekind died in February, 1916. Six weeks before that, late Friday afternoon on 
New Year’s Eve 1915, the MAA was born in Columbus, Ohio. In connection with 
the celebration of its diamond anniversary, in August 1990, in Columbus, Ohio, it 
became my mission to report on whether and how mathematics has changed 
during the 75 years of the MAA’s existence, and what follows is an attempt at such 
a report. 

I am not trying to teach any mathematics in this report, nor even any history of 
mathematics—all I am trying to do is share an interesting look at the growth of 
mathematics in the last 75 years. Everybody could find out everything I found out 
by spending a few months looking at all extant volumes of Mathematical Reviews 
and a few dozen other journals, but everybody hasn’t done it, and I have, and I am 
ready to tell what I have found. 

The question that I set out to answer might be phrased this way: if you had a 
time machine to take you back to Dedekind, what could you teach him about 
progress in mathematics since his day? In an attempt to organize the possible 
answers, I propose to put them into three classes: concepts, explosions, and 
developments. 

The first class consists of the new concepts that Dedekind could not have 
predicted or expected, the new words that all mathematicians alive today have 
heard over and over again and that many of us wish we knew more about. (Sample: 
catastrophe theory.) By an explosion I mean a piece of mathematical progress that 
is genuine mathematics, so recognized by the whole profession, but that is at the 
same time the answer to old problems of such great fame that it is hot news not 
only for the Transactions, but also for the Times for a day, for Time for a week, and 
for student mathematics clubs for many months. (Sample: the four-color theorem.) 
The third proposed class consists of the deep and in some cases even breathtaking 
developments (but not explosions) of the kind that may not make the Times, but 
could possibly get Fields medals for their discoverers. (Sample: the independence 
of the continuum hypothesis. Incidentally: not all the developers mentioned below 
got Fields medals, and not all Fields medalists of the period are mentioned below, 
but qualitatively the two classes are essentially the same.) Many (but not all) of the 


*A greatly condensed version of this paper was presented in August 1990 at the Columbus meeting 
of the Mathematical Association of America. 
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subjects have been and some of the others no doubt soon will be treated in 
expository articles in the MONTHLY. 

I am listing a total of 22 subjects: 9 concepts, 2 explosions, and 11 developments. 
If someone else had picked 22, the chances are that the overlap would have been 
considerable (10? 18?) and so would the difference. There is nothing wrong with 
that. Some of the subjects that could (should?) have been included but were not 
are: exotic spheres, the Hauptvermutung, NP completeness, pseudodifferential 
operators, and the simplex method. One thing is for sure: no matter what subjects 
were treated, not all would be of equal importance. Some of the topics that I 
discuss are yes-or-no theorems (such as the solution of Hilbert’s fifth problem), 
some are tools (such as the fast Fourier transform), some are attitudes (such as 
nonstandard analysis), and some are gigantic theories (such as the Atiyah-Singer 
index equation). Each topic that is listed is here because either it is important or it 
is amusing or, at the very least, it has been well publicized, and, in any case, it 
deserves attention. All the topics do have at least one feature in common: none of 
them belongs to what is sometimes called applied mathematics and (in particular?) 
none of them belongs to computer science. I had two reasons for that decision: one 
is that I don’t know the subject and the other is that it will surely be represented in 
other reports. 

The articles vary in length from one paragraph to about a dozen. The length 
was determined, in each case, by how complicated the subject is, by what I was 
able to learn of it, and, of course, to some extent, by personal preference. It is 
unusual in research mathematics to find something that is both interesting and 
elementary; when I did find something like that, I took advantage of the find and 
digressed somewhat from the main point. 

If one of the pieces of mathematics that I am reporting on is one that you know 
about, you won’t learn anything from what I say, but if not, you are in danger of 
learning something. You will probably not learn the precise statement of a 
theorem (mathematics is, after all, not a collection of theorems but a collection of 
ideas), and you will certainly not learn a proof. The purpose of the articles is not so 
much to explain things as to put them into a context. That can mean many 
different things; one of them is that an article about a generalization of something 
should at least refer to the easiest nontrivial manifestation of that something. 
Another thing it means is that the articles are written in prose; they contain very 
few of the customary abbreviating symbols of mathematical exposition. In a 
recently much publicized phrase, the articles are intended to contribute to the 
mathematical version of “cultural literacy”; they don’t say what a subject really is, 
but they say enough about it to make casually dropped tea-time references more 
comprehensible than they were before. 

A bibliography for a report such as this would almost be a bibliography for all 
mathematics. As a compromise between that and nothing, I offer exactly one 
reference at the end of each section. The understanding of a reader who consults 
one of these references would, I hope, be increased by the explanation there. In 
addition, such a reader would, I know, find further references to the articles and 
books that treat the subject of interest. 

The unique reference for each section is either to 


The Encyclopedic Dictionary of Mathematics 
(abbreviated EDM) 
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(prepared by the Mathematical Society of Japan, M.I.T. Press, 1980) or to the 
journal 
The Mathematical Intelligencer 
(abbreviated MI). 

EDM references appear in the form 

EDM 146/B 
(referring to the article labelled 146/B), and MI references appear in the form 

MI 8/1/40 
(referring to volume 8, number 1, page 40). 

That’s it; now let’s get down to work, and see what these concepts, explosions, 

and developments really are. 


Concept 1: Moore-Smith limits. Modern general topology, and in particular its 
manifestations in the so-called weak topologies of certain function spaces, taught 
us that limits of sequences are no longer a strong enough tool for analysis. In 
classical analysis we learn that a set is closed if and only if it contains the limits of 
all convergent sequences in it, but there are many important and useful topological 
spaces for which that statement is not true. If, however, sequences are replaced by 
“generalized sequences” (the streamlined word is “nets’’), and, correspondingly, 
ordinary limits of sequences are replaced by Moore-Smith limits of nets, the 
classical proofs work again, often with no changes except terminological ones, and 
they yield results just as useful as the classical ones. (Example: a set is closed if and 
only if it contains the Moore-Smith limits of all convergent nets in it.) 

The Moore of the seminal 1922 Moore-Smith paper is the great E. H. Moore 
(one of the teachers of R. L. Moore) and the Smith is the otherwise largely 
forgotten H. L. Smith. The word “net” was first used, so far as I know, by J. L. 
Kelley in 1950. 

Reference: EDM 89/H. 


Concept 2: Distributions. The set S of infinitely smooth functions R” with 
compact support is in an obvious way a vector space. Let us say that a sequence 
{y,,} of such functions converges to 0 if the supports of all the g,, are covered by 
some fixed compact set, and not only does the sequence converge uniformly to 0, 
but, in fact, so do all its mixed partial derivative sequences. 

If now f is a complex-valued integrable function on R”, then the equation 


T;(@) = [o(x) f(x) de 


defines a continuous linear functional on S, and the same is true if f is only locally 
integrable. If uw is a complex finite measure in R”, then the equation 


T,(@) = | e(x) du (2) 


defines a continuous linear functional on S, and the same is true if yw is required to 
be finite on compact sets only. If, for instance, w4( EZ) = 1 when the origin belongs 
to E and p(E) = 0 otherwise (in other words, w is a point mass at 0), then 
T,(~) = (0) for every g—which may remind you of the curious behavior of 
something that used to be called the Dirac delta function (but which, of course, 
never was a function). 
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All continuous linear functionals on S are called distributions; the inspiration of 
Fields medalist Laurent Schwartz (and the other giants on whose shoulders he 
stood) was to realize that the concept of distribution generalizes the concept of 
function and that in many classical situations, in which no function exists that 
satisfies certain prescribed conditions, a distribution satisfying them may exist and 
is for all practical purposes just as good. A typical kind of “prescribed condition” 
is a partial differential equation to be solved—a distribution solution often gives as 
much applicable information as a classical honest one. 

Reference: EDM 130/B. 


Concept 3: The Monte Carlo method. Mark a table with a system of parallel 
lines spaced, say, two inches apart from one another, and consider a needle (a 
mathematically idealized needle, in other words a line segment) of length one inch. 
Drop the needle on the table at random and ask: what is the probability that the 
needle will not land between a pair of the parallel lines but will cross one of them? 
This is the famous Buffon needle problem, and the method of working it out is not 
especially difficult; it is an exercise in many elementary probability texts. The 
answer turns out to be 1/7. 

Is that a bit of a surprise? How does 7 get into the answer to this question? 
That’s not the main issue right now, but as long as it’s been raised a comment on it 
might be in order. The answer to the probability question depends, of course, on 
how the needle gets on the table—or, to say the same thing more precisely, it 
depends on the probability distribution that is tacitly or otherwise assumed to be 
the one that the fall of the needle is subject to. Different possible distributions 
have been studied, and, to nobody’s surprise, yielded different answers. One 
possibility is to let the probability distribution of the location of the center of the 
needle be uniform on a line segment perpendicular to the parallel lines and of 
length three inches, say, and, independently, to let the probability distribution of 
the angle at which the needle is inclined to that perpendicular segment be uniform 
between 0° and 180°. Once angles are explicitly mentioned that way, it should be 
no surprise that zr enters the answer. 

If the answer is accepted, a curious consequence follows. Perform the experi- 
ment, repeatedly, many times, and count the ratio of the number of successes to 
the number of trials—the law of large numbers tells us that that ratio will be very 
nearly equal to 1/7r. Conclusion: the value of 7 can be determined from a physical 
(probability) experiment, with no calculation (except the formation of a reciprocal 
at the end). This was known and appreciated a couple of hundred years ago—it 
was a forerunner of the modern technique known as the Monte Carlo method, 
introduced around 1945 by Ulam and von Neumann. 

The Monte Carlo method has had frequent applications when the obvious 
calculations that a question demands seem formidable—replace the question by a 
probability question that has the same answer and find the answer to the probabil- 
ity question either by actual experimentation, or, more likely, by computer simula- 
tion of random experimentation. An easy example is the evaluation of a definite 
integral {> f(x) dx where f is a bounded function (for definiteness, 0 < f(x) < 1 
whenever 0 < x < 1). Proceed as follows: choose (or let your computer choose for 
you) a large set of pairs (x, y) of random numbers, with 0 <x < 1,0 <y <1, and 
then evaluate the ratio of the number of them for which y < f(x) to the total 
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number. That ratio is approximately equal to the average {> f(x) dx—and, lo and 
behold, the integral has been (approximately) evaluated. 

This example is a little too naive for the theory, but the genuine real-life 
applications are in spirit pretty much the same. To calculate something too messy 
to calculate, whether or not the something is already a probability, replace the 
question by a probability question (or a different probability question) that is more 
easily susceptible of experimental (or, rather, computer imitated experimental) 
evaluation. 

Why the name Monte Carlo? Presumably because that’s the name associated 
with one of the most famous gambling places of the world. What the name suggests 
is that if you can’t solve a problem, try gambling with it—that is, replace it by a 
gambling problem that has the same solution and that you have a better chance of 
being able to solve. 

Reference: EDM 378 /B. 


Concept 4: Categories. If you understand vector spaces and linear transforma- 
tions, you are 90% of the way toward understanding categories. Mathematicians 
have been viscerally aware for a long time that vector spaces and linear transfor- 
mations behave “just like” groups and homomorphisms, and they in turn behave 
just like topological spaces and continuous mappings—the parallelisms are great 
and all pervasive. If you know what subgroups are, you know what subspaces are, 
and if you understand how to form quotient groups, you are in a good position to 
discover (to rediscover) how to define quotient spaces. 

Category theory is a formalization of that visceral understanding; it was born (to 
Eilenberg and Mac Lane) in 1945. A category is a class of “things” of two kinds, 
objects and morphisms, satisfying three easy axioms. Brutally summarized, those 
axioms say that morphisms can be composed, provided only that their domains and 
ranges match properly, that for each object X there is an identity morphism from 
it to itself, and that if Hom(CX, Y ) denotes the set of all morphisms from X to Y, 
then the only way HomCX, Y) and Hom(X’, Y’) can have any elements in common 
is to have X = X' and Y=Y’. 

Vector spaces, and groups, and topological spaces fit under this scheme, and so 
do abstract sets, and rings, and differentiable manifolds, and modules, etc., etc. 
More special concepts than just morphisms can be defined within each category 
(such as monomorphisms, subobjects, direct products, and duality); they are all 
pleasant and handy and none of them holds any surprises. 

An important part of the theory of categories is the theory of functors. A 
functor is a way of associating with the objects and morphisms of one category the 
objects and morphisms of another. Typical example: each vector space has a dual, 
and each linear transformation between vector spaces has an adjoint. Trivial but 
helpful example (called the forgetful functor): each topological space determines a 
set and each continuous mapping determines a mapping—just forget the topology 
and the continuity, and thus pass from the category of topological spaces to the 
category of sets. 

Category theory started as (and continues to be) a convenient language in which 
to describe many phenomena, and for many of us that’s all it is. For the fascinated 
specialist it is a subject of research that can discuss quotient categories, adjoint 
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functors, categories of categories, and other such towers of elaborate complications 
of great fascination—long may it wave. 
Reference: EDM 53/A. 


Concept 5: K-theory. If U and V are finite-dimensional vector spaces with the 
field of real numbers acting as scalars, then so is the direct sum U @ V of U and V. 
At a casual glance the formation of direct sums looks like a perfectly respectable 
addition process, but a second glance shows that there is something a little wrong: 
the process isn’t associative. The direct sum (U ® V) © W resembles the direct 
sum U @ (V ®@ W) in many ways, but strictly speaking it’s not the same; the 
elements of the first are ordered pairs whose first coordinates are ordered pairs, 
whereas the elements of the second are ordered pairs whose second coordinates 
are ordered pairs. The temptation to consider the correspondence that assigns to 
every element ((u,v),w) in the first direct sum the element (u,(v,w)) in the 
second is well nigh irresistible. That correspondence is an isomorphism between 
the direct sum spaces involved, and the natural way out of the non-associative 
obstacle is to identify two vector spaces if they are isomorphic. If that is done, and 
that is the normal thing to do in this situation, another small obstacle arises, but 
it’s easy to make it too go away: it has to be checked that the addition process can 
be unambiguously defined for isomorphism classes of vector spaces as well as for 
just plain vector spaces. That is, what has to be shown is that if U and U’ are 
isomorphic vector spaces and if V and WV’ are isomorphic vector spaces, then the 
direct sum U © V is isomorphic to the direct sum U’ @ V’. That is true and almost 
obvious, and once it is on record, then the direct sum of two isomorphism classes 
of vector spaces can be defined by choosing a representative element of each class, 
forming the direct sum of those representative elements, and then forming the 
isomorphism class of that direct sum. 

Very well—a sort of addition can be defined for vector spaces (yes, yes—they 
should be called isomorphism classes of vector spaces, but everyday mathematical 
idiom continues to call them vector spaces anyway, and mentally remembers that 
equality no longer means honest equality but isomorphism)—what can be said 
about the class of all vector spaces under this addition? Natural question: do they 
form a group? The first part of the answer is easy—there is an identity element, 
that is, there is a neutral vector space with the property that adding it to any other 
produces no changes—namely, the unique 0-dimensional vector space. The next 
part of the answer is dishearteningly negative: no, vector spaces under direct sum 
do not form a group, because inverses, negatives, do not exist. 

The setback is not serious—it is no more serious than that encountered by a 
child who has learned to add positive integers, has discovered 0, and is discouraged 
by his inability to go backward—subtraction doesn’t always make sense. To 
subtract u, say, from u + v is something that can be done, but subtraction of an 
arbitrary positive integer from another may or may not be doable. Similarly, 
someone might be willing to say that subtracting U from U @ V is permissible, and 
yields the answer V, but what about the general case, and, in particular, what 
about subtracting U @ V from U? | 

The way out, for vector spaces, is the same as the way out was for integers—the 
integers that do not exist are created by fiat, or by an explicit set-theoretic 
construction: negative numbers, and negative direct sums (!) are adjoined to the 
system under the study, and the result is a perfectly good group. 
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What group is it? The answer is easy to state and easy to understand—it is, 
except for notation, the same as the additive group of all integers. The point is that 
if isomorphic finite-dimensional vector spaces are identified, then there is nothing 
to distinguish between two vector spaces except their dimension—which is a 
nonnegative integer—and, consequently, the process of adjoining “negatives” to 
vector spaces (or, rather, to their isomorphism classes) is exactly the same as the 
process of adjoining negatives to the nonnegative integers. 

The procedure indicated above can be carried out, and turns out to be 
profitable to carry out, even if the field of real numbers is replaced by an arbitrary 
ring. Rings occur in “the real world” more often than fields do, but they constitute 
a more difficult subject. Algebraic K-theory is a partial attempt to face the 
difficulties. What makes the use of fields easier is that a finite-dimensional vector 
space has a finite basis. No, that’s not a redundancy. To see that it is not, rephrase 
it this way: a finitely generated module over a field has a finite basis. (Recall that 
the definition of a module is just like that of a vector space except that an arbitrary 
ring is allowed to play the role of the coefficient field.) The point is that the 
corresponding statement for modules over rings instead of fields is not always true. 
It is true for “good” modules, but a precise definition of “good” is a technicality 
that can safely be omitted from an overview such as this one. (According to a 
popular definition the good modules are the ones that are finitely generated and 
projective.) 

All right then, if vector spaces are replaced by modules, the group theoretic 
construction goes pretty much the same way. Addition is defined for “good” 
modules by forming direct sums, and then the definition has to be modified, as 
before, so as to apply to equivalence classes of modules instead of modules; the 
result is a lovely associative addition, with a zero element, but no inverses. 
(Caution: the useful notion of equivalence here is a slightly weakened version of 
isomorphism, but the principal features of the theory remain the same anyway.) 
Let inverses, negatives, be adjoined, by fiat, or by the usual construction of forming 
ordered pairs (“differences”) and equivalence classes thereof, and the result is a 
group. The equivalence relation that defines the group assigns to each module over 
a ring R an element of the group—that element plays the role of the generalized 
dimension of the module. The group thus associated with the prescribed ring R is 
denoted by K,(R) and is called the Grothendieck group of R—and that associa- 
tion is the first step of what is called K-theory. 

K-theory does more than that first step: in general it associates an abelian group 
K,,(R) with every ring R and every nonnegative integer n, in ways that become 
more and more mysterious as 1 increases. For n = 0, the result is a generalization 
of dimension; for n = 1, it is a generalization of determinant; after that—don’t 
ask. But, in any event, it can’t do any harm to take a quick look at n = 1. 

The objects at the center of the stage this time are not modules, but automor- 
phisms of modules. If the coefficient ring is a field, a good assumption to begin 
with, then automorphisms can be viewed as nonsingular matrices. When should 
two such objects be regarded as equivalent? One classically inspired possibility is 
to call two matrices equivalent just when they can be obtained from one another by 
a finite sequence of elementary transformations. Elementary transformation? In 
this context that means: add an arbitrary scalar multiple of any row (or column) to 
any other row (or column). Alternative definition: call a matrix elementary if it is 
either equal to the identity matrix or differs from it at exactly one non-diagonal 
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entry, and then define an elementary transformation as multiplication, fore or aft, 
by an elementary matrix. Since an elementary matrix obviously has determinant 1, 
it follows that every matrix in the group generated by the set of all elementary 
matrices has determinant 1, and that two equivalent matrices must have the same 
determinant. The two theorems in the preceding sentence are the weaker halves of 
two better theorems, which say that a matrix has determinant 1 if and only if it is a 
product of elementary matrices, and that two nonsingular matrices (over a field) 
are equivalent if and only if they have the same determinant. Summary: the 
quotient group of the group of all invertible matrices by the (normal) subgroup 
generated by the elementary matrices is equal (well, isomorphic) to the multiplica- 
tive group of nonzero elements of the coefficient field. 

The point of this summary is that it is a nontrivial statement about determinants 
of matrices over fields in which the word “determinant” does not occur. Determi- 
nants of matrices over rings are hard to come by, but elementary transformations 
and elementary matrices make perfectly clear sense. The concept they lead to must 
cope with two kinds of matrix operations—the algebraic ones of sum and product 
and the geometric one of direct sum. The best way to do that is to apply similar 
considerations to infinite matrices each of which is in fact a direct sum of an 
infinite identity matrix and a finite invertible matrix (with entries in a prescribed 
ring R). The resulting quotient group (modulo the subgroup generated by the set 
of all elementary matrices) is denoted by K ,(R) and is called the Whitehead group 
of R. It turns out to be abelian (a nontrivial theorem), and its elements are the 
generalizations of determinants (of matrices over fields) inthe same sense in which 
the elements of K,(R) are the generalizations of dimensions (of vector spaces over 
fields). 

The uses of K-theory in algebra and topology are many, and, in particular, the 
topological cousin of the algebraic theory discussed above is essentially involved in 
the proof of the Atiyah-Singer generalization of the Riemann-Roch theorem. 

Reference: EDM 2306/1. 


Concept 6: The fast Fourier transform. The Fourier transform of a function f 
on the real line is usually defined to be the function g given by 


1 re —ixy 
g(x) == J f(y) e dy, 


assuming that the integral exists in some decent sense. The Fourier transform is a 
powerful tool in modern pure analysis and it is also a powerful tool in classical 
hard analysis and many of its concrete applications. In the applications it is, of 
course, desirable to minimize the computations that enter, and it is not surprising 
that various approximations are frequently introduced, such as replacing the 
infinite interval by a finite one and replacing the integral by a sum. 

The discrete Fourier transform comes therefore to play a role—it is the 
transformation that assigns to an n-tuple (Xy,..., x, _,) of complex numbers the 
n-tuple (y,,..., Y,—1) given by 

1 771 . 
Ye=— Lx, e rmra/n, g=0,1,...,.n-1. 


N »=0 


The so-called fast Fourier transform is a minor observation—an inspired minor 
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observation—whose discovery is variously attributed to J. W. Cooley and J W. 
Tukey (1965) and to C. F. Gauss (1805). The observation is that the sum, the 
discrete Fourier transform, can be evaluated by expressing it as a sum of clever 
subsums and making clever use of the algebraic properties of the roots of unity 
(the numbers e  *7/?4/") that enter. The popular name of those algebraic proper- 
ties is “trigonometric identities’. The idea that makes cleverness possible is that 
the Vandermonde matrix formed with the powers of a primitive nth root of unity 
can, for some n’s, be factored in such a way that the factor matrices have many 
entries equal to 0, and that, as a result, the subsum evaluations need less 
arithmetic. 

The evaluation of each y, by the definition of the discrete Fourier transform 
involves the evaluation of n — 1 products (x, times e~*7'?4/" for p = 1,...,n — 1) 
followed by n — 1 additions (the partial sums). Since there are n values of y,, the 
total work involves n(n — 1) additions and the same number of multiplications. 

The simplest version of the fast Fourier transform applies to the case in which n 
is represented as a product of two factors, n = hk. In that case the fast Fourier 
transform can involve as few as n(h + k — 1) additions and n(h + k) multiplica- 
tions. If, for instance, n = 100 = 10 X 10, then the direct method takes 9900 
additions and 9900 multiplications, whereas the fast method takes 1900 additions 
and 2000 multiplications—a huge difference, which gets huger as n grows and the 
process is iterated. 

It’s a beautiful and useful part of what used to be called numerical analysis 
(before it came to be called computer science). Is it more than a trick? Does it 
teach us anything about pure analysis, about the group of roots of unity—does it 
have valuable analogues in other groups? 

Reference: MI 7/3/49. 


Concept 7: Nonstandard analysis. Leibniz used infinities and infinitesimals, but 
he admitted feeling slightly queasy about them. His successors banished such 
things from the mathematical heaven and worked with e’s and 6’s instead. The 
modern theory of nonstandard analysis dredged the forbidden concepts up from 
the underworld and is trying to reinstate them at the right side of Cauchy’s throne. 

A possible attitude toward infinitesimals is to regard them as “ideal’’ elements, 
similar in that respect to the points at infinity in the projective plane. Once 
assumed, or defined, or constructed, the main tool in working with them is the 
so-called transfer principle, which says (roughly) that anything sayable in the 
appropriate formal language and true about the nonstandard universe remains 
true about the standard universe. 

The basic standard universe consists (roughly) of “individuals” (Urelemente), 
and sets of them, and sets of sets, and sets of sets of sets, and so on ad infinitum. 
The nonstandard universe has many extra elements; they are (roughly) functions 
with values in the standard universe, or, slightly more precisely, classes of functions 
that have been identified according to a suitable equivalence relation. There is a 
sense in which the new elements are reminiscent of sequences. A constant 
sequence, whose constant value belongs to the standard universe, plays the role of 
that value, but there are sequences that are infinitely small (converge to 0) and 
others that are infinitely large (diverge to ~), and in the democracy of the 
nonstandard universe they are all treated equally. 
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Consideration of such universes leads to ordered fields that are like the field of 
real numbers, and play the role of that field in the theory, but that are non-Archi- 
medean—that is, they do not share with the real numbers the property that a small 
thing added to itself often enough becomes a large thing. For those who are 
comfortable with the use of formal languages, and believe that infinitesimals and 
infinities are intuitively clear and pleasant concepts, nonstandard analysis seems to 
be an efficient tool—such people can use that language to think in and, they say, 
they are led thereby to make otherwise elusive discoveries. Their greatest single 
victory so far (the only one that every proselyting article quotes) is the first proof 
(by Allen Bernstein and Abraham Robinson) of an invariant subspace theorem 
about certain special operators in Hilbert space—and, with considerable justice, 
they refuse to be fazed by the fact that soon after the first proof (nonstandard) 
other (completely standard) proofs came along. 

The future of the subject is not yet clear to most non-believers—will it or won’t 
it become an established part of mathematics—will it or won’t it displace e’s and 
6's? 

Reference: EDM 274 /E. 


Concept 8: Catastrophes. Consider the graph of the parabola given by the 
equation y* = x and consider the mapping (x, y) > x (projection) from that graph 
to (the positive part of) the x-axis. With exactly one exception every point of the 
graph has a neighborhood that is homeomorphic to its image, which is an open 
interval on the line. The exception is, of course, the origin; in no neighborhood of 
the origin is the projection mapping one-to-one. This phenomenon is described by 
saying that the origin is a singularity of the (smooth) mapping from a 1-dimen- 
sional manifold (the graph) to another (the axis). 

Consider the surface of a sphere, project it perpendicularly to a plane, and note 
that the local behavior of the projection varies from point to point. Each point of 
the northern hemisphere has a neighborhood in which the projection acts as a 
homeomorphism, and the same is true of the southern hemisphere, but at points of 
the equator the projection has a kind of singularity called a “fold.” It looks like a 
fold, locally, doesn’t it?: a small neighborhood of a point of the equator is, in 
effect, folded over on itself by the projection. 

Consider next a suitable cubic surface (visualize it as a plane folded over a part 
of itself and then folded back again at a slightly different angle), project it 
perpendicularly to a plane, and note that this time the projection can misbehave in 
different ways. In the case of the sphere, any point of the plane that had an inverse 
image at all had either one or two inverse images; for the cubic surface that 
number can be 1, 2, or 3. The points for which it is 3 correspond to the regular 
points of the projection mapping, the points for which it is 2 correspond to a fold 
singularity, and the unique point for which it is 1 comes from a singularity called a 
“cusp”. The reason for the terminology this time is that the “fold” points on the 
surface project onto a semicubical parabola in the plane whose cusp, in the usual 
sense of the word, comes from the cusp singularity. 

The mathematics of catastrophe theory studies the problem of classifying the 
singularities of smooth mappings (such as the projections above) between smooth 
manifolds (such as the parabola, the sphere, and the cubic surface above). A 
typical result (for the 2-dimensional case) is that every smooth mapping of a 
surface to a plane has, after possibly an appropriate small perturbation, nothing 
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but cusps and folds for its singularities. In higher dimensions there are similar (and 
surprisingly short) complete lists of “elementary catastrophes”. The theory goes 
back to the work of Whitney in 1955; since then it has been extensively cultivated 
and applied by several mathematicians. The list of names includes Thom in France 
and Arnold in Russia; both of them have made important contributions to both the 
theory and its applications. Legitimate applications exist, for instance, to wave 
propagation problems, elastic stability, and geometric optics. 

Where does the term “catastrophe” come from? The answer seems to be that it 
was suggested (by Thom?) as an appropriate way to describe startlingly discontinu- 
ous changes in the world around us. Here is a simple example: suspend a weight 
from a horizontal bar, and ask for the maximum supportable load. Intuition and 
experience suggest that as the load is increased nothing much happens, except 
possibly a gentle bend, till, suddenly, the bar breaks and the weight falls—a 
catastrophe. A more complicated example that is mentioned by almost everybody 
who writes on the subject is the degree of aggressiveness of a dog that is 
approached by a possible enemy. The dog feels both fear and rage—which is 
greater? What will the dog’s response be: flight or fight? Once again intuition and 
experience suggest that the dog will cower at first and back away, till suddenly, 
because the distance to the enemy has decreased too much and the threat has 
increased beyond the supportable point, the dog will snarl and attack—a catastro- 
phe. 

How does it happen that a deep mathematical theory, with genuine and 
valuable applications, becomes as controversial as catastrophe theory has become? 
There is nothing wrong with the theorems that Whitney, Thom, Arnold, and others 
have proved; what then can a controversy be about? The answer is that some of 
the proponents of the theory (notably Thom himself, and Zeeman) are describing 
and claiming applications that seem to others (notably Arnold) far-fetched and 
unsound. The applications claimed are to economics, embryology, linguistics, 
psychology, and other subjects, and include political elections, mental disorders, 
prison riots, heart beats, stock market crashes, and the outbreak of war. 

Here are four of the many comments that Arnold makes. 

“The mathematical articles of the founder of catastrophe theory, René Thom, 
were reprinted as a pocket book—something that had not happened in mathemat- 
ics since the introduction of cybernetics from which catastrophe theory derived 
many of its advertising techniques.” 

‘A particular aspect of Thom’s work on catastrophe theory is his original style: 
he established a fashion in not giving even sketchy formulations of results, let 
alone proofs.” 

“When the mapping we are concerned with is known in detail we have a more 
or less direct application of the mathematical singularity theory to various natural 
phenomena...In the majority of works on catastrophe theory, however, more 
controversial situations are considered where not only are the details of the 
mapping not known, but its very existence is problematical.” 

“The deficiencies of... catastrophe theory are too obvious to discuss in detail. 
We remark only that articles on catastrophe theory are sharply distinguished by 
the catastrophic lowering of the level of the requirement of rigour and also of the 
requirement of the novelty of the published results. Although one can understand 
the catastrophe theorists’ reaction against the traditional flow in mathematics of 
rigorous but dull epigonic works, nevertheless their lack of respect to their 
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predecessors (to whom belong the majority of concrete results) can hardly be 
justified.” 
Reference: EDM 410/K. 


Concept 9: Chaos. To a mathematician the word “point” is virtually synony- 
mous with “element of a set”. Let me give a moderately complicated set whose 
“points” are quite a bit more complicated than the points that most students 
usually meet. Begin with the closed unit interval J = [0,1], and let X be the set of 
all finite disjoint unions of nondegenerate closed subintervals of J. (“Nondegener- 
ate” means that single points are not to be considered as intervals.) A “point” in 
the following paragraph is to be an arbitrary element of X. 

Let 7. be the mapping of the set X into itself that acts on each point P by 
removing the open middle third of each of the closed intervals whose union P is. 
Thus, for instance, since / itself is a point of X, the image 7.(/) makes sense and 
is equal to the set (point) [0, 3] U [4, 1]. What happens when the mapping T, of X 
into itself is iterated? Start, for instance, with J, and form _ successively 
T-1), TEC), Te), .... That decreasing sequence of finite unions of closed 
intervals is familiar to anyone who has ever taken a course in the set theory of real 
numbers: the intersection of that sequence of sets is the classical Cantor set. (The 
suffix C is intended to be a reminder of Cantor.) 

The transformation so described is an artificial but interesting and suggestive 
example of a dynamical system. According to the most general definition, a 
dynamical system is a mapping of a set into itself. That definition, however, is too 
general to be of much use. The concept becomes useful when the underlying set 
and the transformation that acts on it have some structure of mathematical interest 
(algebraic, analytic, or geometric), and in all extant studies of dynamical systems 
such extra structures are indeed present. 

The concept of “chaos” depends on that of dynamical system; roughly speaking, 
the theory of chaos is the study of the behavior (misbehavior?) of dynamical 
systems at infinity. Take any transformation T of any set X into itself, form the 
successive iterates T, 77, T°,..., and then ask an intelligent question that involves 
them. Example: let X be the real line, define 7x to be cos x for every x in X, and 
ask: what happens to the sequence x, 7x, T*x,T?x,... for various values of x? 
The answer is easy to derive, but if you have never done it, I would suggest that 
first you take a hand-held calculator that has a “cos” button, start with any original 
input x that you like, and keep pushing “cos” over and over again. (The result 
looks a little more interesting if you use radians instead of degrees.) 

Another example is the dynamical system 7. above. For still another example, 
much deeper, let X be the closed unit interval with its end points identified (so 
that this time a “point” is a real number between 0 and 1 inclusive, except that 0 
and 1 count as the same point), and let TJ, be the act of doubling modulo 1. Thus, 
for example, 
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and 
T,(v2 — 1) = T,(1.4142... — 1) = 2¥2 — 2 (mod1) 
= 2.8284... — 2 = 0.8284... . 


What happens to the sequence x, T, x, 77x, T3x,... for various values of x? Can it 
converge to some limit? Can its closure contain an open interval? This example, 
relatively innocent as it may look, is in fact quite deep; it is a close relative of one 
that is topologically important (the so-called Smale horseshoe) and one that is 
analytically important (the so-called unilateral shift). 

An example of a dynamical system that is more reminiscent of classical analysis 
than the ones above is obtained as follows: choose a couple of constants a and b, 
and define T = T., in R* by 


T(x, vy) = (y+ 1 - ax’, bx). 


Transformations of this kind were introduced and studied by Hénon. The formula 
defining them is not enough to scare even a student of elementary analytic 
geometry, but the study of their asymptotic properties is delicate and interesting. 
Those properties depend, of course, on the parameters a and Db. If, for instance, 
a =1.3 and b = 0.3, then there exists a set of seven points in the plane that 
constitute a “periodic attractor”. That means that the iterated images of a point in 
the plane will either tend to infinity or else tend to get closer and closer to one of 
the seven points, and then to a next one, and then a next, and then, at the eighth 
step closer still to the first, and then to the second, etc. If, on the other hand, 
a =1.4 and b = 0.3, then the iterated images of some points tend to infinity, 
whereas for other points those images cluster at a complicated set of curves that 
constitute what is called a “strange attractor’ (the “Hénon attractor”). A good 
definition of that concept is yet to be offered. The main property of a strange 
attractor seems to be that it is an infinite set with “sensitive dependence” on the 
initial conditions (whatever that means). Strange attractors occur in, for instance, 
the study of turbulence. 

Why is the word “chaos” used? The reason seems to be similar to the reason 
that caused catastrophes to be called catastrophes—it seems to be a subjective 
(not really a mathematical) reaction to an unexpected appearance of discontinuity. 
A possible source of confusion is that the startling discontinuity can occur at two 
different parts of the theory. Frequently a dynamical system depends on some 
parameters (the way the Hénon 7,, depends on a and b), and, of course, the 
sequence of iterates x, Tx, T*x, T°x,... of a dynamical system T applied to any 
particular x always depends on the initial point x. The startling change of the 
Hénon family (from periodic attractor to strange attractor) is regarded as 
chaos—unpredictability—and the very existence of the Hénon strange attractor, 
not obviously visible in the definition of the dynamical system, is regarded as chaos 
—unpredictability. I would like to register a protest vote against the attitude that 
the terminology implies. The results of nontrivial mathematics are often startling, 
and when infinity is involved they are even more likely to be so. It’s not easy to tell 
by looking at a transformation what its infinite iterates will do—but just because 
different inputs sometimes produce discontinuously different outputs doesn’t jus- 
tify describing them as chaotic. 

A part of the theory of chaos has acquired a kind of popularity (notoriety?), and 
no discussion of the subject can omit at least mentioning that part. I refer to the 
things called “fractals”. 
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The artificial dynamical system 7. with which this exposition began led to the 
Cantor set, which occurs in chaos theory frequently and non-artificially. A curious 
and important property of the Cantor set is that its part in the interval [0, $] is 
similar to the entire set—yjust blow it up by the dilatation x > 3x and the small set 
becomes identical with the large one. Similarly, the part of the Cantor set in the 
interval [Z, 4] is similar to the entire set; in fact, every neighborhood of every 
point of the Cantor set includes a subset that is similar to the entire Cantor set. 
The iteration of transformations tends to produce sets that exhibit such “self 
similarity”, that is, sets that are invariant under change of scale; the Hénon 
attractor itself is another example. 

The phenomenon is not new to the mathematical world. It has long been known 
that the Cantor set is the prototype of a large family of sets obtained by various 
architectural changes. Example: omit middle fifths, say, instead of middle thirds. 
Another example: replace the discarded open middles by, say, equilateral triangles. 
Still another: perform middle omissions not on an interval but on rectangles or 
disks or other interesting sets in the plane. All such sets exhibit local properties 
that are pleasing and surprising when first encountered but that become, upon 
acquaintance, so nonsurprising as to be almost trite—they are like friends whose 
company is as desirable as ever but whose shape and whose gestures remain 
completely familiar even when they grow old or wear disguises. 

The Cantor set is a fractal, and so are various variations on that theme. (No 
definition of the general concept is needed here, but I mention in passing that a 
standard one is this: a fractal is a set whose dimension in the usual sense of 
topology, which is always less than or equal to its Hausdorff-Besicovitch dimen- 
sion, is in fact strictly less. The topological dimension of the Cantor set is 0, and its 

log 2 

log3- 
fractals is that they can be pretty. Certain planar versions of the Cantor set, with or 
without “colorization”, look good on your living room wall, and disguising them by 
changing their sizes and shapes can produce enough pretty pictures to fill the 
whole wall or even an entire room in a museum of modern art. 

“Art” is perhaps the operative word. A great mathematician conceives new 
theorems, discovers their beautiful proofs, and shares the results with the rest of us 
by publishing them. A great cook conceives new dishes, discovers their ingenious 
recipes, and shares the results with the rest of us by preparing them. An infinite 
Andy Warhol iteration of the label on a can of soup may be popular and it may be 
art, but it has very little to do with the genius of a cook. The infinite iteration of 
Cantor-like omissions may be a popular subject of conversation and it may be art, 
but it has very little to do with the insight of a mathematician. Paintings of cans of 
soup are not cooking, and paintings of Cantor sets are not mathematics. 

Reference: MI 2/3/1206. 


Hausdorff-Besicovitch dimension is ) The reason for the popularity of 


Explosion 1: The four-color theorem. Everybody knows what the four-color 
problem is, but I’d like to begin by asking some elementary questions about it 
anyway, some questions that you may not have bothered to ask yourself before. 
The point is that the problem concerns the Euclidean plane, and the topology of 
the plane can exhibit some unexpected and weird phenomena. 

Did you know, for instance, that there exist five regions in the plane (“the lakes 
of Wada’) all of which have the same frontier? With “two” instead of “five” 
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everybody does know it: just remember the right and left half planes, that is, in 
terms of Cartesian coordinates, the set of points with positive x coordinates and 
the set of points with negative x coordinates. The common frontier is the y axis, 
and there is nothing surprising about that. (By the way, “‘region” in this context has 
its meaning familiar from a first course in complex variable theory: an open 
connected set.) What about “three’”’ regions with a common frontier—is there a 
simple example of that? No, there is not; anybody who can construct an example 
with three can construct one with five just as easily, or, for that matter, with any 
other finite number. 

Doesn’t that sound like a negative solution of the four-color problem? The five 
regions represent five countries on a planar map, and since each two of them 
touch (at their common frontier), the only way to color the map correctly is to use 
five different colors. 

Yes, that is a negative solution of the four-color problem, but only if the 
problem is carelessly formulated. If five regions have a common frontier, then they 
and their frontier must twist and interlock in an extraordinarily nasty manner. The 
classical four-color problem assumes (usually tacitly) that the countries repre- 
sented on the map are healthy, not pathological. A simple way to rule out 
pathology is to demand that the frontiers of the countries be polygons (a finite 
number of line segments); the full depth and difficulty of the problem is present 
even in that strongly restricted case. If that seems too severe, it is all right to allow 
the frontiers to be pleasant continuous curves, but, in any event, as the five-region 
example shows, some geometric restriction is necessary. 

A different kind of instructive example to look at is a map consisting of a circle 
in which a certain number of diameters have been drawn (say, for instance, three). 
Those diameters divide the circle into six countries each the shape of a slice of pie, 
and all six “touch” at the center. Does that mean that six colors are needed to 
color the map? No, it does not. That’s a situation that the usual statement of the 
four-color problem does foresee and does rule out: two countries are said to 
“touch” only if they have a reasonably long piece of their frontiers in common, 
more than just some isolated corners. 

Very well then: what is the smallest number of colors that is sufficient to color 
all decent maps? Could it possibly be two? One way to answer that question is to 
ask whether it is possible to design a map that represents three countries each of 
which touches both the others. The answer to that is easy: just make a circular map 
and make the three countries like three slices of pie with each of them having an 
angle of, say, 120° at the center: in that case each of them is “between” the other 
two. Conclusion: two colors are not enough to color all maps. 

All right: is there a map with four countries so that each of them touches all 
three of the others? Yes—take the pie map just constructed and punch a hole in it 
—replace the center by a substantial circular country, with, say, half the radius of 
the entire map. The outside countries still have the property that each of them is 
between the other two, and, at the same time, each of them touches the center 
country. Conclusion: three colors are not enough to color all maps. 

Does the technique used twice by now work for the next stage: is there a map 
with five countries each of which touches all others? In an attempt to see the 
answer, change the four-country map just studied in two ways: put the circle that it 
consists of (call it circle C) inside a larger circle (call it D), and cut the small circle 
in its center (call it B) into two parts. The resulting map has six regions: one is the 
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ring shaped region between D and C, two others are the two halves of B, and the 
other three were present before the changes. Call the five regions inside C five 
countries. A glance at the picture shows that three of them touch each of the other 
four, but two of them fail to do so. Adjoin the outside ring-shaped region (as a 
colony?) to that half of B that fails to touch each of the four countries—and, lo!, 
the result is a map of five countries every one of which touches all four others. A 
proper coloring of this map needs five colors—four are not enough. The technique 
is not difficult to generalize; for each positive integer n there exists a map of n 
countries each of which touches all n — 1 others. 

Is something wrong? A possible objection is that (in the case n = 5) one of the 
five countries in the picture consists of two pieces—the country it represents is not 
connected. Is that wrong? Does that happen in the real world? Sure it does; the 
state of Michigan, for one, consists of two pieces. Does that mean that four.colors 
are not enough for all maps? Yes, it does—provided that maps are allowed to 
contain disconnected countries. Once again the usual statement of the four-color 
problem foresees this obstacle, and rules it out; the problem concerns connected 
countries only. 

If that five-color example is not acceptable, what would have to be done to 
prove that four colors are not enough to color all maps? An obvious possibility that 
the preceding discussion suggests is to try again to design a map with five 
countries, but this time connected ones, so that each of them touches all four of 
the others. Augustus de Morgan, in 1852, used ingenious topological arguments to 
prove that there is no such map (at least they would be called topological in the 
twentieth century)—but he was not allowed to jump to the conclusion that he had 
proved the four-color theorem. There was, perhaps, some temptation to jump to 
that conclusion, but it wouldn’t have been right. 

The point is this: to say that n colors are not enough to color a map is not the 
same as saying that it contains a submap of m + 1 countries each of which touches 
all n of the others—not at all. To see that, even for the already known case of 
n = 3, begin by contemplating a circular map with five pie-slice countries (each 
making an angle of 72° at the center). That map can be colored with three colors: 
just go A, B,C, A, B around the circle. Now punch a hole in the middle, that is, 
replace the center by a substantial circular country. The result is a map in which 
no submap of four countries has the property that each of them touches all three 
of the others—but, nevertheless, the entire map cannot be colored with only three 
colors. This argument proves again that three colors are not always enough, but 
this time the proof does not use four mutually touching countries. That is: many 
touching countries imply the need for many colors, but not conversely—many 
colors might be necessary for more subtle combinatorial reasons, and that’s one of 
the sources of difficulty with the four-color problem. 

In 1976 Appel and Haken offered a proof of the four-color theorem. Their main 
achievement was to reduce the problem to a finite one: they showed that once the 
answer is known for a finite list of specific maps (a very large list, to be sure), then 
it is known for all. That reduction is the first step of their proof; the second step is 
a computation. The reduction is standard mathematical reasoning—in principle 
(though definitely not in detail) it was known long ago. That is: it was known what 
kind of reduction was necessary, and many of its steps had already been taken. The 
computation was then carried out electronically, and, when the systematic exhaus- 
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tion procedure (which needed over 1000 hours of computer time on the first run 
through) encountered no obstacles, the computer ran up a flag and blew a fanfare 
of trumpets. Victory: the answer is yes, four colors suffice. 

After the excitement died down, several grumbles and more severe complaints 
began to be heard: the program was never checked, some people said that it was 
uncheckable, and there were even dark rumors of errors, plain old-fashioned 
mathematical errors, gaps in the reduction proof. Grumbles and rumors such as 
that may be interesting, but right now they are beside the point. Assuming that the 
reduction is correct, assuming that the program is correct, and assuming that the 
electronic components functioned completely correctly (and they always do, don’t 
they?), I want to ask: what did we learn from the proof? What do we know now 
that we didn’t know before? 

I do not find that an easy question to answer. To be sure: I am not going to 
spend my time looking for a counterexample to the four-color assertion. The 
printout had at least that practical effect: it discouraged attempts to prove it 
wrong. Except for that, however, I feel that we, humanity, learned mighty little 
from the proof; I am almost tempted to say that as mathematicians we learned 
nothing at all. Oracles are not helpful mathematical tools. 

The most celebrated and difficult outstanding problem of mathematics is the 
Riemann hypothesis. There are many important theorems, with correct proofs on 
record, that are of the form “if the Riemann hypothesis is true, then... .” It would 
be good to know a proof (or disproof?) of the Riemann hypothesis, but till one is 
discovered the theorems in which it appears as a hypothesis can be illuminating 
and useful. That value judgment, however, does not change my opinion of oracles. 
If an oracle told me that the Riemann hypothesis is true, I don’t think that my soul 
would be any richer for that single syllable. 

The development of mathematics shortens proofs, gives insight, and deepens 
understanding by the discovery of ever new concepts and by the resulting subsump- 
tion of old ones under a suitably general theory that took years, decades, and 
sometimes centuries of labor to construct. Technical terms (Banach space, Ar- 
tinian ring, fiber bundle) may frighten and alienate non-specialists, but they are 
just short phrases that abbreviate and clarify the effort, the work, and the thought 
of our great predecessors. 

We may still be far from finding a “good” proof of the four-color theorem. We 
need a simple insight into a new and complicated kind of geometry or intricate 
algebra, and the distance from there to a purely conceptual, existential proof of 
the four-color theorem is probably just as great. I hold as an article of faith, 
however, that we have seen and travelled greater distances than that; the distance 
from the Peano axioms to the Atiyah-Singer index theorem could, for instance, be 
one of them. I believe that the computer (and, for another example, the 10,000 
pages of published proof solving the simple groups problem) missed the right 
concept and the right approach. Their time will come. A hundred years from now 
both theorems (maps and groups) will be exercises in first-year graduate courses, 
provable in a couple of pages by means of the appropriate concepts, which will be 
completely familiar by then. Down with oracles, I say—they are of no use in 
mathematics. 

I cannot, however, stop with that dictum—in all honesty, I should report that 
Appel and Haken do not completely share my religion. Speaking of their work, 


578 PAUL HALMOS [August-September 


they say: “We now know that a proof can be found. But we do not yet (and may 
never) know whether there is any proof that is elegant, concise, and completely 
verifiable by a (human) mathematical mind.” 

One final word. By an explosion I mean a loud noise, an unexpected and 
exciting announcement, but not necessarily a good thing. Some explosions open 
new territories and promise great future developments; others close a subject and 
seem to lead nowhere. The Mordell conjecture, the next explosion below, is of the 
first kind; the four-color theorem of the second. 

Reference: EDM 165. 


Explosion 2: Mordell’s conjecture. Every grade school child knows that 9 + 16 = 
25, or, in other words, that 3* + 4 = 57. A microsecond of thought reveals that 
the equation continues to hold when 3, 4, and 5 are replaced by 3k, 4k, and 5k, no 
matter what k Is. 0 the point of view is to look for integer solutions of the 
equation x* + y? , then there is really no virtue in distinguishing between 
3,4,5 and 6,8, 10. “An efficient way of saying the same thing is to divide 

2 
the equation through by z*, so that it becomes (=) +(>] = 1, and then 
Zz 


x 
replacing 5 and — by x and y, so that the equation becomes x” + y? = 1, with 


the understanding that the solutions sought are rational numbers, not necessarily 
integers. From that point of view all the positive multiples of the 3, 4,5 solution 
become just one rational solution. 

The equation x? + y* = 1 has many rational solutions, infinitely many—or, in 
geometric language, the curve that the equation defines has an infinite number of 
rational points on it. Those points are related to the so-called Pythagorean triples, 
and they are quite easy to find. The situation is different for the Fermat equations 
of higher degree; in fact, the celebrated infamous Fermat problem is to show that 
if n > 2, then the curve defined by the equation x” + y” = 1 has no rational 
points on it at all (except the trivial ones for which either x or y is 0). 

The Fermat problem belongs to a broader context suggested by Louis Mordell 
in 1922. Mordell’s genius in formulating the conjecture was to make the Fermat 
problem simultaneously harder and easier. Harder: enlarge the set of curves 
considered (so as to include all algebraic curves of genus greater than 1 over the 
field of rational numbers—a description that includes the Fermat curves of degree 
greater than 3). Easier: relax the conclusion to be drawn (so as to allow the 
existence of rational points, but, to be sure, only a finite number of them) 
Mordell’s conjecture is a conjecture no longer—it was proved in 1983 by Gerd 
Faltings. Consequence: Fermat’s problem may still turn out to have unexpected 
solutions, but, in any event, for each exponent it has only finitely many! 

Reference: MI 6/2/41 


Development 1: Ergodic theory. If you shoot a billiard ball perpendicularly into 
a side of the table, it will bounce straight back to the opposite side and then keep 
on bouncing back and forth between those two sides. If instead you shoot it 
straight from the middle of one side to the middle of an adjacent side, then it will 
bounce out at the same angle as it came in, hit the middle of the side opposite the 
one you started with, bounce onto the middle of the fourth side, and keep on going 
cyclically around like that. You can probably think of many other such 
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patterns—a carefully specialized original aim will produce a periodic pattern. 
What, however, happens if you shoot the ball at an angle that stands in an 
irrational relation to the sides of the table, or, more clearly put, if you shoot the 
ball at an angle that produces a nonperiodic orbit? Boltzmann, one of the founders 
of ergodic theory in the preceding century, thought about such things and formu- 
lated, among other things, what has come to be called the ergodic hypothesis. 
According to a primitive version of that hypothesis, the path of the (irrationally 
aimed) billiard ball will eventually pass through every point on the billiard table. 

A little cogitation about the topology of a rectangle in the plane versus the 
topology of a curve in that rectangle might convince you that that version of the 
ergodic hypothesis is unlikely at the very least—the applicable laws of mechanics 
are not likely to produce plane filling curves. A modified version of the ergodic 
hypothesis came along: it was the conjecture that the path of the billiard ball, while 
it may not go through every point, will in any event get as near as you like to every 
point—that, in other words, it will be everywhere dense in the billiard table. 

Statistical mechanics occupied itself with meditations such as these till G. D. 
Birkhoff came along with his seminal paper in 1931. What Birkhoff proved was 
that if you take any decent subset of the table, such as the right half, or the interior 
of a circle whose center is the center of the table—whatever—the average time 
that the billiard ball will spend in that subset, no matter where it started, is 
proportional to the area of the subset. The technically difficult part of his 
achievement is that it makes sense to speak of the average time—in other words 
that the limit involved in the meaning of that phrase actually exists—but the 
striking part of the conclusion is that the average time is a constant, independent 
of where the ball started. That conclusion implies, among other things, that if you 
look at the interior of any circle drawn on the table, no matter where it is drawn 
and no matter how large or how small it is, the billiard ball is certain to enter that 
interior over and over again—and hence, in particular, that the path of the ball is 
indeed everywhere dense. 

Birkhoff’s work was published well over 50 years ago, and since then ergodic 
theory has become a major part of mathematics that has rich connections with 
both classical and modern analysis and that is definitely one of the major steps 
forward that was taken in the last 75 years. (Family comment: G. D. Birkhoff’s son 
is Garrett Birkhoff, both at Harvard in their days.) 

Reference: EDM 146/B. 


Development 2: Transcendental numbers. Is the number 2V2 rational? If not, is 
it at least algebraic (that is, a solution of a polynomial equation with integer 
coefficients)? The seventh of Hilbert’s famous problems was about that question 
and the general context to which it belongs. A good way to formulate the general 
question is to ask when numbers of the form a® are transcendental. If a = 0, the 
question degenerates, and the same is true if a = 1; similarly if B is either 0 or 1, 
the question is not one anybody wants to ask. If, more generally, B is rational, then 
the question transforms into easy and generally known subquestions concerning 
algebraic numbers. On the other hand, if either a or B is transcendental, then the 
answer sought for appears to be too close to the data. In view of these comments, 
the “right” question to ask is this: if a and B are algebraic numbers, with a 
different from both 0 and 1, and 8 irrational, does it follow that @® is transcen- 
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dental? The answer turns out to be yes; it was obtained, more or less simultane- 
ously and independently, by A. O. Gelfond (# I. M. Gelfand) and Theodor 
Schneider in 1934. The question about 2V2 is covered: since 2 # 0,2 # 1, and y2 
is irrational, it follows that 2¥2 is transcendental. 

The theory didn’t stop there. Here, for instance, is an n-fold generalization, a 
sample of some of the deep results obtained by Alan Baker in a sequence of 
papers in the late 1960’s (for which he got the Fields prize). If a,,...,a@, are 


algebraic numbers different from both 0 and 1, and if B,,...,8, are algebraic 
numbers such that 1, 8,,..., 6B, are linearly independent over the field of rational 
numbers, then af'!a§2 --- a® is transcendental. 


Reference: EDM 414/D. 


Development 3: The continuum hypothesis. In 1900 there was an International 
Congress of Mathematicians in Paris, and that’s where Hilbert presented his list of 
23 problems. The first of those problems was the continuum hypothesis, originally 
formulated by Cantor. The simplest statement of Cantor’s continuum hypothesis 
(there are other, more general, statements of great interest) is that every uncount- 
able set of real numbers is in one-to-one correspondence with the set of all real 
numbers, or, in Cantor’s notation, that there is no cardinal number between X&, 
and 2%, 

Is the continuum hypothesis true? The question has often been likened to a 
similar one about Euclid’s parallel postulate, and the answer has shocked and 
annoyed a lot of people, just as the Bolyai-Lobachevsky resolution of the status of 
the parallel postulate shocked and annoyed many of our grandfathers. In both 
cases there is a more or less pleasant axiom system (in the present case it is the 
Zermelo-Fraenkel system of axioms for set theory) and a less pleasant, more 
complicated, non-obvious additional axiom. If the extra axiom is a consequence of 
the basic ones, it is true, and all is well; if its negation is a consequence of the basic 
ones, it is false, and, for better or for worse, the question is definitively answered. 
The answer, awaited for a long time, turns out to be a subtle and profound 
intellectual achievement. Gédel proved in 1940 that the continuum hypothesis is 
not false—it is consistent with the other set-theoretic axioms—and Paul Cohen 
proved in 1964 that it is not true—it is independent of the other axioms, or, in 
other words, its negation is also consistent. 

Both Gddel and Cohen argue by the construction of a suitable model, but use 
very different techniques. G6ddel starts with a universe of sets that satisfies the 
Zermelo-Fraenkel axioms and shows that there exists a subuniverse that also 
satisfies them, and in which, moreover, the continuum hypothesis is true. Cohen’s 
argument is similar but harder. It is reminiscent of Felix Klein’s construction of a 
Lobachevskian plane that endows a Euclidean disk with a new metric. Cohen, like 
Gédel, starts with a model of set theory and then enlarges it, adjoins new objects 
to it, in such a way as to “force” the continuum hypothesis to be false. (“‘Forcing” 
has become an important technical word in the subject.) 

Where does that leave the continuum hypothesis? Many people believe, as 
Gédel did, that despite the independence of the continuum hypothesis it is in some 
legitimate sense either true or false—that humanity has not yet thought of the 
right way to describe the full truth about set theory, and when the full truth comes 
to be known, when the appropriate additional axioms are found and adjoined to 


1990] HAS PROGRESS IN MATHEMATICS SLOWED DOWN? 581 


the present ones, then the continuum hypothesis will become either provable or 
disprovable. Both schools of thought exist, and you are free to join whichever one 
you find more attractive. Would it influence your decision if you knew what Gédel 
believed? He thought it would become disprovable. 

Reference: EDM 35/D. 


Development 4: Lie groups. Hilbert’s fifth problem asked whether some rela- 
tively mild assumptions about topological groups were enough to imply a strong 
conclusion. A topological group is a set that is both a group and a topological 
space in which the two structures are compatible in the sense that the group 
operations (multiplication and inversion) are continuous. 

A typical example is the set of all 2 X 2 matrices of the form 


x y 
lo 1 

with x > 0; the topological structure is that of the right half plane (all (x, y) with 
x > 0), and the multiplicative structure is the usual one associated with matrices. 
This example has an important special property: it is “locally Euclidean” in the 
sense that every point has a neighborhood that is homeomorphic to an open ball in 
2-dimensional Euclidean space. (Equivalently: every point has a “local coordinate 
system”.) An even more important special property of the example is that the 
group operations, regarded as functions on the appropriate Euclidean space, are 
not only continuous but even analytic. That is obvious at a glance; if the matrix 


lo 1 

0 1 

is identified with the ordered pair (x, y), then 
(u,v)(x,y) = (ux, uy + Vv), 


and 


(x,y) = [= * | 


Xx Xx 


If a group is locally Euclidean, that is, if it can be “coordinatized”’ at all, then 
there are many ways of coordinatizing it; if at least one of them is such that the 
group operations are analytic, the group is called a “Lie group”. Hilbert’s fifth 
problem was this: is every locally Euclidean group a Lie group? 

The problem is analogous to one in complex function theory. It is relatively 
elementary that a twice-differentiable function is analytic; the hard theorem is that 
the conclusion holds under much weaker assumptions. Similarly, it has been known 
for a long time that if a topological group has sufficiently differentiable coordi- 
nates, then it has analytic ones; the Hilbert problem is to draw the same conclusion 
under much weaker assumptions. 

Immediately after the discovery of Haar measure, von Neumann (1933) applied 
it to prove that the answer to Hilbert’s question is yes for compact groups. A little 
later Pontrjagin (1939) solved the abelian case, and Chevalley (1941) solved the 
solvable case. The general case was solved in 1952 and 1953 by Gleason, and 
Yamabe, and, jointly, by Montgomery and Zippin; the answer to Hilbert’s question 
is yes. Gleason offered a new characterization of Lie groups; Montgomery and 
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Zippin used geometric-topological tools (and Gleason’s result) to reach the desired 
conclusion. 
Reference: EDM 406/N. 


Development 5: Simple groups. Every group has two obvious normal subgroups, 
namely the group itself and, at the other extreme, the subgroup that consists of the 
identity element only. A group is called simple if these are the only normal 
subgroups it has. 

Simple groups are like prime numbers in two ways: they have no proper parts, 
and every finite group can be constructed out of them. Consider, indeed, an 
arbitrary finite group, and look at a maximal normal subgroup of it, that is, a 
normal subgroup that is not included in any other proper normal subgroup. If the 
group is simple, then that maximal normal subgroup is just the identity, but in any 
event, no matter what that subgroup is, its maximality implies that the quotient 
group obtained by dividing out by it is simple. The relation among group, normal 
subgroup, and quotient is sometimes described by saying that the original group is 
an extension of the quotient group by the subgroup. In this language, every finite 
group (except the trivial group, the identity) is an extension of a simple group by a 
group of strictly smaller order. That statement is the group-theoretic analogue of 
the number-theoretic one that says that every positive integer (except 1) is the 
product of a prime by a strictly smaller positive integer. 

If the maximal normal subgroup is not trivial, then the procedure just used can 
be used again; the result is a maximal normal subgroup of the maximal normal 
subgroup such that the original subgroup is an extension of the second quotient by 
the second subgroup. The procedure can be repeated so long as it produces 
non-trivial subgroups; the end product is a decreasing chain (a composition series) 
of subgroups of the original group with the property that each quotient group 
obtained by dividing a term of the chain by its successor is simple. A great part of 
the problem of getting to know all finite groups reduces in this way to the 
determination of all finite simple groups. 

The abelian ones among the finite simple groups are easy to determine—it’s a 
classroom exercise to show that they are just the cyclic groups of prime order. 
That’s the only easy part of this subject. What’s hard is to find all non-abelian 
simple groups. Some examples of simple groups are not hard to come by; among 
permutation groups, for instance, the most famous ones are the alternating groups 
of degree 5 or more. For a long time the known simple groups did not exhibit any 
pattern, and even the simplest questions about them resisted attack. Burnside, for 
instance, conjectured (in 1911) that every finite non-abelian simple group has even 
order, and that conjecture stood as an open problem for more than 50 years. 

In a spectacular display of group-theoretic power, Feit and Thompson settled 
Burnside’s conjecture (in 1963)—it is true. The proof occupies an entire issue 
(over 250 pages) of the Pacific Journal. It is technical group theory and character 
theory. Some reductions in it have been made since it appeared, but no short or 
easy proof has been discovered. The result has many consequences, and the 
methods also have been used to attack many other problems in the theory of finite 
groups; a subject that was once pronounced dead by many has shown itself capable 
of a vigorous new life. Thus, for example, Burnside’s dream is by now completely 
realized: by a gigantic collaborative effort of many mathematicians all over the 
planet all finite simple groups have been found and can be explicitly described. 

Reference: EDM 160/D. 
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Development 6: The Atiyah-Singer index theorem. How can a linear transforma- 
tion on a finite-dimensional vector space fail to be invertible? There are two 
obvious answers: it can fail to be injective (one-to-one) or else it can fail to be 
surjective (onto). The first kind of failure is the existence of a nontrivial kernel, 
and the second kind is the existence of a nontrivial cokernel. The kernel of a 
transformation 7 is, of course, the null-space, the inverse image of 0; the cokernel 
is, roughly, the complement of the range, or, precisely, the quotient of the entire 
space modulo the range. It is an elementary fact of linear algebra that the two 
kinds of failure always occur simultaneously, and that, in fact, the numerical 
measures of the two kinds of failure, that is, the dimensions of the kernel and the 
cokernel, are always equal. 

For infinite-dimensional spaces the story is different. If, for instance, T is the 
unilateral shift on the infinite-dimensional sequence space /*, the transformation 
defined by 


T{é,, fo, E3,...} — {0, &), fo, E3,...}, 


then ker T is the trivial subspace {0}, but ran T is the subspace of all vectors whose 
first coordinate is 0, and consequently dim ker T = 0 and dimcoker T = 1. 

_ The index of any linear transformation 7 (including the one just described) is 
defined by 


index J = dim ker JT — dimcoker T, 


whenever that makes sense (that is, whenever it is not equal to « — o). For linear 
transformations on finite-dimensional spaces the index’is always 0, but on infinite- 
dimensional spaces the index (being a measure of the extent to which a transfor- 
mation is irreparably non-invertible) can give interesting information. 

The Atiyah-Singer index theorem has to do with the “analytic” index just 
defined. The first instance of it that most of us run into is the Cauchy integral 
formula thought of as the computation of a winding number. Another instance is 
the Riemann-Roch theorem. Compact Riemann surfaces such as the sphere and 
the torus arise in complex function theory, and the Riemann-Roch theorem (which 
goes back to a paper by Roch in 1865) is about the dimensions of certain vector 
spaces of meromorphic functions on such Riemann surfaces; its conclusion is a 
formula for that dimension. The general Atiyah-Singer theorem is a generalization 
of the Riemann-Roch theorem. It deals with smooth compact manifolds more 
general than Riemann surfaces. Elliptic differential operators acting on smooth 
functions defined on such manifolds have two numbers associated with them. One 
of them is the analytic index, as above, and the other is the topological index, 
which is something else. The topological index is related to K-theory, and, in 
particular, in more classical contexts, to the Euler characteristic. The achievement 
of the Atiyah-Singer theorem (1963) is that those two indices have the same value 
—that, in other words, a property of great analytic importance, with a purely 
analytic definition, is in fact almost completely determined by the topological 
properties of the underlying manifold. 

The equation just referred to is only a small part of the joint work of Atiyah and 
Singer. The results of that work are the deepest and broadest, and, for me as a 
reporter, much the hardest of the contents of this report; they are not just a 
theorem but a theory, a context, a point of view that enters, influences, and is 
influenced by many parts of mathematics. Writing about the spectacular successes 
of research in differential geometry during the last fifty years, Osserman called the 
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Atiyah-Singer index theorem “‘a grand synthesis of analysis, topology, and geome- 

try leading, in particular, to a new way of viewing the Gauss-Bonnet theorem: not 

as an isolated result, but as one instance of a larger scheme of things.” That 

synthesis was probably a major factor in Michael Atiyah’s being knighted in 

England and Isadore Singer’s being awarded a presidential medal in this country. 
Reference: EDM 236/H. 


Development 7: Fourier series. It is a historical misfortune (which was responsi- 
ble for almost 200 years of barking up the wrong tree) that Fourier series were 
discovered before convergence. Fourier series are a vital part of both classical and 
modern analysis; they are important for both abstract theory and concrete applica- 
tions. They arise in topological groups and in operator theory; they have their 
origins in problems about vibrating strings and about heat conduction. 

In their most classical manifestation, Fourier series have to do with numerical- 
valued functions on the line (it is usually best to let them be complex-valued) that 
are periodic of period 27 and integrable on the interval [0, 277]. The Fourier series 
of such a function is the infinite linear combination of the exponential functions 
e"™* n=0,+1,+2,..., with coefficients determined by integrating the given 
function against them. (Alternatively the Fourier series uses sines and cosines 
running in one direction only; the complex version, however, is algebraically 
simpler to manipulate.) 

Trigonometric polynomials (in either real or complex form) are familiar objects 
and are computationally accessible; surely nothing but good could come out of 
representing more difficult functions as limits of such polynomials. It seemed 
natural, therefore, to hope that the “sum” of the Fourier series associated with a 
function f would be “equal” to f, and, in any event, to ask for which functions 
that would occur. The answer, it was hoped, was that good functions would have 
good series, and the history of the subject has been strongly influenced by that 
hope. 

When limits began to be understood, “sum” and “equal” were interpreted in 
the sense of pointwise convergence; the more fruitful and usable concepts of weak 
convergence and convergence with respect to a norm came along only after the 
mathematical community was irretrievably committed to research in the pointwise 
direction. 

How good does a good function have to be? Differentiability is good enough, 
but, it turns out, continuity is not; there are continuous functions whose Fourier 
series diverges at a point (du Bois Reymond, 1876), and, in fact, at many points. 
Must the Fourier series of a continuous function converge almost everywhere? 
That was an unsolved problem for many years. 

Kolmogorov showed that if all that is assumed is that f is integrable on [0, 27r], 
then it could happen that the Fourier series of f diverges almost everywhere 
(1923), or even everywhere (1926). The biggest question along these lines was 
asked by Luzin, and it remained unanswered for 50 years: if f is square integrable 
on [0,27], does it follow that the Fourier series of f converges to f almost 
everywhere? Repeated failure to prove the affirmative led to the official state 
religion among the cognoscenti in the 1950s and 1960s: the answer must be no. 

The answer is yes. The first proof is due to Carleson (1966). A remarkable 
feature of Carleson’s achievement is that it uses no unknown techniques; it just 
uses the known ones better. It depends on an ingenious push-me-pull-you way of 
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selecting subintervals. It is as if Carleson had power enough to replace everyone 
else’s e by e”, and that did the trick. 
Reference: EDM 167/H. 


Development 8: Diophantine equations. Hilbert’s tenth problem concerned the 
solvability of Diophantine equations. The problem was to design an algorithm, a 
computational procedure, for determining whether an arbitrarily prescribed poly- 
nomial equation with (positive) integer coefficients has (positive) integer solutions. 
(The modifier “positive” is technically convenient and does no harm.) 

What does it mean to say there is an algorithm for deciding solvability? A 
reasonable way to answer the question is to offer a definition of computability for 
sets and functions, and then to define an algorithm in terms of computability. The 
concept of computability has received a lot of attention; it has several different but 
logically equivalent definitions all in accord with the intuitive notion that the word 
suggests. 

Suppose now that {E,, E,, E;,...} is an effective enumeration of all the 
polynomial equations under consideration, and let $ be the set of those indices k 
for which E, has a solution. Hilbert’s problem (is there an algorithm?) can be 
expressed by asking whether S is a computable set. The answer is no. The answer 
was a long time coming: it is the result of the cumulative efforts of J. Robinson 
(1952), M. Davis (1953), H. Putnam (1961), and Y. Matijasevié (1970). 

The central concept in the proof is that of a Diophantine set, and the major 
step proves that every computable set is Diophantine. The techniques make 
ingenious use of elementary number theory (e.g., the Chinese remainder theorem, 
and a part of the theory of Fibonacci numbers). The proof exhibits some interest- 
ing Diophantine sets whose Diophantine character is not at all obvious (e.g., the 
powers of 2, the factorials, and the primes). 

One way to prove that S (the index set of the solvable equations) is not 
computable is by contradiction. If § were computable, then it would follow (by a 
slight bit of additional argument) that each particular Diophantine set (i.e., the 
solution set of each particular equation) is computable, and hence (by the “major 
step” mentioned above) that the complement of every Diophantine set is Diophan- 
tine. The contradiction is derived by exhibiting a Diophantine set whose comple- 
ment is not Diophantine. 

The last step uses a version of the familiar Cantor diagonal argument. The idea 
is “effectively” to enumerate all Diophantine sets, as {D,, Dj, D3,...} say, to 
prove that the set D* = {n:n € D,} is Diophantine (that takes some argument), 
and, finally, to prove that the complement of D* is not Diophantine—and that’s 
where Cantor comes in. 

Reference: EDM 100. 


Development 9: Banach bases. In calculus (beginning in the 17th century) we 
learn how to find maxima and minima of numerical functions defined on familiar 
domains such as intervals in the line and rectangles in the plane. In a later subject 
called the calculus of variations (beginning in the 18th century) we try to find 
maxima and minima of numerical functions whose domains consist of sets of 
functions. Most famous example: for each path joining two points in space there is 
a definite time that it takes for a particle to slide along the path from the first 
point to the second—what is the minimum of all those times? (The minimum time 
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is attained for the celebrated path called the brachistochrone—brachisto for 
shortest and chrone for time.) Problems such as that were among the ones that 
gave rise to functional analysis, the part of analysis in which the domains of the 
functions studied are sets of functions, subsets of function spaces, and in which 
algebraic and topological methods are freely used. 

Another point of view on functional analysis is that it is an infinite-dimensional 
generalization of linear algebra. The first idea was to replace vectors and the 
additions involved in applying matrices to them by functions and the integrations 
involved in applying kernels to them. The first systematization of the subject 
(around the 1920’s) was proposed by Banach and Wiener (independently)—the 
result was the theory of Banach spaces, and its many wildly generalized out- 
growths. The study of Banach spaces is a typical instance of the axiomatic method 
in mathematics—abstract and general, with roots in the concrete and special. Its 
called too general by some and not general enough by others. In any event it is still 
a living part of mathematics, and major progress was made in it relatively recently. 

One of the earliest questions about Banach spaces was the basis problem, raised 
by Banach himself in his book (1932). A sequence of elements in a Banach space is 
a Schauder basis for the space in case every vector has a unique expression as a 
convergent infinite linear combination of the terms of the sequence. The countabil- 
ity built into the definition of the word “sequence” implies that if a Banach space 
has a basis, then it is separable (that is, has a countable dense set). The basis 
problem, which was outstanding for 40 years, was the converse: does every 
separable Banach space have a basis? Each space that ever came up in analysis 
had one, and yet a proof that that had to be so remained elusive. 

A classically important concept in the study of Banach spaces is that of a 
compact (completely continuous) operator, that is, a linear transformation between 
Banach spaces with the property that the closure of the image of the unit ball is 
compact. The easy compact operators are the ones of finite rank (that is, the ones 
for which the range is finite-dimensional); the next easiest ones are the (uniform) 
limits of operators of finite rank. If a Banach space is “good’’, then every compact 
operator mapping into it is such a limit (in technical language: the space has the 
approximation property), and, in particular, if a Banach space has a basis, then it 
has the approximation property. 

The basis problem was solved by Per Enflo in 1973. The solution turned out to 
be negative: there exists a separable Banach space that does not have the 
approximation property. The technique is constructive; it is a combinatorial way of 
constructing and putting together infinitely many finite-dimensional Banach spaces. 

Reference: EDM 39/A. 


Development 10: Manifolds. A 2-manifold is a topological space that is locally 
like 2-dimensional Euclidean space (that is, every point has a neighborhood 
homeomorphic to an open disk in the plane) with the properties that the locally 
Euclidean patches are glued together continuously and that there are not too many 
of them (meaning that the entire space is separable). The definitions of 3-mani- 
folds, 4-manifolds, 5-manifolds, etc. are exactly the same—yust replace 2 by 3, 4,5, 
etc., (and replace “plane” by 3-space, 4-space, 5-space, etc.). 

A 2-manifold can be large (it can, for instance, be the entire plane), and even if 
it doesn’t look large it may be intrinsically large in the sense that many paths in it 
converge to points that don’t exist in it—as is the case with an open disk. The 
manifolds whose study is more promising are the compact ones. 
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Can all possible compact 2-manifolds be listed? Yes, and the simplest ones to 
list are the so-called orientable ones. One of them is simply connected (the 
sphere), another has genus 1 (the torus), another has genus 2 (a doughnut with two 
holes), etc.—and all possible orientable 2-manifolds are like one of these (in the 
sense of homeomorphism). The classification problem for compact 2-manifolds is 
classical and was solved long ago. (The 1-dimensional case can safely be left as 
homework.) 

For higher dimensions things are tougher to see and tougher to do. Surprisingly, 
however, when the dimension is 5 or greater, much of the answer is known—sim- 
ply connected 5-manifolds are understood and classified by homotopy theory. For 
3-manifolds the problem is unsolved—that’s what the celebrated Poincaré hypoth- 
esis is about. For 4-manifolds the story is different. Two great victories were won 
by Michael Freedman and Simon Donaldson in 1982 (and each was awarded a 
Fields medal at the Berkeley Congress in 1986). 

It turns out that every oriented 4-manifold is associated with a symmetric 
integer matrix (“the intersection matrix”) of determinant +1, and Freedman 
showed that all such matrices can indeed occur. The correspondence between 
matrices and manifolds is one-to-one half the time and two-to-one the rest of the 
time. The final result is a complete classification of all simply connected oriented 
4-manifolds. If we think of a basketball (the surface of a sphere) as a 2-dimensional 
object (never mind that we usually see it embedded in 3-dimensional space), 
Freedman’s work gives us as much insight about 4-dimensional basketballs as we 
have about the honest-to-goodness ones. 

Donaldson, on the other hand, showed that if a simply connected oriented 
4-manifold has a smooth differential structure and if its matrix is positive definite, 
then the matrix must be equivalent to the identity matrix. That’s a very strong 
conclusion; it shows that the theories of topological and differential 4-manifolds 
are radically different. 

Presenting Freedman’s work, John Milnor said that the proofs of the results are 
extremely difficult. Presenting Donaldson’s work, Michael Atiyah said that it 
opened up an entirely new area, and he went on to say that Donaldson’s youth and 
mathematical power are ‘“‘an indication that mathematics has not lost its unity, or 
its vitality”. 

Reference: MI 5/3/39. 

Development 11: The Bieberbach conjecture. Some problems are interesting not 
because they are interesting but because they are elusive. The Fermat question is 
probably the best case in point. Nobody really wants to know the mere answer to 
that one—what mathematicians want to know is why they don’t know the answer. 
The Bieberbach conjecture about schlicht functions was, in context, perfectly 
natural to the specialist—to most outsiders, however, it seemed like a curious 
technicality whose main claim to fame was that it remained unanswered. 

What it concerned was the class of all those functions that are given by power 
series of the form 


and that are schlicht (meaning analytic and injective or, in the more classical word, 
univalent) in the open unit disk. The class constitutes a “normal family” (a 
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compactness property), which implies that, for each n, the coefficient a, remains 
bounded throughout the class. Bieberbach, studying the subject in 1916, proved 
that in fact |a,| < 2 for every function in the class. One particular function in the 
class, the Koebe extremal function defined by a, = n, shows that the upper bound 
2 is best possible (and is, in fact, attained). The Bieberbach conjecture was that, 
more generally, |a,| <n for all n. 

Lowner proved in 1923 that the conjecture is true for n = 3; in 1955 Garabe- 
dian and Schiffer proved it for n = 4; in 1968 Pedersen and Ozawa proved it for 
n = 6; in 1972 Garabedian, Pedersen, and Schiffer proved it for n = 5; and in 1973 
Ozawa and Kubota proved it for n = 8. Progress was slow and not promising. 

The breakthrough that transformed the conjecture into a theorem came in 1984 
when Louis de Branges offered a proof of the general case, a proof of several 
hundred pages, based on his theory of square summable power series. His original 
proof contained minor errors, but they were correctable and soon corrected. The 
experts were still a little bothered, but their discomfort didn’t last too long. What 
they didn’t like was the reliance of the proof on the seemingly irrelevant methods 
of functional analysis, and, sure enough, ultimately the Leningrad seminar on 
geometric function theory produced a proof that Bieberbach would have liked. The 
proof is shorter and more perspicuous than its special predecessors for the cases 
n = 5 and n = 6. Square summable power series are gone, but the main structural 
insights of de Branges are still the ones that make things go. We don’t, glory be to 
glory, lose them all. 

Reference: MI 8/1/40. 


Epilogue. The answer to the question in the title is clearly and decisively no. 


Two Theorems, Two Sculptures, Two Posters 


HELAMAN FERGusON, Supercomputing Research Center, Bowie, Maryland 


HELAMAN ROLFE PRATT FERGUSON has an A.B. in liberal arts from Hamil- 
ton College and a Ph.D. in mathematics from the University of Washington. 
He is a full Professor of Mathematics at Brigham Young University, 
currently with the Supercomputing Research Center, Bowie, Maryland. His 
published mathematical work is primarily in applying matrix groups to 
algorithm design. Helaman learned to work with stone as a young appren- 
tice, studied painting as an undergraduate and had graduate courses in 
sculpture. His sculpture studio is now in Maryland; his collected sculptural 
work is direct carving of stone and bronze. Helaman is in the 1988 and 1989 
Guinness Book of World Records for juggling a distance of fifty miles. 
Helaman and his wife Claire are raising six sons and one daughter. Claire is 
an artist; she understands mathematicians very well and has observed, “I 
know what you guys are doing. First you shoot the arrow and then you draw 
the bullseye.” 


Fic. 1. Umbilic Torus NC Front View; the closed curve of cusps goes round the torus thrice the long 
way and once the short way. 


Art / Science. Mathematics is both an art form and a science. This duality 
integrates our social reflexes to differentiate disciplines and people. Mathematics 
is a language of physics and a tool of engineering. In manufacturing engineering 
there is something called a critical part. This part is so crucial to the operation of 
the whole that its failure would be a disaster. Such parts are made with the highest 
quality of attention and materials, and a complete pedigree is maintained of the 
source, handling, and steps of manufacture. There are crucial mathematical 
theorems that are worth this high level of attention and celebration. 

I think that the great myths of our century are mathematical equations; the 
heroes of those myths are equals signs and approximations. Haven’t we all flown 
comfortably upon the implications of Navier-Stokes equations? We consistently 
build effective machinery by testing mathematical models or ‘ghost machines’ in 
computer simulations. We have impressive engineering professions motivated by 
function. Function and efficiency were never everything. But mathematics has 
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another side, motivated by a successful aesthetic quite its own. What if the 
resource, intelligence, sophistication, and energy of mathematics were developed 
for aesthetic creation of perfectly useless and priceless artifacts? 


Mathematician / Sculptor. I believe it is feasible to communicate mathematics 
along aesthetic channels to a general audience. Mathematical beauty as well as 
mathematical truth is distinctive and highly developed. A very rigorous mathemati- 
cian submits that “in all mathematics there is a mysterious element: the astonish- 
ing concurrences and ties between objects and theories which at first glance seem 
far apart” [Arnold]. Sometimes I have succeeded in capturing the truth, the 
beauty, and this mysterious element in my sculpture [Cannon, Sakanel]. 

I am only one kind of mathematician, one kind of sculptor. My mathematical 
processes tend to be constructive and algorithmic, [J. Algorithms paper, Anaheim 
talk]. An algorithm is a construction involving step by step putting together and 
organization of resources. Some mathematicians work analytically by breaking 
things down. On the other hand, my sculpture process tends to be destructive. I 
begin with raw material such as stone and systematically destroy the parts I don’t 
want to see or feel. In sculpture jargon, this is called direct carving. It is a 
subtraction process. Some sculptors work mostly by building up. It is helpful to 
draw these distinctions, but they are not hard and fast. 

The first sculpture I tell about is direct carving old style; I direct carved pretty 
much the same way the sculptor-carvers of Willendorf, Hierakonpolis, Ur, Harappa, 
Yazilikaya, Imdugud, Warka, Sidon, Chavin, Gandhara, Yaxchilan, Mathura, 
Vijayanagar, T’ien-hung Shan, T’ang, Souillac, and Naumberg did. I carved much 
the way Myron, Praxiteles, Sluter, Michelangelo, Bologna, Bernini, Coysevox, 
Story, Thorvaldsen, Greenough, Rodin, Taft, Brancusi, Hepworth, Moore, Noguchi, 
or their assistants did. The second sculpture was done by direct carving new style: I 
direct carved using an interesting combination of mathematics and technology not 
available to the above masters. 


First theorem: the classification of surfaces. The homeomorphism problem of 
surfaces is posed as follows: give a short list of easily computable criteria which are 
different for surfaces not homeomorphic and which are the same for surfaces 
which are homeomorphic. The fundamental theorem of the topology of compact 
surfaces states that two compact surfaces are homeomorphic if and only if they 
have the same number of boundaries, handles, and cross-caps. This theorem 
completely solves the homeomorphism problem for surfaces. Surfaces have dimen- 
sion n = 2. The same problem for higher dimensions n > 3 is still open, [Scott, 
Thurston]. 

The Torus with Cross-Cap sculpture presents one handle, one cross-cap, and, if 
one counts the footprint, one boundary. A Torus with Cross-Cap contains the germ 
of the fundamental theorem of the topology of surfaces. When a Torus with 
Cross-Cap is dug out of somebody’s rubble in ten thousand years, the knowledge of 
this fundamental theorem can be celebrated again. Theorems are time invariant, 
so I like to sculpt in geological time scale materials. 


The proof of 3x =x +h; 2x #h. At first sight 3x =x + h; 2x #h looks like 
an obvious error. The usual algebra of cancelling an x in 3x =x +h gives 2x =h. 
Algebraic statements are not necessarily context-free. These algebraic statements 
are taking values in geometric structures; x represents a cross-cap on a sphere, 
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+ represents a connected sum, / represents a handle on a sphere, and = 
represents a reversible deformation. Other things about this algebra, such as 
cancellation, are unusual. Three cross-caps on a sphere, 3x, is a torus with a 
cross-cap, x + h, but a Klein bottle, 2x, is not a torus h. 

Follow the visualization indicated by the accompanying drawings; explain your 
visualization to someone else. Begin with a sphere, cut out boundaries, sew on 
handles and sew on cross-caps. 

Here are two kinds of pictures: sew-ups (leading to algebra) and curve drawings 
(leading to sculpture). A sew-up is a planar region with marked boundaries and 
directions for sewing the boundaries together. For convenience, the directions will 
rely upon personal experience with physical paper and cloth, but only up to a 
point. A new, non-physical ingredient is the possibility of self-intersection. The 
curve drawings are shorthand symbols for a thing visualized in the mind and body 
somehow; a thing some might like to think has independent existence, but a thing 
that takes time and work to visualize. If a visualization doesn’t develop one day it 
will the next. Distinguish when a curve outlines a surface tangent to lines of sight, 
describes a surface self-intersection, lies under part of the surface, or informs a 
cross-section. 


i 


Fic. 2. Six Sew-ups: A Sphere, 0; A Torus, i + 0; A Projective Plane, x + 0; A Klein Bottle, 2x; A 
Handle, h; A Mobius Band, x; A Cross-Cap, x. Sew matching sides of the self inter-locking puzzles. 


Recognize that the algebraic expressions written on the drawings are consistent 
with the definition of equality given by reversible deformations and the definition 
of addition given by the connected sum. Reversible deformation means stretching 
and shrinking, neither cutting nor tearing, but self-intersections are allowed. The 
connected sum refers to the process of joining two surfaces along the boundary of 
a removed disc. There are four interesting reversible deformations and each has a 
sequence of curve drawings. Each curve drawing in the sequence presents a new 
idea in the reversible deformation. The equality between the beginning and ending 
structures may surprise at first but after practice the deformation will seem 
obvious. 

Between a Mobius band and a cross-cap, stretch the boundary of the Mobius 
band as drawn up on the lower part and down on the upper part. The self-intersec- 
tion line of the cross-cap appears naturally when this is done. The cross-cap is a 
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>) BS 


Fic. 3. Six Curve Drawings: A Sphere, 0; A Connected Sum of Two Spheres, 0 + 0; A Sphere with a 
Boundary, a Handle, and a Cross-Cap, x +h; A Torus, h + 0; A Projective Plane, x + 0; A Klein 
Bottle, 2x. 


projective plane with a disc removed, the only difference between a MObius band 
and a cross-cap is the size of the disc hole and where it is. 

In the catalytic presence of a cross-cap drawing, think of one end of the handle 
of the torus as a sort of vacuum hose snorkel elephant trunk tornado thing that 
moves its snout as a hole around in the surface. When it moves across the 
cross-cap it develops a self-intersection but retains its hole to reveal a Klein bottle 


structure. 


Pee) 
C 9% » 
Fic. 4. Six More Curve Drawings: A Mobius Band, x; A Cross-Cap, x; A Sphere with Two Cross-Caps, 


2x; A Sphere with Three Cross-Caps, 3x; A Torus with a Cross-Cap, x + h; A Klein Bottle with a 
Cross-Cap, 2x + x. The last three are the same. 


The AMS Sculpture: Torus with Cross-Cap and Vector Field, Direct Carving 
x +h. In November 1986 I acquired a couple of tons of Carrara white marble and 
carved what eventually became the present AMS Torus with Cross-Cap and Vector 
Field from one of the 1200 Ib chunks. A stone carver would call this spec work. It 
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was not specifically commissioned and had no immediate home; financially (as 
usual), I was going out on a rock. I was motivated by the joy of the thing, the same 
reason I do mathematics. If the finances work out that is great, because then I can 
continue. 

Carrara marble comes from the coastal mountains of northwest Italy south of 
Genoa, north of Livorno. More specifically, southeast of La Spezia, north of 
Lucca, east of the towns of Carrara and Pietrasanta, are mountains bearing a close 
grained, generally white stone of consistently high quality. Many generations of 
families have been quarrying this stone for building purposes; the effects of their 
labors, the exposed white disemboweled mountains, can be seen from far at sea 
and space, as satellite photographs reveal. (These mountains would be a good 
place for a really large torus with cross-cap.) Fine art sculpture does not support 
the stone industry in Italy or anywhere else; occasional good blocks become 
available for reasonable. prices with the right connections and for unreasonable 


MUONO: 
QOO8 


Fic. 5. A Mobius Band is a Cross-Cap, x = x. The cross-cap self-intersection line appears naturally 
when the boundary is crimped toward the center of gravity. 


I carved Torus with Cross-Cap and Vector Field in eleven days, working 10-16 
hours a day. Stone carving is hard work even with modern electric and air tools; it 
firms muscle and tests courage. Carrara marble is beautiful to carve. It flakes 
rather than breaks or chunks; it shows a pretty pressure wave as the pneumatic 
tool penetrates. 

I will describe a few details of the process of direct carving x + h. The first 
stage was to get the idea for the work somewhere in mind and body. For me, that 
began twenty six years earlier when as an undergraduate at Hamilton College I saw 
an elegant paper by another undergraduate at Caltech [Crawley]. Those topologi- 
cal ideas percolated in my system for years. They were there when I was behaving 
like an artist and they were there when I was behaving like a scientist. Long term 
understanding of these ideas became an important source of the myriad decisions 
in my final conversation with the stone. 

The second stage was having the opportunity, which raw materials, tools, and 
experience afford. But these stages pale temporarily when faced with the problem 
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Fic. 6. Sew two Mobius Bands, x + x, along their boundaries to get a Klein Bottle, 2x. The next figure 
has further detail. 


of physically moving a 1200 Ib stone by oneself. Remember the integral over 
distance definition of work with the little differential ds? It is interesting to move a 
big stone a long way, a very little at a time, with just one crowbar and much 
patience. This primitive move was necessary to reposition the stone to access it 
with a four wheeled frame hoist. 


AG 
mee 
BCDC 


Fic. 7. Between three Cross-Caps on a Sphere and a Cross-Cap on a Klein Bottle, 3x =x + 2x. 


CNS 


The third stage was positioning the stone, blocking it up in various ways to get a 
good look at it, and then carving the hole. A practical reason for doing this first 
was to check out the integrity of the stone and reveal any flaws before making a big 
investment of time and energy. At the same time I made inroads on the cross-cap 
as well for the same reason. There was a natural spall on the side of the stone 
where I had decided to uncover the cross-cap. All carving to this point was done 
with a hand-held pneumatic hammer with a sleeve insert for taking a traditional 
chisel, usually a point, and a single carbide tipped hammer drill for the guide hole. 
I had satisfied myself on the integrity of the stone; no fissures appeared along 
which the stone could mysteriously and silently separate into two or more pieces. 
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The block was somewhat rectangular, so I used my hammer drill again, drilling a 
series of holes comprising the general outline of the piece, face on, as it appears in 
the AMS poster. I tapped soft iron into the drill holes, uniformly all the way 
around, to exert pressure on the hole weakened stone and shear off the unwanted 
corners. A similar smaller scale operation involved making parallel carbide saw 
cuts about an inch deep and an inch or less apart, then popping the blocklets off 
with a parting tool. 


Fic. 8. Between a Cross-Cap on a Klein Bottle and a Cross-Cap on a Torus, x + 2x =x +h. The 
Cross-Cap as Catalyst. 


After that initial gross chamfer of the block I proceeded with a pneumatic 
hammer and a point. I was shaping a surface that had a local 2-frame almost - 
everywhere. I didn’t require the 2-frame to be orthogonal and in the case of the 
crystal structure of this stone more like sixty degrees was better. So my cutting 
pattern was to follow two adjacent sides of a parallelogram, first stroke one side 
and then the other side. This operation popped off a parallelopiped shaped flake, 
quantitatively over six hundred pounds of them. The resulting carving was overall 
within a half inch or so of the final and had an interesting pineapple look and 
texture. I worked sensitively especially in the region of the self-intersection line of 
the cross-cap; punching through or splitting would ruin the entire piece. I wanted 
to approximate the line of intersection as closely as possible and reveal the 
expressive translucence of the white marble in that narrow region. 

The next stage was grinding with a set of silicon carbide wheels and diamond 
discs. Dust control is a serious problem at this stage. Marble is mostly the relatively 
harmless mineral calcium carbonate. It is unhealthy but not usually lethal to inhale 
some marble dust. Granite and other silicate based stone dust circulating in the 
lungs can over the long term cause cancerous lesions. I used a pair of snorkels to 
draw the dust laden air into a filter system which captured the dust and vibrated it 
periodically into a barrel. 

Both of these stages, flaking and grinding, were performed gradually over the 
entire piece. This meant not penetrating too deeply in any particular area but 
working with the whole simultaneously. Balance was maintained throughout. 
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Repositioning and reblocking of the piece was required to access the different 
faces; this was time consuming even though the piece was gradually losing weight. 

I brought the final surface up by sanding with successively finer grits, 60 mesh, 
120 mesh, 320 mesh, and finally about 400 mesh. I used flap wheels and discs 
mounted on one of my one-third horsepower flexible shaft tools at 2800 rpm; 
silicon carbide, zirconium oxide, and diamond were the abrasive materials. The 
surface was close to an egg shell polish; a very smooth surface which I intended to 
texture for mathematical as well as tactile reasons. 

Mathematically there is an interesting difference between a cross-cap self-inter- 
section of two surfaces and two abutting corners. The latter had already shown up 
in sculpture and could be had with two straight saw cuts and no particular care. To 
distinguish this cross-cap here I articulated the continuation of one surface 
through the other with a corresponding horizontal vector field on the one and a 
vertical vector field through the other. I developed the vector field theme by 
cutting many parallelish polished grooves over the entire surface. To do this I used 
a tungsten carbide conical ball end cutter rotating at 27,000 rpm. These strokes 
cover the entire surface of the Torus with Cross-Cap and now Vector Field. 1 also 
used this high speed tool to refine the approximation to the cross-cap as much as I 
dared. One slip and ruin. The final cross-cap has a lovely translucent effect when 
backlit. 

This summarizes the direct carving process. I emphasize that concentration is 
essential throughout the whole process.: The concentration aspect is very much like 
doing a long and complex computation with pencil and paper. 


The AMS Poster. The American Mathematical Society (AMS) is a professional 
society primarily devoted to the publication of new mathematical research. The 
AMS has over 25,000 members, publishes six journals and fifteen book series. It 
was founded in 1888 and in the century since has contributed to what has been a 
golden age of mathematics. For the occasion of the 1988 Centennial the AMS 
organized a special summer meeting at its headquarters in Providence, Rhode 


Fic. 9. Torus with Cross-Cap and Vector Field, AMS Building, Providence, Rhode Island. 
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Island [Jackson]. The Mathematical Association of America (MAA) is a profes- 
sional society with more generalized and public goals than the AMS. The MAA 
has over 32,000 members and publishes three journals, a newsletter, and seven 
book series. These two societies hold their meetings jointly and enjoy a spirit of 
complementary cooperation. The 1987 Summer Meeting of the AMS-MAA was at 
the University of Utah, only fifty miles north of my recently completed five 
hundred and fifty pound Torus with Cross-cap and Vector Field. This gave MAA 
President Len Gillman and others opportunity to see the work and approve the 
purchase. The AMS accepted this MAA gift in the words of AMS President Dan 
Mostow, “‘A feast for the eyes as well as the mind.” 

Many universities and corporations have framed and hung the elegant poster 
that the AMS published in 1988 featuring this white marble carving on a royal blue 
background [Pitcher]. 


AMERICAN MATHEMATICAL SOCIETY 
CENTENNIAL CELEBRATION : 1888-1988 


Fic. 10. AMS Centennial Poster, Torus with Cross-Cap and Vector Field. 


Second theorem: Orbits of GL(2,R) on binary forms. The set of real n Xn 
matrices with non-zero determinant is a group with the usual matrix multiplication, 
associativity, identity, and inversion. Call this group GL(n, R). The case of n = 1 is 
the multiplicative group of non-zero real numbers. The case of n = 2 has richer 
structure; GL(2, R) is a very typical and very important group. We will not consider 
n > 3 here. 


GL(2,R) consists of exactly the real invertible 2 x 2 matrices: 

a 6 

y Oo 

These matrices act on column vectors by left matrix multiplication to give the 


| ad — By #0. 
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substitution 


x ax + By 
> ° 

( | yx + dy 

This substitution into quadratic and cubic binary forms gives representations of 
2 X 2 matrices as 3 X 3 matrices and 4 X 4 matrices respectively. The quadratic 
case represents GL(2,R) as a subgroup of GL(3,R); call this group action 
GL(2, R) x R°. The cubic case represents GL(2, R) as a subgroup of GL(4, R); call 
this group action GL(2,R) X R*. These matrix subgroups act by right multiplica- 
tion on the coefficient row vectors of the corresponding binary forms. 

The quadratic and cubic binary forms and their coefficient row vectors are, 


ax” + bxy + cy? @ (a,b,c) € R° 


and 


ax? + bx*y + cxy? + dy? @ (a,b,c, d) € R*. 


The word binary refers to the two variables x, y. Substitute 


(5) ax + By 
— 
y yx + dy 


into each form, expand, and collect x*y“-monomial terms, 


ax* + bxy + cy? > (aa? + aby + cy*)x? + (2aaB + b(ad + By) + 2cdy) x 
+(aB’y* + bBd + cd*)y?, 
ax? + bx?y + cxy? + dy? > (aa? + a*by + acy* + dy*)x? 
+(3aa’B + a7b6 + 2abBy + 2acdy+Bcy? + 3déby”)x’y 
+ (3aaB’ + 2abBs + acd? + bB*y+2Bcdy + 3d5*y)xy* 
+(aB? + bB*5 + Bcd? + dd*)y?. 
These new coefficients are linear in the original coefficients (a,b,...) but 


non-linear in the a, B,..., since only powers of the x, y were involved. The 2 x 2 
matrix 


° 5 | € GL(2,R), 


has the representations 
a? 2aB B? 
° 5) ay adi+By Bd} € GL(3,R), 
y y? 20 5? 


a? 3a7B 3ap? B? 

° 5 | a’y a’hi+2aBy 2aB6 + B*y B75 
— 

ay? 2ady + By? ad? + 2Bdy BS" 

y? 35y" 38°y 5° 


€ GL(4,R). 
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The corresponding determinants are (a5 — By), (ad — By), (ad — By)®, so 
the representations are as invertible as the original matrix. 

The GL(2,R) orbit of a form is the set of all GL(@,R) substitutions in the 
variables x, y of the form. Equivalently, the GL(2, R) orbit is right multiplication 
of a coefficient row vector by all the representation matrices. An orbit is a subset 
of the set of all forms, two different orbits are disjoint, the union of all the orbits is 
a partition of the set of all forms. In the case of the GL(2, R) orbits of quadratic 
and cubic forms there are only finitely many orbits. For higher degree forms there 
are infinitely many GL(2, R) orbits. 

The classification of the binary forms problem is posed as follows: determine 
the geometric structure of the GL(2, R) orbits. The classification of binary quadratic 
forms is a theorem in every calculus book and will be done another way below. The 
theorem for binary cubic forms is less well known but due to [Cayley, Zeeman] and 
is the subject of my Umbilic Torus sculpture and the discussion below. The same 
problem for binary forms of degree higher than three is still open [Mumford]. 


The Cayley-Zeeman construction. A simple question to ask about a polynomial 
in one variable is, are there any repeated roots? 

This question has a wonderful answer: a root is repeated if and only if the 
coefficients of the polynomial satisfy a certain multivariable polynomial called the 
discriminant. The quadratic equation 


ax? + bx +c =0 
has repeated roots if and only if 


b* — 4ac = 0. 


The cubic equation 


ax? + bx* +ex +d =0 
has repeated roots if and only if 


b*c? — 4ac? — 4b3d + 18abcd — 27a7d* = 0. 

Use both the fundamental theorem of algebra and the fundamental theorem of 
the calculus to prove (characteristic zero) that a polynomial has repeated roots if 
and only if the polynomial and its derivative share a root. The discriminants are 
found by eliminating the variable between the polynomial and its derivative. 

Define the quadratic discriminant set 


{(a,b,c) € R°|b? — 4ac = 0}, 
and the cubic discriminant set 
{(a,b,c,d) © R*|b?c? — 4ac* — 4b°d + 18abcd — 27a*d* = 0}. 


The quadratic discriminant set is a double cone; this may not be obvious. The 
cubic discriminant set has even less obvious geometry. The quadratic case will 
become obvious below and the cubic case is worth a sculpture. 

A polynomial with real coefficients could have any real root but complex roots 
must occur in complex conjugate pairs: zero and the real coefficients are fixed by 
complex conjugation. The number of real roots and even number of complex roots 
and the number of repeated roots is called the root type of the polynomial. 
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The group GL(2, R) was defined to act on binary forms, not polynomials in one 
variable. For the inhomogeneous polynomials in one variable 


ax* + bx +, ax? + bx* +x 4d, 


replace x by x/y and multiply through by the right power of y to get the 
homogeneous binary forms in two variables 

ax* + bxy + cy’, ax? + bx*y + cry’ + dy?. 
It now makes sense to talk about repeated roots, discriminants, and root types of 
these binary forms. 

The root type is invariant under any invertible real linear transformation of the 
two variables (x, y). If two cubic forms have the same root type there is a matrix in 
GL(2, R) which sends the roots of one into the roots of the other. The correspond- 
ing substitution changes the coefficients of one into the other. Part of this picture 
is that the cubic can be solved in terms of cube roots and square roots involving the 
discriminant which is an invariant up to scale of the GL(2, R) action. Details of this 
proof, that root type is invariant, are omitted. What is important here is that the 
root type in the quadratic and cubic case characterizes the GL(2, R) orbits. This is 
not true for higher degree forms. 

Root types of the quadratic and cubic forms have the special names given in the 
first column of the following tables. 


hyperbolas two complex 

ellipses two real distinct 
parabolas two real equal 
origin 


TABLE I. 


hyperbolic umbilics two complex, one real 
. elliptic umbilics three real distinct 


parabolic umbilics three real, two equal 
exceptional three real equal 
origin — 


TABLE II. 


The names come from setting the form equal to a constant and naming the 
corresponding curve. For example, {(x, y)|x* — y? = 1} is a hyperbola [Golubitsky]. 
Transform the (a, b,...) according to the linear substitution 


* > 
(3 
and substitute them back into the discriminant. There is a surprising simplification; 


the new discriminant factors into the old discriminant and a power of the 
determinant of the 2 X 2 matrix. The quadratic discriminant becomes 


(b? — 4ac)(ad — By)’, 


ax + By 
yx + Oy 
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and the cubic discriminant becomes 
~(b2c? — 4ac3 — 4b3d + 18abcd — 27a7d*)(ad — By)”. 

The geometric interpretation is that the discriminant sets, defined by setting the 
discriminant equal to zero, don’t change under GL(2,R) action. This group 
invariance will reveal the geometric structure. 

Construct the complex numbers C: select exactly one of the square roots of 
negative one, call it i, adjoin it to the real numbers R so that C=R+ IR. All 
complex numbers correspond to some 2 X 2 real matrices by 


a+pie( a} 1 (4 t}. io (9, a): 


Complex number addition, subtraction, multiplication, and division is now 2 X 2 
matrix addition, subtraction, multiplication, and multiplication by an inverse. The 
imaginary i with i* = —1 is real as a 2 X 2 matrix whose square is the negative 
identity matrix. The matrix transpose corresponds to the complex conjugate. The 
matrix determinant is the square of the absolute value, so the multiplicative group 
of non-zero complex numbers, C%, lies in GL(2, R). 

C~ in polar rather than cartesian coordinates is 

i i cos@ = sin@ 
a+ Bi pe’, c 3 ( 28 mo): 

where we will assume that p is positive. 

There is a complex number representation for the real binary forms. Let 
z=x+yi, the complex conjugate z* =x — yi, and u=u,+ Uy,i, V =U, + Vyl. 
The real part of the complex quadratic form (uz* + vzz*) is 


Re(uz* + vzz*) = (u, + v,)x* + — 2uyxy + (-u, + 0,)y’ 
= ax* + bry + cy’, 
where we take v to be real, v, = 0. The real part of the complex cubic form 
(uz? + vz*z*) is 
Re( uz? + vz?z*) 


(u, + v,)x° + (-3u, - U,)x7y + (-—3u, + v,) xy’ + (uy — v»)y° 


= ax? + bx*y + cry” + dy?. 
Comparing coefficients gives two invertible linear transformations which relate 
the (a, b,...) and the (u, v) coefficient sets: 


a 1 0 1 Uj 
[2 | =| 0 -2 0 U |, 

c -1 0 1 Uy 

a 1 0 1 0 uj 
b} | 0 -3 0 -1]. [42 
cl} |-3 0 1 O Uy 
d 0 1 oO -1 U» 


These transformations simplify the discriminant sets. Look at the new form the 
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discriminants take. The quadratic discriminant is 4 times 
ur + us — UF 
and the cubic discriminant is 4 times 
2 2) 2 2\2 2 2\( 7,2 2 2 
2_ 42 
+ 8u,U,(3v? — v3). 

The quadratic discriminant is obviously a double cone from the emergence of 
the circles u? + u5. The cubic discriminant needs more work. First look at the 
quadratic discriminant another way. 

Recall that the discriminant is zero when the polynomial and its derivative both 


vanish. The forms are homogeneous, so rescale the complex variable z by a 
variable of absolute value 1, z = e'®. Then 


Re(uz? + uzz*) = Re(ue”’? + v) 


1 
Vv, + 5 (ue + u*e~ 7"), 
Set the form 
1 5 i 
v, + 5 (ue ‘e+ y*e “"?) = 0 
and the derivative 
2iue*? — 2iu*e~*'? = 0. 


Add a quarter of the second equation to the first arid solve for u = —v,e*’?. In 
(u,v) = (u, + ivy, V,) coordinates, the quadratic discriminant is the set of (v,e’’, v,) 
for v, and @ real and arbitrary. These pairs obviously form a double cone: think of 
U, as a vertical axis and the v,e’®, 6 arbitrary, as a circle of radius v, located in a 
plane perpendicular to the v, axis at a directed distance of v,. Such a circle 
consists of parabolas or forms with two repeated roots. This circle is a quadratic 
analog of the cubic Umbilic Torus. 


Apply this pattern of calculation to the real part of the complex cubic form 
Re(uz? + iz?z*) = Re(ue*? + ve'®) 
1 3) . . 
= 5 (ue e+ ye"? + ve’? + U*e'?), 
Set the form 
1 33 . . 
5 (ue ‘e+ ute"? + ve’? + v¥e'?) = 0 
and its derivative 
1; 34 , oo, 
5 Give '? — 3iu*e"'? + ive’? — ivte'?) = 0. 


Add six times the first equation to the second to get 


Aue? — 2u*e—? + 2ve? = 0. 
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Set u = 1, this is really two equations u, = 1 and u, = 0. Solve for v in terms of @ 
to get a curve in the u = 1 plane of (u, v) space, 
vu = —2e7"# + eH, 
Change the variable g = $(@ — 77) to simplify 
v=2e'? —e #9 0 <b <2. 

This classical curve is the locus of a point on a circle of radius 1 rolling inside a 
circle of radius 3, a hypocycloid of three cusps [Brieskorn]. In the u = 1 plane this 
curve consists of parabolic umbilics, two or three repeated roots. All three roots 


repeated is the subset of the curve given by the second derivative of the form 
vanishing, or equivalently, 


dv . 
— = 2ie'® — 2ie~**? = 0, 
dd 
or 
et =|= etitk y= 3e2itk /3 keZ. 
These scaled cube roots of unity are cusps on the curve; a tangent line but no 
tangent circle is defined at a cusp. 


Fic. 11. Hypocycloid of Three Cusps with Generating Circles. 


The group C% contains the rotation e’” which acts simply on the cubic form. 
Substitute z > e’z, in the real part of the complex cubic form: 


Re(uz? + uz*z*) > Re(ue*?z? + ve®z*z*). 
Geometrically 
(u,v) > (ue, ve), 
u gets rotated thrice while v is rotated once. This group action leaves the cubic 
discriminant unchanged because the determinant of e’’ as a matrix is exactly 1. 
The plane, given by u = 1 or the v-plane, gets moved by the unit circle group. 


Recall the u thrice while v once, or one-third rotation of v while u rotates once. 
This means that the hypocycloid rotates one cusp over while u = e'” = 1 moves 
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once around and back to u = e”'” = 1. The hypocycloid has three fold symmetry, 
so the resulting surface closes and forms a torus. There are two ways to go around 
a torus. The curve of cusps goes around the torus three times the long way and 
once the short way; the long way is defined by the rotation of the u = 1 plane. 

Now look at this twisted torus with a hypocycloid cross-section from the point of 
view of the (1,v) geometry. The parameterized surface has the complex exponen- 
tial form 


(b,0) > (€7%, 2e%F*9 — e8-29)) 0h, 0 < Qt. 


This torus gives only one view of the cubic discriminant set. This view is the 
intersection of the three dimensional parabolic umbilic set or cubic discriminant 
set with the three-dimensional space formed by rotating the u = 1 plane. There 
are interesting intersections with other three-dimensional spaces. 


Fic. 12. Stereo Pair of GL(2, R) x R*: The Intersection of The Cubic Discriminant Set and a Rotated 
Plane. 


Direct carving GL(2,R) < R‘*, new style. About the same time I was direct 
carving the Torus with Cross-Cap and Vector Field 1 learned how jet engine parts 
are made. I thought there might be some application to my sculpture. I had been 
thinking mathematically about sculptural forms and real objects for some years. It 
turned out that I had independently addressed and solved some problems that 
have industrial applications. The major difference between my sculpture and 
industrial parts is that superficially an aesthetic object tends to be more complex 
and interesting than a functional part. The aesthetic artifact is intended to capture 
human attention and become part of a social event. That aspect may be irrelevant 
or dangerous in machine parts. But the problems for sculpture were more difficult 
than a typical machine part. To solve a problem, find a harder related problem and 
solve that; then the first problem will be easier. So I lectured a room full of 
engineering faculty and graduate students for three semesters on my approach to 
computer-aided design and manufacture. My motivation centered around automat- 
ing direct carving. 

Direct carving GL(2,R) X R* involved thousands of computer driven milling 
machine operations. Computer milling was done to preserve some of the quantita- 
tive as well as the qualitative features of the mathematics behind the fine art. 


1990] TWO THEOREMS, TWO SCULPTURES, TWO POSTERS 605 


Direct carving new style by computer drive is neither better than nor worse than 
direct carving old style by hand-eye-tool-medium coordination. The computer 
driven milling machine is a new tool, some things are easier, some things are much 
harder. To work consistently to the nearest thousandths of an inch does offer new 
options. I am satisfied that direct carving new style has expressive and aesthetic 
potential. 

A lot of work went into revealing this twisted torus with hypocycloid cross-sec- 
tion. It has some significance in the theory of real binary cubic forms. We could 
just make a model of this torus, as exactly as physically possible. When we got done 
we would have just that, a model. Nothing more, something less. A model is less 
than what it attempts to be, a sculpture is more. A sculpture can have perhaps, 
nuance, timbre, mystery, warmth, history, numerous levels of meaning, and many 
references other than what its humble origins would hope or explain. A pure 
model would have no trace of its construction, some people might like that. But 
that is not what I did at all. I am interested in the adventure of affirming pure 
mathematical thought in unpredictable physical form. 

Besides that I had in mind direct carving this umbilic torus. Direct carving 
means I move a tool around on the surface, a real physical tool leaves interesting 
marks, texture. The cutting tool has finite width, I have to live with this finite 
width, exploit the tell-tale marks of a tool that can visit only one point at a time. A 
tool-path is a curve. Surface filling curves can visit one point at a time and cover 
the entire surface. Because the cutting tool has finite width, I decided to use some 
order of a surface filling curve [Hilbert]. What manufacturing engineers typically 
do for tool-paths is have the cutter go back and forth, from one side to the other, 
or make a spiral motion, usually amounting to a series of concentric rectangles. 
These are all patterns familiar to anyone who has ever mowed a lawn. Next time 
mow your lawn following some order of a Hilbert curve. Surface filling curve tool 
paths are a new process. In some situations surface filling curves are more efficient 
than traditional paths. 

There are many surface filling curves; such fractals abound. I chose the 
Peano-Hilbert curve forms because when machined in slightly rough material they 
have overtones of Shang and Chou bronzes, of Mayan pictographic writing. The 
curve itself lies centered in a groove, the groove wall cuts off the neighboring 


Fic. 13. Peano-Hilbert Surface Filling Curves. 
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groove wall to form a maze wall. These maze walls are worn down in the bronze 
finishing process and are interesting to look at and touch. The grooves were cut to 
one-third ball end mill depth, the twisted hypocycloid with three cusps already 
abounds in threes and thirds. 

I define the mth order of the Peano-Hilbert curve recursively by constructing an 
ordered list of non-negative integer coordinate pairs H[n]. The join symbol U 
means to preserve the order as usually read left to right and top to bottom; the 
addition, subtraction and matrix multiplication are supposed to thread through the 
list. Begin with H[0] = (0,0). Then for n > 1, define 


H[n] =H[n-1]- ( | 


0 
LI 
(2"-10) + H[n - 1] 
LI 
(2"-1 2-1) + H[n — 1] 
LI 


n—-1 n . 0 1 

Qn 1,2"-)-HIn-11-() 5). 

I wanted to carve the surface of the torus of parabolic umbilics in such a way 

that I touched each point of the surface once. The parameters ¢, 6 lie in a square 

with each side of length some multiple of 77. The integer coordinates of H[n] grow 

without bound; to keep the curve in the unit square just scale by the right power of 

two and the right multiples of 7. I say multiples because I filled my square domain 

with three pairs of Peano-Hilbert curves directed to go the long way around the 
torus. This gave the final spatial torus image more balance. 


Fic. 14. Stereo Pair of the Peano-Hilbert Umbilic Torus Filling Space Curve, One Third Shown. 


The cutter was a five-eighths inch ball end mill of tungsten carbide rotating at a 
variable rotating speed less than 10,000 rpm. I cut a high density foam positive 
image which was held fixed throughout the cutting process. The available com- 
puter driven milling machine was a three axis unit, the bed travel was 28 inches in 
the x and y directions, the cutting tool quill had a travel of twelve inches. This 
machine geometry meant that I must always approach my complicated surface 
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from a vertical direction. The tip of the tool, if allowed to follow the parabolic 
umbilic surface exactly would be cutting into the elliptic umbilics inside the torus. I 
wanted the ball end mill and the whole milling machine apparatus to consist only 
of hyperbolic umbilics. That is where the observers of the sculpture are, the whole 
exterior space comprising hyperbolic umbilics. How do you feel now that since the 
creation of this sculpture you are hyperbolic umbilic? To avoid cutting up elliptic 
umbilics a tool compensation must be made. I wanted the ball end mill to be 
tangent to the parabolic umbilic surface, this means that a radius vector of the 
sphere part of the ball end mill be normal to the parabolic umbilics. All the 
normals had to be calculated. This was done by writing the twisted hypocycloid 
torus in terms of sines and cosines, explicitly writing out two independent tangents, 
taking their cross-products everywhere, and normalizing those to get the unit 
normal vectors. A simple vector addition with the vertical drop gives the proper 
tool compensation. 

It is sobering to debug a computer program that drives a potentially lethal tool 
with a thrust of over a thousand foot pound inches per second squared. For 
insurance many tool path pictures were drawn on a stereo imaging system. The 
computer languages appropriate to the computer graphics displays were inade- 
quate for addressing the milling machine. The computer driven milling machine 
had its own language and file system based upon earlier paper tape. The program 
for the Umbilic Torus would have been impossible a few years earlier; the paper 
tape would have filled the room. As it was we had to install a hard disc drive in the 
milling machine. 

The final cutting of the high density foam one side at a time took over eight 
hours. It came out looking like a most remarkable waffle. I intervened and 
sculpted this object to transcend the industrial process. Here was a positive image 
emerging in the full context of our electronic information processing era which I 
was about to immerse into bronze age technology. The bronze casting involved the 
traditional inversions. The original positive image was inverted to a negative silicon 
rubber mold, which was inverted to a positive hard wax image. This positive wax 
was dismantled and the pieces sprued with wax attachments to facilitate good 
liquid bronze flow; the positive results were dipped in a ceramic shell material and 
dried repeatedly to form a ceramic shell negative flask. This ceramic shell negative 
was fired in a kiln. The final inversion was when the positive bronze was poured. 
Fine art bronze contains some silicon and zinc with the copper (brass has tin 
instead of zinc). The bronze was welded together, chased, and given a corrosion 
arresting patina of antique greens. These bronze foundry processes date back 
several millenia. Some of the mathematics is ancient too; understanding both the 
torus and the hypocycloid calls up the ancient geometry of wheels. It was striking 
to see the old and the new mathematics and computer abstractions interact with 
the old and the new foundry and milling realizations. 


The JPBM poster. The Joint Policy Board for Mathematics (JPBM), located in 
Washington, D.C., is a consortium of the American Mathematical Society, the 
Mathematical Association of America, and the Society for Industrial and Applied 
Mathematics. In connection with Mathematics Awareness Week in the Spring of 
1989, the JPBM published an elegant five color poster featuring my silicon bronze 
sculpture Umbilic Torus NC [Hoffman]. 
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Fic. 16. JPBM Poster Featuring Umbilic Torus NC. 


The mathematics behind Umbilic Torus NC includes a fundamental representa- 
tion of the matrix group GL(2,R). The ‘NC’ is a cutting technology abbreviation 
for ‘numerical control’. The fundamental group representation appears in robin 
egg blue script in the background on the JPBM poster. We have discussed this 
mapping of 2 xX 2 matrices into 4 x 4 matrices via linear substitutions on binary 
cubic forms, e.g., 


a - _s . . 
Y 0<j<3 O0<j<3 O0<j<3 

The only difference between this expression and that on the poster is the obscure 

point that the range of summation there is simply given over j whereas it is to be 

understood that 0 <j < 3. 
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ELEMENTARY PROBLEMS 
E 3397. Proposed by Ji Chen and Cheng-hui Lo, University of Science and 
Technology, Hefei, Anhui, People’s Republic of China. 
The perimeter of a triangle ABC is divided into three equal parts by three 
points P, Q, R. Show that 
2 
Area( A PQR) > 9 Areal A ABC) 


and that the constant 2/9 is best possible. 


E 3398. Proposed by Alan H. Stein, University of Connecticut, Waterbury. 


Find all pairs of positive integers m,n such that d(m)|n and (n)|m, where ¢ 
denotes Euler’s function. 


E 3399. Proposed by Robert W. Floyd, Stanford University, CA. 


On a certain island with n married couples, every couple consists of a hunter 
and a farmer. The Ministry of Hunting has divided the island into » hunting 
ranges of equal size A. Working independently, the Ministry of Agriculture has 
divided the island into n farming ranges of equal size A. The Ministry of Marriage 
insists that the hunting range and the farming range assigned to each couple be 
close together. To everyone’s surprise the Ministry of Assignments is able to allot 
the hunting ranges and the farming ranges to the various couples in such a way 
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that each couple’s two ranges overlap. The Ministry of Religion declares this to be 
a miracle. 


(a) Show that in fact no miracle is involved by proving the existence of a 
positive number 6, depending only on v1 with the following property: No 
matter how the Ministries of Hunting and Agriculture have made their 
divisions, it is possible for the Ministry of Assignments to make its choices in 
such a way that each couple’s two ranges overlap by an area at least 6, A. 

(b) Determine the best possible value of 6,. 


E 3400. Proposed by Burt J. Totaro, Mathematical Sciences Research Institute, 
Berkeley, CA. 


Let S be the boundary of the unit square [0,1] x [0,1] in R?. Suppose f is a 
continuous real-valued function on S such that f(x, 0) and f(x, 1) are polynomial 
functions of x on [0,1] and such that f(0, y) and f(, y) are polynomial functions 
of y on [0,1]. Prove that f is the restriction to S of a polynomial function of x 
and y. 


E 3401. Proposed by James A. Davis, Michael Kerckhove, and J. Van Bowen, 
University of Richmond, VA. 


Suppose n points are independently chosen at random on the perimeter of a 
circle. What is the probability that all the points lie in some semicircle? 


E 3402. Proposed by Joseph Kupka, Monash University, Clayton, Victoria, Aus- 
tralia. 


A population consisting of particles of various types evolves in time according to 
the following rule: Each particle is deemed to belong to a unique generation 
n = 1,2,3,.... Each particle produces a certain number of “offspring” particles, 
and, for each n, generation n + 1 comprises the totality of offspring of the 
particles in generation n. A particle of type i = 0,1,2,... produces exactly i + 2 
offspring, one each of types 0,1,2,...,i + 1. Let N(n, k) denote the number of 
particles in the mth generation when the first generation consists of a single 
particle of type k. Find a formula for N(n, k). 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A New Property of an Old Configuration 


E 3251 [1988, 131]. Proposed by Clark Kimberling, University of Evansville, 
Evansville, IN. 


Given a triangle ABC, it is possible to construct three circles I',, [,, [¢ inside 
ABC in such a way that each one is externally tangent to the other two, I, is 
tangent to the two sides AB and AC, I is tangent to the two sides BC and B4, 
and I. is tangent to the two sides CA and CB. (Cf. A. S. Winsor, Modern Higher 
Plane Geometry, Christopher Publishing House, Boston, 1941, pp. 206-209 or 
H. G. Forder, Higher Course in Geometry, pp. 244-245.) Let D be the point of 
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contact of I, and I, E be the point of contact of [¢ and Ij, and F be the point 
of contact of [, and I. 

(a) Prove that the lines AD, BE, and CF are concurrent. 

*(b) Prove that the lines A’D, B'E, and C’F are concurrent, where A’, B’, and 
C’ are the excenters of the triangle ABC. 


Solution by I. G. Macdonald, Queen Mary College, London, England. (a) Given 
three circles in a plane, each pair has two centers of similitude, and four triples 
among the resulting six points are collinear. In the case of the circles [,, 13, I 
with centers X,Y,Z, three of these six points are D, FE, F, and the remaining 
three, D’, E', F’, are the points of intersection of the external common tangents of 
I, and I%, [¢ and 1,, and I, and Is, respectively. The points D'EF lie on a line 
p, the points DE’F on a line q, and the points DEF’ on a line r, and finally D’E’F' 
on a line s. For D’EF, collinearity depends on the fact that the line EF makes 
equal angles with the circles [, and I, and, therefore, passes through their center 
of similitude D’; similarly for DE’F and DEF’. Collinearity of the points D'E'F’ 
follows from a theorem of Monge: the three intersection points of common 
external tangents for three circles taken two at a time are collinear (see Hilbert 
and Cohn-Vossen, Geometry and the Imagination, Chelsea, 1952, p. 137). 

Meanwhile, the centers of similitude are collinear with the centers of a pair of 
circles, so we have collinear sets DD'YZ, EE'ZX, and FF'XY. We have D’ on the 
three lines EF, YZ, BC, E' on the three lines FD, ZX,CA, and F’ on the three 
lines DE, XY, AB. Hence the three triangles DEF, XYZ, ABC have s as a 
common axis of perspective. Since the pair of triangles DEF, ABC 1s in perspec- 
tive, the lines AD, BE, CF meet in a common point, as desired. 

(b) We shall use the theorem that two triangles in a plane are in perspective if 
and only if there is a conic with respect to which they are reciprocal, meaning that 
the sides of each triangle are the polars of the vertices of the other (see, for 
example, H. F. Baker, Introduction to Plane Geometry, Cambridge Univ. Press 
(1943), p. 191). Since the triangles ABC, XYZ are in perspective, there is a conic > 
with respect to which they are reciprocal. Let P,Q, R, S be the poles with respect 
to % of the lines p, qg, r, s, respectively. 

We next show that S is the incenter and P,Q, R are the excenters A’, B’,C’ of 
the triangle ABC. Indeed, since s is the axis of perspective of ABC and XYZ, the 
point S is their center of perspective. But AX, BY,CZ are the internal angle- 
bisectors of the triangle ABC, so S is also the incenter of ABC. Moreover, since 
XYZ is the diagonal triangle of the quadrilateral pgrs, it follows that ABC is the 
diagonal triangle of the quadrangle PQRS. Hence the lines PS and QR pass 
through A and are harmonically separated by AB and AC. Since AS is the 
internal bisector of the angle A, it follows that QR is the external bisector. 
Likewise RP and PQ are the external bisectors of the angles B and C, and 
therefore P,Q, R are the excenters of the triangle ABC. 

By construction, the triangles DEF (with sides on p,q,r) and PQR are 
reciprocal with respect to %. Hence they are in perspective, by the theorem quoted 
above. This means that the lines DP, EQ, FR are concurrent, as desired. 


Solved also by E. Raytchev. Part (a) was solved by J. Dou (Spain) and the proposer. Two incorrect 
solutions were received. 
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A Binomial Coefficient Inequality 


E 3309 [1989, 154]. Proposed by Donald E. Knuth, Stanford University, Stanford, 
CA. 


Suppose that a, b, and c are nonnegative integers with b > a. If 0 < @ < 1 and 
0 <x < 1, prove that 


(ay E (S)(et aero 


Here (as usual) 


(i) == Dk + 1) kt 


for any real ¢ and positive integer k, while ( = I. 

Solution I by David Callan, University of Bridgeport, Bridgeport, CT. Fix a and @ 
and let F(x; b,c) denote the left-hand side of the given inequality. We show by 
induction on c that for any integer b > a, F(x; b,c) > 0. For c = Oor x = 0, F(x; 
b,c) =(- an G °), which is easily seen to be positive. Replacing k by c — k 


in the sum, we can use the Vandermonde convolution | OU 2,)= ("*°) to 


obtain F(1; b,c) = (— 1-4-4 ves °| > 0. One can directly verify that F’(x; b, c) 
= —cF(x;b + 1,c — 1). Together these facts provide a basis for the induction. 


Solution II by the editors. The sum is easily seen to be equal to the coefficient of 
u?*? in (—1)?-4@- Cu + x)°(u + 19°79. Expanding (u + x) = [(u + 1) + (x - DIF 
in powers of u + 1, we see that the sum is equal to 


Eyre (le +a+60- ‘Nx ~ 1) 


b+c 


It is easily verified that each nonzero term is positive. (When x = 1 there is only 
one nonzero term.) 


Editorial comment. Many solvers noted that the sum could be expressed in terms 
of the hypergeometric series as 


9-1 (74 7- 9)p —c,b-—a-@ 
= a-1 Cc, a . 
( 1) b! 2F,( b+1 sx], 


where (r), = r(r + 1)--:(r +k — 1). Some solvers used known transformations 
for the hypergeometric series to express this as a sum of positive terms (Solution I] 
above derives one of these transformations). Others used the integral representa- 
tion for the hypergeometric series to express the sum as an integral of a positive 
function. 

Solved also by R. Askey, D. W. Bailey, A. A. Jagers (The Netherlands), H. Ki (Korea), A. G. 


Kostenbauder, E. Levine, C. Libis, O. P. Lossers (The Netherlands), D. Secrest, J. H. Steelman, 
V. Strehl (West Germany), R. J. Wagner, D. Zeilberger, and the proposer. 
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Determining a Secret Number by Asking Suitable Primality Questions 


E 3310 [1988, 155]. Proposed by Albert Wilansky, Lehigh University, Bethlehem, 
PA. 


I have a secret positive integer u, not exceeding some predetermined bound N. 
If you give me any positive integer n, I will tell you whether or not u + n is prime. 

(a) Give a number G(N) and a procedure such that you can always determine 
the value of u in at most GCN) such trials. 

(b) Give a number F(N) such that no strategy will always determine u in fewer 
than F(N) such trials. 


Solution I of (a) by A. Zulauf, University of Waikato, Hamilton, New Zealand. 


G(N) <WN will suffice. To see this we first note that there exists a prime p 
greater than N such that each of the numbers p —) for j = 1,2,...,N-— 1 is 
composite. (Cf. Niven and Zuckerman, An Introduction to the Theory of Numbers, 
Theorem 1.18. For example, if N = 14, we may take p = 127.) Suppose one now 
takesn; =p—N+i-1fori=1,2,...,N,sothatu+n,=p—(N-u)+i- 
1. Then p—N<u+t+n,<p andut+n,=p+u-—1zp. Let I be the first 
value of i for which u + n, is prime. In view of our choice of p, we must have 
u +n, =p, so that u is given by the formula 


u=N+1-T1 


Solution II of (a) by Imre Z. Ruzsa, Mathematical Institute of the Hungarian 
Academy of Science, Budapest. We show that G(N) < {1 + o(1)}(log N)* and that 
for large N only test integers n, not exceeding N need be used. We begin with the 
following lemma. 


Lemma. If 1<u<v<QN, let A, ,(N) be the set of positive integers n not 
exceeding N such that exactly one of the numbers u + n,v + n is prime. Given « > 0, 
we have 


2 


N)|> 
Au N)| 1+elogN 


for sufficiently large N uniformly in u, v. 
Proof of Lemma. Clearly 
IA, (N)| = |{n: 2 < N,v + nis prime}| + |{n:n < N, u + nis prime}| (*) 
— 2|{n: n < N, both u + n and v + n are prime}|. 


By the prime number theorem the first term in (*) is equal to 


Ney at N N N 
f[ "— + 0|——, ] = —~ + 0| —y, 
v logt (log NV) log N (log NV) 


uniformly in v. Similarly the second term on the right of (*) is equal to N/log N 
+ O(N/(log N)?) uniformly in u. Now, using Brun’s upper bound sieve method, 
Schnirelmann proved that the third term on the right of (*) is less than an 
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absolute constant times 
N 1 N u?(d) N 1 
aoanz Il 51 aan? <2 ‘; 
(log N’)° plv—u P} (log N)° au 4 (log N)° ajv—u 4 


(Cf. Satz 88 of E. Landau, Uber einige neuere Fortschritte der additiven Zahlentheo- 
rie, Cambridge Tract No. 35, London, 1937 or Corollary 2.4.1 of H. Halberstam 
and H.-E. Richert, Sieve Methods, Academic Press, 1974). Further 
1 o(v-u) 
yy = = — — = O(log log 3(v — u)) = O(log log N). 
d|t-u d UU 

(Cf. §22.9 of G. H. Hardy and E. M. Wright, An Introduction to the Theory of 
Numbers, Oxford University Press, 1960.) Hence the third term on the right of (*) 
is O(N(log N)~* log log N). Combining these results, we obtain 


2N N log log N 
|A, (CN) | ~ los N + | 
og N (log N) 
and the lemma is proved. 
The result claimed at the beginning of our solution is embodied in the following 
theorem. 


THEOREM. If ¢ > 0, N is sufficiently large, and k ={(1 + «log N)*|, then there 


exists a set {n,,nNy,...,n,} with 1 <n; < N such that a knowledge of which numbers 
ut+n,; (i =1,2,...,k) are prime uniquely determines u. 
Proof of Theorem. We consider all k-tuples (n,,7,,...,,) of distinct integers 


from [1, N]. There are NCN — 1)-:-(N—& +1) such k-tuples; we shall show 
that a positive proportion of them give a set which satisfies the conclusion of the 
theorem. 

If u,v are given integers satisfying 1 < u < v < N, then the number of k-tuples 
of integers from [1, N]\A,, CN) is at most 


2 wN \* 
N- —_ 
1+e log N 


The number of pairs u, v satisfying 1 <u <v< WN is N(N — 1)/2. Thus the total 
number of k-tuples from [1, N] each of which avoids at least one of the sets 
A, ,(N) does not exceed 


re 


uy 


N(N — 1) 2 Nn \* 4 2 ‘ 
———-|N- < —Nkt2)4 — ———____ 
2 1+<elogN 2 (1 + «)log N 
< | et2 —2k 
2° = *P\ 7 + e)log N 
1 
< —N* 
2 


< <N(N 1) (Nk +1), 
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since 
N(N-1)-*:(N-k+1) (N-k)‘ k \* k2 3 
mn wnte DW fg 
for sufficiently large N. Thus there exists a k-tuple of distinct integers from [1, N] 
(in fact there are many such k-tuples) which does not avoid any of the sets 
A,, ,(N), ie., which contains a representative of each A, (N) for 1 <u<v<N. 
And if n; isin A, ,CN), then exactly one of the integers u + n,,v +n, is prime, so 
that the :th trial distinguishes between u and v. Thus the theorem is proved. 


Solution of (b) by Albert Nijenhuis, Seattle. WA. We claim that F(N) > 
[log N/log 2]. For there are 2" possible outcomes to a set of k trials and if these k 
trials determine u in each of the N cases, we must have 2" > N ork > log N/log 2. 


Editorial comment. Ruzsa’s solution of (a) improves upon the shorter solution of 
Zulauf in two respects: The estimate of GCN) is much sharper and the values of 
the test integers n are much smaller. 

If we use only a qualitative form of the lemma in Ruzsa’s solution, we see that, 
for large N, the value of u is uniquely determined by a knowledge of which of the 
numbers {u + 1,u + 2,...,u + N} are primes. We know of no proof of this fact 
which does not use sieve methods in some form. It is not obvious how large N 
must be for this assertion to be true; certainly we must have N > 4, since n = 5 is 
the smallest positive integer for which precisely one of the integers 2 + n,4 + n is 

rime. 

P On the other hand Ruzsa’s solution is an existence proof and does not give an 
explicit procedure for finding a set of test integers n,, although an examination of 
only exp({1 + 0(1)Hlog N}°) different subsets of {1,2,..., N} is needed. Even for 
modest values of N, it is hard to find the smallest possible set {n,,n5,...,,} such 
that a knowledge of which integers in {u+n,,u+n),...,u+n,} are prime 
determines u uniquely. When N = 50, say, such a minimal set must contain an 
element from each of the 1225 sets A, ,(50) with 1 <u <v < 50; this selection 
process is rather delicate, since some of these sets A, ,(50) are rather small, e.g., 
A 9 59(50) = {27, 29, 41}. For N = 5 a minimal set of test integers is given by n = 1, 
n, = 4, n3 = 5, since A, (5) = {1}, A, 45) = {5}, and A; (5) = {4}. 


Solved by A. Nijenhuis, I. Z. Ruzsa, A. Zulauf, and the editors. 


Conditions for a Trivial Monoid 


E 3311 [1989, 155]. Proposed by Sydney Bulman-Fleming and Kenneth McDowell, 
Wilfrid Laurier University, Waterloo, Ontario. 


Suppose S$ is a monoid containing elements a and b such that every element of 
S x S is of the form (au, bu) for some u in S (.e., S X S, considered as a right 
S-set, is cyclic). 

(a) Show that S must be a singleton if S is any one of the following: finite, 
commutative, idempotent, or inverse. 

(b) Show that S need not be a singleton in general. 


Solution by Jestis Ferrer, Universidad Complutense, Madrid, Spain. (a) Assume 
that S is a monoid that satisfies the hypothesis. We first observe that if S is either 
commutative, finite, idempotent, or inverse, then a admits a left inverse in S. 
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This observation is trivial if S is inverse. In any case, there exists w © S such 
that (1, 1) = (aw, bw), where 1 is the identity of S. If S is commutative, then the 
element w is a left inverse for a. If S is finite or idempotent, then there are 
nonnegative integers m,n with m > n and a” = a”. Now 1 = aw =a’*w7= ---, 
sol = a"w" =a™w" =a™~"a"w" =a™—"—'q, and a™~"~! is a left inverse for a. 

Let a’ be a left inverse for a. If (x, y) € S X S, choose u,v € S such that 
(x, y) = (au, bu) and (x, x) = (av, bv). Then u = a'au = a'x = a'av = v and x = 
bv = bu = y, and S is a singleton. 

(b) Let M be the monoid of transformations from the positive integers to 
themselves under ordinary function composition, where (fg )(x) = g(f(x)). Define 
a,bEM by ak) = 2k — 1 and b(k) = 2k. If (x, y) € M x M, defining u € M 
by u(2k — 1) = x(k) and u(2k) = y(k) yields (x, y) = (au, bu), as desired. 

Solved also by O. C. Garcia, F. W. Humburg, A. A. Jagers (The Netherlands), J.-M. Monier 
(France), A. Montgomery, P. G. Walsh (Canada), V. Ziegler (student), and the proposers. One 
anonymous solution was received. 


When Fractions Contribute Nothing 


E 3317 (1989, 254]. Proposed by Peter J. Ferraro, Roselle Park, NJ. 


For any positive integer n and any positive real number a, put 
n—-1 
S,(a) = YL |yatj/n]. 
j=0 


(a) Prove that S,(a@) = n|Va | if la] + 1 is not a perfect square. 
(b) What is the value of S(a@) if la] + 1 is a perfect square? 


Solution by Marcin E. Kuczma, University of Warsaw, Poland. (a) Note that 
S (a) > n| Va |, since each summand is at least lVa |. If equality does not hold, 
then the last summand_ exceeds [Va |, which means that Va <k 
< ya + (n — 1)/n for some integer k. This requires k* + 1/n <a+1<k? + 
1, whence la] + 1 = k?”. 

(b) When |a] + 1 = k? for a positive integer k, we claim that the value of S,(a@) 
is n|Va | + [n(a — la). Let r =|n(a —|a))J; this is just a complicated way of 
writing 


k?+r/n<ati<k?+(rt+1)/n, r € {0,1,...,n — 1}. 


This implies Va + (n —r—1)/n <k < ya+ (n —1r)/n, which is equivalent to 
at s/ny=) ¢ forj2n-r. 

Thus S (a) =(n —r\k - 1) + rk =n(k —1)+4+,7, which equals the formula 

claimed because k — 1 = | Va |. 


Editorial comment. W. Janous and C. Wildhagen noted that the result general- 
izes immediately from square roots to mth roots, and R. J. Hendel noted that the 
generalization goes all the way to any strictly increasing function whose inverse 
maps integers to integers. 
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Solved also by K. F. Andersen, J. C. Binz (Switzerland), W. E. Briggs, D. Callan, B. de Weger (The 
Netherlands), P. H. Duc (Denmark), J. Ferrer (Spain), S. Gaignoux (France), J. Ganapathy, J. Guo 
(China), R. J. Hendel, S. Houchen, A. A. Jagers (The Netherlands), W. Janous (Austria), P. Khajeh- 
Khalili, H. Ki (South Korea), N. F. Lindquist, J. B. Muskat (Israel), J. H. Nieto (Venezuela), T. S. 
Norfolk, J. H. Steelman, C. Wildhagen (The Netherlands), A. Zulauf (New Zealand), M. Zweiback, and 
the proposer. 


A Best-Possible Quadratic Inequality 


E 3320 [1989, 356]. Proposed by M. S. Klamkin, University of Alberta, Edmonton, 
Alberta. 


Determine positive constants a and b such that the inequality 
yz + zx + xy > aly?z? +. 2*x? + xy”) + bryz 


holds for all nonnegative x,y,z with x +y+z=1 and is the best possible 
inequality of this form (in the sense that the inequality need not hold if a or b is 
increased). 


Solution by Mark Ashbaugh, University of Missouri, Columbia, MO. The family 
(a, b) = (a,9 — a) for 0 <a < 4 gives all solution pairs to this problem. Using the 
values (x, y, z) = (1/3, 1/3, 1/3) and (x, y, z) = (1/2, 1/2, 0), we obtain the two 
inequalities a+b <9 and a <4. Thus we need only prove that the desired 
inequality is valid for the pairs in the family described, i.e., that the inequality 


yz + 2x + xy > a(y?2z?7 + .2°x? +.x*y7) + (9-a)xyz forS<a<4 
holds for nonnegative x, y,z withhx +y+z=1.Ifwelet P=(x+y+2z)*(yz + 
x +xy), O =y?z* +27x? +x’y?, and R = (x + y + z)xyz, then this is equivalent 
to proving 
P(x,y,z) > aQ(x,y,z)+(9-a)R(x,y,z) for0<a<4, 
for all nonnegative x, y, z, since these terms are homogeneous. 
The proof is now completed by showing that P > 4Q + 5R and that QO > R, 


because together these imply P > aQ + (4-—a)Q+5R>2>aQ+ (9 —a)R for 0 < 
a < 4. To obtain the first inequality P > 4Q + 5R we note that 


P=20+5R+x°y+uy? +y?z + yz 4+x72 4+ x2? 
and then 


P-—40-5R=xy(x—-y)’ +yz(y —z) +22(x —z)’ > 0. (*) 


To obtain the second inequality, Q > R, we apply the Cauchy-Schwarz inequality 
to the vectors V = (xy, yz, xz) and W = (xz, xy, yz) and conclude that 


Q=|V\|-|Wil>V-W=x2yz + xy°z + x27 = R. (« *) 


The points x, y,z for which equality holds are easy to determine. If a < 4, 
equality occurs in (* *) exactly when x = y = z or at least two of x, y, z are 0, and 
(*) also holds with equality in those cases, so for a < 4 the only instances of 
equality in the triangle given by x +y+z2z=1 and x,y,z >0 are the center 
(1 /3,1/3,1/3) and the corners (1,0,0), (0,1,0), and (0,0,1). When a = 4 the 
situation changes because (* *) is not needed, and (*) also gives equality if one of 
x, y,z is 0 and the other two are equal. Thus if a = 4 we also have equality at 
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(1/2, 1/2,0), 1/2,0,1/2), and (0,1/2,1/2), the midpoints of the sides of the 
above triangle. 

Finally, we note that the best inequality of the form P > bR is P > 9R, which 
follows from the fact that the entire discussion above applies also when a = 0; 
equality holds in P > 9R only when x = y =z or when two of x, y, z are zero. 
The best inequality of the form P>aQ is P > 4Q, also from the discussion 
above; equality holds in P > 4Q only when two of x, y, z are zero or when one of 
them is zero and the other two are equal. 


Solved also by J. Anglesio (France), R. Betts, R. J. Chapman (United Kingdom), E. Cohen (France), 
R. J. Currier, W. K. Delaney, R. J. Hendel, W. Janous (Austria), I. G. Kastanas, M. E. Kuczma 
(Poland), L. Kuipers (Switzerland), E. Levine, H. Lipman, K. Liu (West Germany), K. McInturff, H. K. 
Nickerson, R. Persky, S. Philipp, J. H. Steelman, R. Stong, T. P. Vaughan, K. Zacharias (Germany), 
and the proposer. Two incorrect solutions were received. 


Steiner Trees for the Hypercube 


E 3321 [1989, 356]. Proposed by Robert Spira, Ashland, OR. 


A Steiner tree for a finite set S of points in Euclidean n-space is a collection T 
of straight-line segments such that any two points of S can be joined by a unique 
path in 7; its length is the sum of the segment lengths. It is known that the 
minimum length of a Steiner tree for the corners of a square of side-length one is 
1 + y¥3. Show that for any positive integer n there exists a Steiner tree of length 
1 + (2”~! — 1)y3 for the set of vertices of an n-cube of side-length one. 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. We give a recursive procedure for constructing the desired Steiner tree. The 
case n = 1 is handled by a single line segment. 

Assume a Steiner tree of length 1 + (2”~! — 1)y3 for the set of vertices of an 
n-cube of side-length one has been constructed, in which each vertex of the n-cube 
is the endpoint of just one segment and this segment has length greater than 
V3 /6. Now consider an (n + 1)-cube of side-length one. Select two parallel faces 
and consider the n-cube whose vertices are the midpoints of the edges joining the 
corresponding vertices of these faces. Start with the assumed Steiner tree for the 
vertices of this nm-cube. For each vertex a, of this n-cube, delete a segment of 
length 73 /6 from the segment terminating at a,, then add the segments joining 
the new endpoint to the two vertices of the (m + 1)-cube that are the endpoints of 
the edge containing a;. By the Pythagorean Theorem, these segments each have 
length 3 /3. The net effect of these changes is to add 2"(2¥3 /3 — v3 /6) = 
2”~1V3 to the length of the tree. Thus the resulting tree has length 1 + (2” — 1)y3. 
It is clearly a Steiner tree for the vertices of the (m + 1)-cube and has the 
additional properties required for the induction. 


Editorial comment. Donnelly Johnson, Marcin Kuczma, and Richard Stong all 
generalized the result to rectangular parallelopipeds in R” of side lengths a, < a, 
< ++: <4,, obtaining a Steiner tree of length a, + L77}2'-!a,_ v3. 

For n=3 the problem asserts the existence of a Steiner tree of length 
1 + 3V¥3 = 6.19615242... (for the set of vertices of a three-dimensional cube of 
side-length one). In the survey paper by E. N. Gilbert and H. O. Pollak, “Steiner 
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Minimal Trees,” Siam J. Applied Math., 16 (1968) 1-29, this numerical value is 
somewhat garbled. 


Solved also by R. J. Chapman (England), D. J. Johnson, M. E. Kuczma (Poland), R. Stong, P. J. 
Zwier, and the proposer. 


An Integral Inequality 


E 3322 [1989, 357]. Proposed by Alexandru Lupas, Sibiu, Romania. Suppose g is 
an even real-valued function on [—a, a], suppose g is nondecreasing on [0, a], and 
suppose / is a convex real-valued function on [—a, a]. Prove that 


2af g(t)h(t) dt > [ g(t) dt [ h(t) dt. 


Solution by Duane W. Bailey, Amherst College, Amherst, MA. Let f(t) = h(t) + 
h(—t) for 0<t<a. Since g is even, we have {*%,g(t)h(t) dt = [fg(t) f(t) dt. 
Because / is convex, if 0 < t, < t, < a, then the chord of the graph of h over the 
interval [—t,, — t,] has no greater slope than the chord over [f,, t,], which means 
that 


h(-t,) — h(-t,) e A(t,) — h(t,) 
—th-(-t) — fo — ty ) 


This implies that f is nondecreasing. Because both f and g are nondecreasing on 
[0, a], we can apply Tchebysheff’s inequality to conclude 


fae fOr dt > fo dt fro dt. 


Since g is even, translating back to the interval [—a,a] yields the desired 
inequality. 


Editorial comment. Note that convex functions and nondecreasing functions 
defined on a closed interval are always integrable. Walther Janous observed that 
the inequality of the problem follows from other inequalities of Lupas, most 
published in Publ. Inst. Math. Univ. Beograd and quoted in J. E. Pecari¢, 
Konveksne funkcije. Nejednakosti. Naucna Knjiga. (Beograd, 1987). 

Solved also by U. Abel (West Germany), W. J. Buhler (West Germany), D. Callan, M. Falkowitz 
(Israel), D. L. Farnsworth, M. Fichter (West Germany), S. K. Ghosh (India), J.-P. Grivaux (France), 
T. Herman (Hungary), B. Hughes, V. Jianu-Pambuccian, P. G. Kirmser, M. E. Kuczma (Poland), L. 
Kuipers (Switzerland), K.-W. Lau (Hong Kong), E. Levine, K. Liu (West Germany), L. F. Martins, J.-M. 
Monier (France), S. Philipp, C. J. Reiss, D. S. Rubin, K. Schilling, R. Stong, K. Zacharias (West 
Germany), and the proposer. 


ADVANCED PROBLEMS 


6637. Proposed by Herbert S. Wilf, University of Pennsylvania, Philadelphia, PA. 


Let f(n) be the sum of the first one-third of the coefficients in the expansion of 
(1 +x)”, ie., 


f(n) = » (3") (n =0,1,2,...). 


k=0 
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Prove that 
0 4u2\" u 2u 
EA} -——;— - 
27 
n=0 u— dsin( 5 arcsin 7 2u — 3sin( 5 arcsin 7 


6638. Proposed by Stan Philipp, Pennsylvania State University, Altoona, PA. 
Let 
a, = (-1)'{ i) - 4#(2K) (k =0,1,2,...). 
k k 
(i) Prove that 


y y 


mi 2k+1 0 2°? 2k-2n4+1 
Gi) Prove that 


© 
dL a 
k=0 


ay, T ay 


0 for n = 1,2,3,.... 


1 1 


nT ee De forn = 0.1.2..... 
Iki 2n42 1) I2k-oanat} 2 tna 9%12, 


6639. Proposed by Paul Monsky, Brandeis University, Waltham, MA. 


Let N be a convex centrally symmetric plane n-gon of area 2 divided into n 
triangles by rays from an interior point P to the vertices. Let a,,a,,...,a, be the 
areas of these triangles, taken in cyclic order. 

(i) If N is affinely equivalent to the regular polygon of 2k sides, prove that 

ay + a3 4 oes + Ar, 4 =A, + aAgtT oes + Ax, = 1. 


(ii) In general prove that a,,a,...,a,, Satisfy a polynomial identity with integer 
coefficients and constant term 1. 


SOLUTIONS OF ADVANCED PROBLEMS 
Squares Expressible As a Sum of n Consecutive Squares 


6552 [1987, 694]. Proposed by Moshe Laub, Jerusalem, Israel. 


Let S be the set of positive integers n for which there exists a square equal to 
the sum of the squares of n consecutive positive integers. For example, 11 € S, 
since 


77? = 187 + 197 + +++ +27° + 28? 
Prove that 
(a) S has density zero, 
(b) S is infinite, 
(c) if n € S and n is not itself a square, then there are infinitely many squares 
each of which is equal to the sum of the squares of n consecutive positive integers. 
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Further, since x) > —(m — 1)/2 and yy > (nm — 1)/2, we have y > ygu > 0 and 
X>Yov —u(n — 1)/2 + (n —- 1)(u —-1)/2 =yov — (n —- 1)/2 
>yy-(n—-1)/2>0. 
Thus the theorem is proved. 


Editorial comment. If N(x) is the number of elements of S not exceeding the 
positive number x, the argument of either solution of (a) shows that in fact 
N(x) = O(x(log x)~'/7). (Cf. $183 of Landau’s Handbuch der Lehre von der 
Verteilung der Primzahlen.) Using the arguments of the two solutions of (a) along 
with upper-bound sieve methods, Heini Halberstam and Hans-Egon Richert pro- 
vided us with a proof that actually N(x) = O(x(log x)~'). In the other direction 
either solution of (b) shows that N(x) > cyx for a suitable positive constant c. It 
seems difficult to determine the precise order of magnitude of N(x). 

The elements of S less than 73, along with corresponding values of x for which 
T,(x) is a perfect square, are given in the following table 


n 1 2 11 23 24 26 33 47 49 50 59 
x 1 3 18 7 ] 25 7 539 25 7 22 


Most of these examples were given by E. Barbette in a monograph published in 
1910. Cf. L. E. Dickson, History of the Theory of Numbers, Volume II, pp. 319-323. 
A problem distantly related to the present one is E 3064 [1984, 649; 1987, 190]. 


The solutions by Lossers, Mattics, and the editors were the only ones received. 


Infinitely Differentiable Functions on R’ 
6579 (1988, 773-774]. Proposed by D. K. Cohoon, Temple University, Philadel- 
phia, PA. 


Suppose we define a curl operation on the infinitely differentiable functions 
from R’ to C’ by the rule 


cul(7) = ¥ (= - sis | + [ee si | + bos ale 


imi \\9%i41 9X43 OXi42 OX +6 OX;44  9Xi45 


where eé; is the unit vector in the positive direction on the ith coordinate axis, 
f=flx) =fie, + foes + £303 + fae, + fees + f& + £72, is a C” function from R’ 
to C’, and, for all positive integers j in {1, 2,3, 4,5, 6, 7}, Xj47 =x, and fi,7 =f. 
Show that every infinitely differentiable function from R’ to C’ is a curl plus a 
gradient. 


Solution by Alfred Gray, Department of Mathematics, University of Maryland, 
College Park, Maryland. The operational identity 
curl(curl) = grad(div) — A 
can be verified for any infinitely differentiable vector field by a slightly lengthy but 
purely mechanical process. One simply checks that the :th component on each side 


is the same (details omitted). Since the Laplacian is an epimorphism on the 
infinitely differentiable functions on every open set, the result follows. 


Solved also by the proposer. 
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The Asymptotic Expansion of the Middle Binomial Coefficient 


6581 [1988, 774]. Proposed by Donald E. Knuth and Ilan Vardi, Stanford Univer- 
sity, Stanford, CA. 


Stirling’s approximation leads to the following expansion 


(2) 4 1 fi 1 1 5 21 LO 1 
nn} Varn 8n  128n?  1024n? ~—32768n4 n> }f 
Prove that if this expansion is continued indefinitely, the coefficient of n~* will be 


2'~4*m,, where m, is an integer. What is the exact power of 2 that divides the 
denominator of this coefficient?’ 


Solution 1 by Rolf Richberg, Institut fiir reine und angewandte Mathematik der 
RWTH Aachen, West Germany. For x > 0, define 


1 
Plx+- 

7 2) — vrx r(2x + 1) 
fa) = Ve Fp AF (x 4:1)?” ) 


where the equality of the two expressions follows from the duplication formula for 
the gamma function. By Stirling’s formula (or formula 6.1.47 on p. 257 of 
Abramowitz and Stegun, Handbook of Mathematical Functions, National Bureau of 
Standards, 1964) we have an asymptotic expansion 


fa) ~ Saya (2) 


with a, = 1. Since f(n) = van (2") /4", the problem is to describe the coefficients 
a,, in (2). Denote by a(m) the number of ones in the binary expansion of m_€ N, 
where N denotes the positive integers. We shall show that for any m € N, 


an = Ja(m)—4m aX, (*) 


where a* is an odd integer. 
For the proof of (*) it is convenient to put 


b,=1, b, =(-1)"24"-1a,, meéeEN, (3) 
and to verify (*) in the equivalent form 
b= 201 be meéeN, (**) 


where b* is an odd integer. To do this, we shall set up two functional equations 
for f, each of which gives a recurrence relation for the b,,. The first will show that 
bis always an integer, while the second will yield (**). 

From (1) we have 


m 


x>0. (4) 


1990] PROBLEMS AND SOLUTIONS 627 


From (2) and the binomial expansion 


1 
x+— 
hae 


—k 


we derive the asymptotic expansion 


f ~ Vax * jeu 
k=0 J 


1 
x+—= 
2 i 


2 of om (5) 
-E (E(k etajem Goa) 


mao \ko\m —k 


Upon multiplying the expansions (2) and (5), we obtain from (4) the expansion 


18 


1 ea) m k _> 
9 [20 = ) | Yank Yl, 7 pao (x > 0). 
j= 


0 mM m=0 \k=0 


m 


By equating coefficients and using the identity 


1 _ m 
—— |] _ (—1) (227 | 
2 qm m 
m 
we obtain 
) m m—k ~j 
(—1)"2-( m | -¥> | Jot maya, 
m k=0 j20 \— KJ ; 


| 
we 
M 


— Z (m4) \ aa 
ee ry Aye 
k=0 j=0 \™ kj 2 
Since a) = 1 and a,, = (—1)b,,2'-*” for m € N, this is easily rearranged to give 


bh. = gm-2( 2m | _ >. m— 1 Q3(m—1—-f+2p 
m m j-1 J 


—] |2%m-&-Pb,b, 


for m € N; this shows that b, = 1 and that (by induction on m) 
b,, is an integer for m € N. (6) 
The second functional equation for f, namely 
frei) = (14 5 [i+ f(x), x>0, (7) 
2x x 


is also an immediate consequence of (1). The binomial expansion 


(x+1)~“ a1 Palas 
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together with (2), yields 
f(x+1)~ ¥ | ) Ea) (x > 0), 
m=0\k=0 \/) — 


Upon multiplying the power series expansion of the first two factors on the 
right-hand side of (7) with the asymptotic expansion (2), we find that 


¥ » fence 


m=0Lk=0 
=DVilamn+ Li} 2 4;+5] 2 Ay px ™ = (xX > &) 
m=0 k=O l\m—k m—-k-1 


Now equate coefficients to obtain (for m € N) 


0 EV 2 3 Pe 
-E|c- 1) (mot). (—1)'2I- kot (24-2 | ay. 


In view of (3) this may be rewritten (by shifting the summation on k and putting 
m + 1 in place of m) as 


m _ k 
_ 42m-2 2k-1 2k\ _ yax( ™ 
Pm = 2 matin r) + ¥ [2 male ? lic 


for mEN. 

Now define B(m) for any m € N by m = 2°%m*, where m* is an odd integer. 
Let 2°”) be the lowest power of 2 not occurring in the binary expansion of m. 
Then 6(m) = B(m + 1) and we may write 


Dink (8) 


m=294 22 4 +--+ 429-1 4 255 4 D541 4 0 ee 4 D801 


with 0 <6 <ss <55,, < +++ <5S,_,, Where a = a(m) and 6 = 6(m). It follows 
that m+ 1 = 2° + 2% + 2%s+1 4 +++ +2%«-1, Hence 


a(m+1)=a(m)+1-—-B(m+1), meN, (9) 
and consequently 


m—a(m)<mt+1-a(m+1), meN. (10) 


We now prove (**) by induction on m. Since b, = 1 and a(i) = 1, (**) is 
trivially satisfied for ™ = 1, and we may assume that for some m > 2, 


b, , = 2%" Y-1 b* ,, 0k = 1,2,...,m—1, 


m— 
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where b*_, is an odd integer. Then (8) becomes 


an) +(-a(g)jerrmn, an 


_ 72m-2 ( 
m+1\m 


mb,, 


m-—1 k 
k-2 2k\ — »3x-1{ ™ a(m—k)-+k 
'D | mils ; (2) Om 


From (10) we find that aQm — k) + k > a(m — 1) + 1 for k = 2,...,m — 1, and 
we have, trivially, 

2m—-2>a(m-—1)+m-1>a(m—1) +1. 
Thus, all terms on the right-hand side of (11) are integer multiples of 2°"~ +1, 
with the sole exception of 2%”"~ b*_,. This shows that the number 

2-«"—Dmp _ 2 Ben)— atm Dig *h _ 21-80 m*b 
(recall (9)) is an odd integer. Since m* is odd and b,, is an integer by (6), this 
concludes the proof of (**). 


Solution Il by Donald E. Knuth, Stanford University, Stanford, California. By the 
duplication formula for the gamma function, the given formula can be rewritten as 


1 
(n+ 5 ; 1 
T(n)n'/? 8n 


We shall prove that if p/q is any rational number, 


Mint P/q) , , mi(p.d) | ml P.d)  mx(P.d) 
T(n)n?/4 gen qin? gin? 
where each m,(p, q) is an integer. Tricomi and Erdélyi (Pacific J. Math., 111951) 
133-142) showed that such an expansion exists with 


m,( p,q)/q*—' =M,( p/q) 


where M,(t) is a polynomial of degree 2k. This polynomial can be identified as a 
Stirling number of the first kind, because 


T(n +t) 
I(n) 


when ¢ is a positive integer. The polynomial is well known to be a linear 
combination of binomial coefficients, 


k ; 
t+j-1 
jar \ 2k 


=n(n+1)°::-(n+t-1) 


where the C, , are integers (see Jekuthiel Ginsburg, “Note on Stirling’s numbers,” 
this MONTHLY 35(1928), 77-80; Ira Gessel and Richard P. Stanley, “Stirling 
polynomials,” J. Comb. Theory (A) 24(1978), 24-33). Therefore the result follows 
from Hermite’s theorem that 


P/Q \ 2K-1 
| k Ja 


is an integer (Cours de Hermite, Faculté de Science de Paris, 1882, Legon 19). 
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The result can be sharpened when q = 2, because C, , = 1 and C, ; is even for 
j > 1. Therefore the exact power of 2 dividing the denominator of the coefficient 
of n-* is 4k — v(k), where v(k) is the number of ones in the binary representa- 
tion of k. 


Editorial remark. A problem essentially identical to this one was proposed to the 
MonTHLy in 1985 by Douglas B. Tyler, then at the University of California at 
Davis. There was, however, no proof of the formula involving the number of ones 
in the binary expansion of n, and the problem was rejected at that time. In 1986 
Dean Hickerson responded to an inquiry from Tyler with a ten page manuscript 
about recurrences and Stirling numbers that included a solution of Tyler’s prob- 
lem. In 1988 the current advanced problem editors (an entirely different set of 
people) received and accepted the Knuth-Vardi problem proposal. As indicated 
above, this proposal was accompanied by a proof of the binary expansion formula. 
Upon its publication another Hickerson-Tyler correspondence ensued. In particu- 
lar, a November 27, 1988 letter of Hickerson established that the full solution of 
the Knuth-Vardi problem was an immediate consequence of his 1986 manuscript. 
The editors have examined Hickerson’s letters and the letter rejecting Tyler’s 
original proposal, and publish this note to acknowledge the priority of Tyler and 
Hickerson. 

For other relations between binomial coefficients and expansions of integers in 
various bases (including base 2) see N. J. Fine, “Binomial coefficients modulo a 
prime,” this MonTHLY 54(1947), 589-592, and also the forthcoming book (in 
Russian) “Generalized Pascal’s Triangles and Pyramids, their Fractals, Graphs, 
and Applications” by the Soviet mathematician B. A. Bondarenko (Tashkent). 


The solutions by Hickerson, Knuth, and Richberg were the only ones received. 


A Condition Implying Nilpotence 


6584 [1988, 880]. Proposed by E. T. Parker, University of Illinois at Urbana- 
Champaign. 
Suppose a finite group G satisfies the following condition: If H is a cyclic 


subgroup of G of prime-power order or if H is an elementary abelian subgroup of 
G, then A lies in the center of the normalizer of H in G. 


(a) Show that G need not be abelian. 
(b) Show that G must be nilpotent. 


Solution by Richard M. Stafford, Department of Defense, Fort George Meade, 
Maryland. For (a), the Suzuki 2-groups of type A(n, @) provide an infinite family of 
examples. Such a group can be realized as the group of upper triangular matrices 


of the form 
1 a b 
0 ] a® ’ 
0 O 1 


where a and b are members of the finite field GF(2”) and @ is a nontrivial field 
automorphism of odd order. 
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These groups are of exponent 4 with the property that all involutions lie in the 
center. Thus we need only check that H lies in the center of the normalizer of H 
for cyclic subgroups H of order 4. But this fact is an immediate consequence of 
Lemma 11.3 (p. 265) of B. Huppert and N. Blackburn’s Finite Groups III, 
Springer-Verlag, Berlin-Heidelberg-New York, 1982. . 

We now solve (b). For brevity we refer to groups satisfying the condition of the 
problem as Z-groups. The rest of our notation as well as all results we need for 
our solution may be found in D. Gorenstein’s Finite Groups, Harper and Row, 
New York, 1968. We shall refer to this reference as [GOR]. We now need a lemma. 


Lemma. Let G be a Z-group. Then the following hold: 


(i) Every subgroup of G is a Z-group. 
(ii) If G is a p-group, then ©.,(G) (the subgroup generated by the elements of order 
p) lies in the center of G. In particular 0.,(G) is elementary abelian. 


Proof. Part (i) is obvious. So assume G is a p-group and let E be a maximal 
elementary abelian p-subgroup of G. Since G is a Z-group, it suffices to show that 
E is a normal subgroup of G. Set N = N.(E) and assume by way of contradiction 
that N is a proper subgroup of G. Then N is a proper subgroup of N.(N) 
because G is a p-group (Theorem 1.2.11 (ii) of [GOR]). Now, by the maximality of 
E and the fact that E ¢ Z(N), it follows that E = 0,,(N). Thus E is a characteris- 
tic subgroup of N. But this implies N,(N) = N and we have a contradiction. 

We are now ready to prove that Z-groups are nilpotent. It suffices to prove G 
has a normal p-complement for each prime p which divides the order of G. By the 
Frobenius normal p-complement theorem (Theorem 7.4.5 of [GOR]) this is equiva- 
lent to showing that the factor group of N.(H) with respect to C.(H) is a 
p-group for every nonidentity p-subgroup H of G. In the remainder of the proof 
let N = N.(A) and C = C,(#) for a particular nonidentity p-subgroup H of G. 
Then, by our lemma and the fact that G and H are Z-groups, it follows that 
N € C,(Q,,(47)). Now each element of N — C induces by conjugation an automor- 
phism of H. The fact that N/C is a p-group follows quickly from the fact that any 
automorphism of H whose order is relatively prime to p must act nontrivially on 
©.,(1) provided p is odd (see Theorem 5.3.10 of [GOR] for details). So we assume 
p is 2. We now proceed by contradiction and assume H (our 2-subgroup of G) is of 
minimal order subject to the order of N/C being divisible by an odd prime. Let 
y&N-C be of odd order. It follows from the minimality of A that, if y 
normalizes a proper subgroup of H, then y centralizes that subgroup. The 
structure of H is fairly well determined by Theorem 5.3.7 of [GOR]. In particular 
H is either elementary. abelian or H is of class 2, Z(H) is elementary abelian and 
H/Z(#) elementary abelian. If H is elementary abelian, then y © C as G isa 
Z-group and we have a contradiction. So we may assume the latter case holds. In 
particular H has exponent 4 and y centralizes Z(H). The fact that Z(H) is 
elementary abelian together with our lemma implies 0.,(H) = Z(H). We are now 
ready to obtain a contradiction. Since y € N — C there exist distinct elements of 
order 4, u and v, with y~ ‘uy = v. Moreover since y has odd order it follows that u 
and v lie in distinct cosets of Z(H) in H. Thus wv has order 4 also. Now [u, v] 
commutes with u and vu because A is a class 2 group. Thus by Lemma 2.2.2 of 
[GOR], (uv)? = [v, uJu?v?. But u? and v? € 0,(H) because H is of exponent 4. 
Thus v? = (y~!uy)? = y~'u?y = u? and we have (wv)? =[v,u]. Since uv is of 
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order 4, (uv)! = (uv )(uv)* = uv[v, u] = vu. So u~ (uv )u = vu = (uv)~!. Thus u 
normalizes the cyclic group (uv) of order 4. Since uv has order 4, u does not 
centralize uv. This contradicts the fact that G is a Z-group. 


Editorial comment The condition N(H) = C(H) when H is a cyclic p-group 
having order greater than p is used only when p = 2 and |H|=4. Thus the 
conclusion of the problem remains valid if we assume N(H) = C(H) only when H 
is an elementary abelian subgroup or when H is a cyclic subgroup of order 4. 


A similar solution was provided by Gary L. Walls. The editors wish to thank Everett C. Dade and 
Stephen M. Gagola, Jr., for their help with this problem. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Lattice Points. By Paul Erdos, Peter M. Gruber and Joseph Hammer, Pitman 
Monographs and Surveys in Pure Mathematics 39, Longman Scientific and 
Technical, co-published with John Wiley and Sons, New York, 1989, vii + 
184 pp. 


WILLIAM FIREY 
Department of Mathematics, Oregon State University, Corvallis, OR 97331 


Pythagoras’ theorem and Euclid’s proof that there are infinitely many prime 
numbers are initial landmarks for geometry and elementary number theory. They 
show the great age of these parts of mathematics. So it is surprising that fruitful 
interactions between these disciplines came, almost all, within the last 100 years, 
more than two millenia after Euclid. To illustrate the interaction I choose first one 
of the earliest results of this sort: Minkowski’s theorem about lattice points in 
convex sets, published in 1893 as a paper read at the International Mathematical 
Congress at Chicago. It asserts that a compact convex set C in R¢% which is 
symmetric with respect to the origin o and has volume at least 2“ must contain a 
point, other than o, with integer coordinates. The idea of the proof is as simple 
and elegant as the theorem’s statement. Suppose C \ {o} has no points with 
integer coordinates. Form all integer translates of (5)C, that is sets of the form 


(Z)C+n= {x = (41,...,Xy)1 x = (S)y +n, y= (y,,--+> Ya) EC, 
n= (ny,...,Mq) © Z%. 


Use the convexity of C and the assertion that C \ {o} has no points of Z“ to prove 
that the sets (5)C + n are disjoint. Also these sets meet any unit cube, cornered at 
points of Z“%, in pieces of total volume equal to that of (5)C. This covered volume is 
(5)* times the volume of C; it is also less than the volume of the unit cube. Hence 
C has volume less than 2%. 

The theorem begins what Minkowski called the geometry of numbers. It has 
beautiful applications. Here is one. If A = [a;,] is a symmetric, positive definite 
d X d matrix, then, for a > 0, 


d d 
x: Q(x) = LE di a,x;x, <a 
i=1 j=l 


is an ellipsoidal solid, centered at 0, of volume a“/*x,/(det A)!/*, where x, is the 
volume of the Euclidean unit ball in R%. In particular, with a = 4(det A/x2)'/%, 
this ellipsoid has volume 2“. Minkowski’s theorem shows that 


min Q(x) < 4(det 4/x2)'”“ 
Z4\{o} 
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Also the same theorem gives information about solutions to Diophantine 
problems of the form: find n in Z@ such that, with (v,n) = D4 ,v,n; 


[(uy,N)| <7y,.-.,|(Ug,N)| < Te 
for given positive real numbers 7,,...,7, and vectors u,,...,u,4 which are inde- 
pendent, so that the determinant det[u,,..., u,] having these vectors as columns is 


not zero. Take C to be the convex, compact polyhedral solid of points +x which 
satisfy 


\<u,, x)| < Trosees |(uag, x)| < Tq; 
its volume is 2¢7, --+ 7,/|det[u, --: u,y]|. Minkowski’s theorem shows that there 
are solutions if 7, --- tz > |detlu, «+: ugl|. 


Notice how Minkowski’s theorem and its applications use interplays of quite 
simple notions from elementary number theory and geometry. To appreciate the 
power of this interplay the reader might attempt purely number-theoretic treat- 
ments of the first application with d = 2. Assistance can be found in reference [1, 
pp. 30-31]. 

Notice also the importance of the set Z%, the simplest geometric lattice in R%. 
More generally, a lattice in R@ is the set of all linear combinations with integer 
coefficients of vectors which form a basis for R*. The book under review surveys 
those topics in which geometric lattices play an important role, the geometry of 
numbers being but one such. 

The work is like a Baedeker’s travel guidebook: all the sights are listed, with 
extensive historical backgrounds for the important ones—in a few major cases 
there are detailed descriptions; excellent maps are provided. A typical chapter 
surveys a topic with a summary of its history, a discursive description of some 
major results, and closes with a nicely classified summary of recent advances. All 
three authors are noted experts in the areas covered; they are clear expositors and 
meticulous scholars; they have produced a fine book. 

The extent of the country covered is large. Here are located parts of two of 
Hilbert’s 23 problems presented to the 1900 International Congress of Mathemati- 
cians at Paris; problem 3 on the equidissectability of polyhedra and problem 18 in 
which Hilbert included the questions of densest packings of equal solids in space. 
Going out from Pick’s elegant formula for the area enclosed by a closed Jordan 
polygon with vertices in Z*, the authors lead us to the algebraic-geometric notion 
of a toric variety and its connection with new proofs of the Fenchel-Aleksandrov 
inequalities for convex compacta. Hilbert’s 18th problem is met again in the study 
of tiling problems and crystallographic groups. 

There is an excursion through positive quadratic forms, beginning with an 
equivalence notion: Q, Q' are equivalent precisely when Q’(x) = Q(Tx) for some 
d X d integer matrix T with determinant +1. This connects with lattices. Gener- 
ate a lattice L from a basis {b,,..., b,}; form the matrix B having these vectors as 
its columns; then 


O(x) =x"B"Bx 
is a positive quadratic form associated with L. Q determines L up to a 


rotation—possibly improper. A different basis for L produces another metric form 
O'; QO, O' will be equivalent. Thus the study of possible bases for L amounts to the 
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study of forms Q’ equivalent to Q. This excursion ends with a survey of locally 
most dense lattice packings of equal balls. For each lattice L there is a maximum 
number p(L) such that packed balls, centered at points of L, have disjoint 
interiors. As before, let B = BCL) signify the matrix associated with a basis for L. 
|det B(L)| is the volume of the parallelohedral solid or cell determined by the basis 
for L. The total volume of the pieces of packed balls falling within one cell equals 
the volume of a ball of radius p(L). Hence, 


r(L) = (p(L))*«/|det B(L))| 


measures the packing density. It is locally most dense just when r(L) > r(L’) for 
all lattices L’ near enough to L in a suitable sense. 

On the frontiers can be found power diagrams, zeta-functions for lattices, 
dimers, rook polynomials, Steiner trees, polyominoes—also known as n-celled 
animals—and other exotica. 

A good guidebook will lure the armchair traveler into an actual visit: this is a 
good guidebook. 
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General, S*, P*, L**. Islands of Truth: A 
Mathematical Mystery Cruise. Ivars Peterson. WH 
Freeman, 1990, xvii + 325 pp, $19.95. [ISBN: 0- 
7167-2113-9] “The pursuit of mathematical truth 
is a tricky business, full of mysteries and surprises.” 
This vivid, compelling, pleasure-filled guide reveals 
to the mathematical tourist many mysteries of mod- 
ern mathematics: knots, tilings, computers, fractals, 
dimensions, surfaces, chaos, decompositions, NP- 
problems, and much, much more. Based on eight 
years of reporting on mathematics for Science News, 
both the choice of and approach to topics provide am- 
ple evidence of the reporter’s craft at its very best. 
An excellent, innovative option for any mathematics 
course used for general education. LAS 


General, P*, L. Renewing U.S. Mathematics: A 
Plan for the 1990s. National Research Council. Na- 
tional Academy Pr, 1990, xiv + 133 pp, (P). [ISBN: 
0-309-04228-3] This is “David II,” the sequel to the 
1984 ground-breaking report from a committee of the 
National Academy of Sciences chaired by Edward E. 
David, Jr. This report recommends continued effort 
to achieve the goals of the 1984 document (“stay the 
course; see it through”) modified, however, by a spe- 
cial focus on human resource issues. Three primary 
recommendations address increasing federal funding 
for mathematical research to $250 million per year; 
improving career paths in the mathematical sciences; 
and increasing to 2600 the number of supported se- 
nior investigators. Special feature: a 45-page ap- 
pendix giving brief descriptions of 27 different areas 
of research accomplishments and related opportuni- 
ties. LAS 


Precalculus, T(13: 1, 2). Precalculus: Func- 
tions and Graphs. Linda Gilbert, Jimmie Gilbert. 
Addison-Wesley, 1990, xiv + 615 pp, $32.76. 
[ISBN: 0-201-51020-0] A pre-precalculus text cov- 
ering standard range of topics. Many routine prob- 
lems, plus a number of problems resembling calculus 
I min/max applications. Each chapter has 1-3 pages 
of “non-solutions” to problems, enabling students to 
see typical errors. FR 


Finite Mathematics, T(14-15:1). Applied Finite 


Mathematics, Third Edition. S.T. Tan. PWS/Kent, 
1990, xiv + 543 pp, $35. (ISBN: 0-534-92140-X] 
Changes from the Second Edition are relatively minor 
and technical. (First Edition, TR, June-July 1983; 
Second Edition, TR, June-July 1987.) JS 


Foundations, T(18), P, L. Lecture Notes in 
Mathematics-1407: Proof Theory, An Introduction. 
Wolfram Pohlers. Springer-Verlag, 1989, vi + 213 
pp, $20 (P). [ISBN: 0-387-51842-8] A self-contained 
introduction to pure number theory, to the axiomati- 
zation of the natural numbers (with careful consider- 
ation for consistency of the axioms), and to ordinal 
analysis, the computation of the least ordinal not 
provable in a formal theory. Follows Hilbert’s pro- 
gram which can -be roughly characterized as an at- 
tempt to axiomatize the whole of mathematics and 
to prove that the axioms are consistent. SB 


Number Theory, T*(16-18), S**, P**, L*. 
Algorithmic Algebraic Number Theory. M. Pohst, 
H. Zassenhaus. Encyclopedia of Math. & Its Ap- 
plic. Cambridge U Pr, 1989, xiv + 465 pp, $89.50. 
[ISBN: 0-521-33060-2] Introduction to constructive 
algebraic number theory and some associated algo- 
rithms, assuming a basic algebra background. Covers 
the determination of the Galois group of an equation 
in Q[z], the computation of the maximal order of the 
unit group of an algebraic number field, and the ideal 
class group. Appendix includes extensive numerical 
tables and a list of algorithms presented in the text. 
A nice companion text (with exercises) for a first 
course in algebraic number theory. Great reference 
for expert and non-expert alike. SB 


Number Theory, P. Lecture Notes in Mathemat- 
ics-1415: Cinquante Ans de Polynémes: Fifty Years 
of Polynomials. Eds: M. Langevin, M. Waldschmidt. 
Springer-Verlag, 1990, ix + 235 pp, $21.80 (P). 
[ISBN: 0-387-52190-9] Proceedings of a conference 
held in honor of Alain Durand (1949-86) at the Insti- 
tut Henri Poincaré in Paris, May 26-27, 1988. Eleven 
papers from the conference, plus four by Durand. All 
but one are in French. DFA 


Linear Algebra, T(16-17: 1), S, L. Lectures on 
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Linear Algebra. I.M. Gel’fand. Dover, 1989, 185 pp, 
$5.95 (P). [ISBN: 0-486-66082-6] Concise treatment 
of basic ideas of linear algebra at a relatively ad- 
vanced level. Not an introductory-level text. Chap- 
ters on n—dimensional spaces, linear and bilinear 
forms; linear transformations; canonical forms; in- 
troduction to tensors. LC 


Algebra, S(18), P. Clifford Theory for Group Rep- 
resentattons. Gregory Karpilovsky. Math. Stud., 
V. 156. North-Holland (US Distr: Elsevier Science), 
1989, x + 364 pp, $110.50. [ISBN: 0-444-87377-5] 
Intended to present state of developments originat- 
ing from Clifford’s work begun in 1937 to obtain ir- 
reducible representations for a finite group G from 
those for a normal subgroup N. Assumes a gradu- 
ate course in algebra. A survey chapter is followed 
by chapters on restriction, induction, and extension. 
Bibliography, index. JS 

Algebra, S(18), P. Unit Groups of Group Rings. 
Gregory Karpilovsky. Pitman Mono. & Surveys in 
Pure & Appl. Math., V. 47. Longman Scientific & 
Technical (US Distr: Wiley), 1989, xiii + 393 pp, 
$143. [ISBN: 0-582-02341-6] Assuming a graduate 
course in algebra as background, first three chapters 
discuss general group rings and the group of units. 
Subsequent chapters deal with unit groups for com- 
mutative groups, work of Roggenkamp and Scott on 
conjugacy, and metabelian integral group rings. Bib- 
liography, index. JS 


Calculus, T(13). Calculus, An Integrated Ap- 
proach, Donald B. Small, John M. Hosack. Intern. 
Ser. in Pure & Appl. Math. McGraw-Hill, 1990, xviii 
+ 685 pp, $34.95. [ISBN: 0-07-058264-5] Here is a 
calculus text that is truly new. It is a new book, not 
a new edition, from a new author, written in a new 
way. Familar in the MAA as a pioneer in computer 
algebra systems, Don Small has given us a lean book 
that should be examined by everyone who is sympa- 
thetic to the spirit of calculus reform. AWR 


Calculus, T*(13: 1, 2). Calculus for the Manage- 
rial, Life, and Social Sciences, Second Edition. S.T. 
Tan. PWS-Kent, 1990, xiv + 568 pp, $35. [ISBN: 
0-534-92145-0] For a one-semester or two-quarter 
introductory course. Intuitive; informal; ample ex- 
amples, exercises, problems (odd answers); numer- 
ous figures, nicely done and helpful. Partial solu- 
tions manual available to students; answer book, test 
book, computerized test bank available to adopters; 
graphics/computation software package accompanies 
book but was not seen by reviewer. Changes in this 
edition (First Edition, TR, May 1988) include pre- 
liminary chapter on precalculus; boxed definitions, 
methods and procedures; so-labelled calculator exer- 
cises; earlier introduction of integration. JDEK 


Real Analysis, T(15). Advanced Calculus, Second 
Edition. David V. Widder. Dover, 1989, xvi + 520 
pp, $10.95 (P). [ISBN: 0-486-66103-2] A classic text 
written when advanced calculus took up where cal- 
culus left off and pushed forward in areas felt to be 
important to applied mathematicians and engineers. 
It was never doubted, of course, that pure mathe- 
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maticians also needed to cover this same material, 
but their ultimate concerns (proofs, examination of 
deeper concepts) were relegated to the last part of 
each chapter. AWR 


Complex Analysis, S(18), P. Several Complez 
Variables I: Introduction to Complez Analysis. Ed: 
A.G. Vitushkin. Encyclop. of Math. Sci., V. 7. 
Springer-Verlag, 1990, 248 pp, $59. [ISBN: 0-387- 
17004-9] Five long survey articles, in expository 
style, on fundamental topics in the theory of func- 
tions of several complex variables: basic ideas, inte- 
gral representations, analytic varieties, holomorphic 
mapping, residue theory. With their extensive bib- 
liographies, the articles provide a wide overview of 
modern work in several complex variables. PZ 


Complex Analysis, S(18), P. Several Complez 
Vartables IV: Algebraic Aspects of Complez Anal- 
ysis. Eds: S.G. Gindikin, G.M. Khenkin. Ency- 
clop. of Math. Sci., V. 10. Springer-Verlag, 1990, 
251 pp, $59.50. [ISBN: 0-387-18174-1] Four long 
survey articles, in expository style, on algebraic top- 
ics in the theory of functions of several complex vari- 
ables: sheaves, vector bundles, deformations of com- 
plex spaces, and complex manifolds. With large bib- 
liographies; a valuable reference work for researchers 
in the field. PZ 

Differential Equations, T(17-18: 1), S, P. Dif- 
ferential Equations: Their Solution Using Symme- 
tries. Hans Stephani. Ed: Malcolm MacCallum. 
Cambridge University Pr, 1989, xii + 260 pp, $17.95 
(P); $44.50. [ISBN: 0-521-36689-5; 0-521-35531-1] 
A nice introduction to the theory and practice of 
finding and using symmetries to solve differential 
equations. Intuitive and practical rather than rig- 
orous, this book is aimed at scientists and applied 
mathematicians. One-half of the book deals with 
ordinary differential equations, the other half with 
partial differential equations. More than 100 exer- 
cises. SP 


Partial Differential Equations, T(16-17: 2), 
S, L. Partial Differential Equations: Analytical 
Solution Techniques. J. Kevorkian. Math. Ser. 
Wadsworth, 1990, xiv + 547 pp, $54.95. [ISBN: 0- 
534-12216-7] This book fills the vacuum between 
texts for a first encounter with partial differential 
equations and graduate texts on theory, and fills it 
very well. Solution techniques for linear and nonlin- 
ear equations, with discussions of physical motiva- 
tions, examples, and lots of problems. Exposition 
is clear, problems are interesting, and the author 
wisely left numerical methods to be covered in other 
texts. JO 


Partial Differential Equations, P. Current 
Progress in Hyperbolic Systems: Riemann Problems 
and Computations. Ed: W. Brent Lindquist. Con- 
temp. Math., V. 100. AMS, 1989, xi + 367 pp, 
$41 (P). [ISBN: 0-8218-5106-3] Proceedings of the 
AMS-SIAM-IMS conference held at Bowdoin College 
in July 1988. Twenty-five papers on gas dynamics, 
viscoelastic materials, phase transitions, and multi- 
phase flow in porous media. JO 
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Partial Differential Equations, P. Inverse Prob- 
lems in Partial Differential Equations. Eds: David 
Colton, Richard Ewing, William Rundell. SIAM, 
1990, 214 pp, $31.50 (P). [ISBN: 0-89871-252-1] 
Contains thirteen of the invited lectures at the con- 
ference on inverse scattering problems held at Hum- 
boldt State University during the summer of 1989. 
Concentrates on the themes of coefficient identifica- 
tion for parabolic equations, impedance tomography, 
and inverse scattering. SP 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1394: Nonlinear Semigroups, Partial 
Differential Equations and Attractors. Eds: T.L. 
Gill, W.W. Zachary. Springer-Verlag, 1989, ix + 
233 pp, $20 (P). [ISBN: 0-387-51594-1] Proceed- 
ings of the second symposium on this subject held 
at Howard University in Washington, DC in August 
1987. GN 

Numerical Analysis, T(18: 2). Finite Difference 
Schemes and Partial Differential Equations. John 
C. Strikwerda. Math. Ser. Wadsworth, 1989, xii + 
386 pp, $47.95. [ISBN: 0-534-09984-X] Intended for 
use as an introductory graduate text for students in 
applied mathematics or engineering. The necessary 
theory of partial differential equations is developed 
in the text, including the basic theory of well-posed 
initial value problems. Includes exercises at the end 
of each section. GN 

Functional Analysis, T(18: 1), S. Topics in 
Functional Analysts and Applications. S. Kesavan. 
Wiley, 1989, xii + 267 pp, (P). [ISBN: 81-224-0062-0] 
Paperback reprint of a 1989 hardcover (TR, October 
1989). Treats Sobolev spaces, elliptic boundary value 
problems, semigroups of operators, and some nonlin- 
ear analysis. Four appendices, references, index. JS 


Analysis, T(16-17: 1). Lectures on Fourier Series. 
L. Solymar. Oxford U Pr, 1988, x + 120 pp, $19.95 
(P); $45. [ISBN: 0-19-856199-7; 0-19-856198-9] A 
very readable introductory text. Begins with a dis- 
cussion of approximating functions, then continues 
with actually finding Fourier series and demonstrat- 
ing their properties. Text concludes with applica- 
tions to ordinary and partial differential equations. 
Includes many examples throughout, as well as exer- 
cises at the end of each chapter. GN 


Analysis, P. Analysis, et cetera. Eds: Paul H. Ra- 
binowitz, Eduard Zehnder. Academic Pr, 1990, xii 
+ 694 pp, $69.96. :[ISBN: 0-12-574249-5] The title 
is appropriate for a collection of papers selected for 
a volume that commemorates Jurgen Moser’s 60th 
birthday—the idea being to reflect his interests in 
differential equations, dynamical systems, and dif- 
ferential geometry. AWR 


Geometry, P. The Volume of Convez Bodies and 
Banach Space Geometry. Gilles Pisier. Tracts. in 
Math., V. 94. Cambridge University Pr, 1989, xv + 
250 pp, $49.50. (ISBN: 0-521-364655] These notes 
present recent results (many obtained during 1984- 
86) that relate the volume of convex bodies in R® and 
the geometry of the corresponding finite-dimensional 
normed spaces. They draw upon ideas from con- 
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vex sets, probability, entropy numbers, and the local 
theory (K-convexity, cotype, and type) of Banach 
spaces. Contains an extensive bibliography. AWR 


Geometry, S*, P**, L**. Beyond the Third 
Dimension: Geometry, Computer Graphics, and 
Higher Dimensions. Thomas F. Banchoff. Scien- 
tific American Library, 1990, ix + 210 pp. [ISBN: 
0-7167-5025-2] A portrait of dimensions, visually 
rich and intellectually enriching. Slices and contours, 
shadows and perspective, coordinates and configura- 
tion, all beautifully illustrated with full color plates 
and computer-generated drawings. Offers uncom- 
mon mathematical insights into art and architecture, 
science and society. A wonderful gift for mathemati- 
cians to give—to their family and friends, and even 
more, to their students: an open door to the hidden 
beauty of mathematics. LAS 


Optimization, P. Lecture Notes in Mathematics- 
1403: Combinatorial Optimization. Ed: B. Sime- 
one. Springer-Verlag, 1989, v + 312 pp, $28.60 
(P). (ISBN: 0-387-51797-9] Proceedings of the Cen- 
tro Internazionale Matematico Estivo (CIME) Inter- 
national Summer School on combinatorial optimiza- 
tion held at Como, Italy, August 25-September 2, 
1986. LC 


Optimization, P. Lecture Notes in Mathematics- 
1405: Optimization. Ed: S. Dolecki. Springer- 
Verlag, 1989, 220 pp, $20 (P). {ISBN: 0-387-51970-X] 
Conference reports related to the increasing power of 
high speed computers: new optimization techniques 
being brought into the realm of possibility, and the 
attendent questions of complexity. AWR 


Elementary Statistics, T(1, 2).  Understand- 
ing Statistics, Fifth Edition. Lyman Ott, William 
Mendenhall. PWS-Kent, 1990, xvii + 727 pp. [ISBN: 
0-534-92154-X] Text for general statistics for all ma- 
jors; little mathematics. Surveys graphical and 
numerical data description, probability (not much) 
and sampling distributions, inferences for means, 
variances, proportions, regression, analysis of vari- 
ance, count data. This edition adds chapters on 
surveys and reporting results and other additions. 
(First Edition, TR, March 1974; Second Edition, TR, 
November 1976.) TH 


Statistics, P. Lecture Notes in Statistics-59: A Cal- 
culus for Factorial Arrangements. Sudhir Gupta, 
Rahul Mukerjee. Springer-Verlag, 1989, vi + 126 pp, 
$16.20 (P). [ISBN: 0-387-97172-6] Survey of the de- 
velopments since the 1970's of the calculus for facto- 
rial arrangements. Includes an introductory chapter, 
derivations of characterizations for balance and or- 
thogonality in factorial designs, applications in con- 
struction. LC 


Statistics, T*(14-16), L. Statistics: Theory and 
Methods. Donald A. Berry, Bernard W. Lindgren. 
Brooks/Cole, 1990, xii + 763 pp. [ISBN: 0-534- 
09942-4] Calculus-based text covering usual topics 
in course for junior mathematics majors, beginning 
with probability. Uses many real examples for moti- 
vating and explaining. Flow is somewhat disjointed, 
due to examples and many topics. Worth consider- 


1990] 


ing. TH 


Statistics, P, L. Sampling. Eds: P.R. Krishnaiah, 
C.R. Rao. Handbook of Stat., V. 6. North-Holland 
(US Distr: Elsevier Science), 1988, xvi + 594 pp, 
$125. [ISBN: 0-444-70289-X] Twenty-four chapters 
by 38 authors. Nineteen chapters cover general top- 
ics (e.g., systematic sampling, ratio and regression 
estimators) and five cover specific applications (e.g., 
transect sampling to assess wild life populations). TH 


Statistics, T(15), L*. Sample Design in Business 
Research. W. Edwards Deming. Classics Library, 
Wiley, 1990, xx + 517 pp, $29.95 (P). [ISBN: 0- 
471-52370-4] Reprint of a 1960 text. Many stories; 
mathematics later in the book. Sections are “Stan- 
dards of Statistical Practice,” “Replicated Sampling 
Designs,” and “Theory.” TH 


Programming, T(13), C. BASIC Programming 
with the IBM PC, Second Edition. Peter Mears. Ser. 
in Comp. Sci. Brooks/Cole, 1990, xiii + 512 pp, $38 
(P). (ISBN: 0-534-12156-X] Introductory level Basic 
programming text assuming no prior computer expe- 
rience. Includes many clear examples and exercises, 
and a tutorial program. Emphasis is on program- 
ming skills rather than problem solving techniques 
and logic. FR 


Computer Systems. Advances in Software Science 
and Technology, Volume 1. Japan Soc. for Software 
Sci. & Tech. Academic Pr, 1990, ix + 218 pp, $49. 
[ISBN: 0-12-037101-4] First issue of a new mono- 
graph series produced by the Japan Society for Soft- 
ware Science and Technology (JSSST). It is planned 
on being an annual publication whose emphasis is on 
software research and development. Volume 1 con- 
tains nine research contributions in such areas as for- 
mal semantics, attribute grammars, and functional 
languages. Future issues will be addressing a wide 
range of software related topics. GMS 


Computer Systems, P, L*. The Matriz: Com- 
puter Networks and Conferencing Systems World- 
wide. John W. Quarterman. Digital Pr, 1990, 
xxvii + 719 pp, $49.95 (P). [ISBN: 1-55558-033-5] 
The bible—actually, the tower of Babel—of com- 
puter networks. Layers, protocols, administration, 
and standards, followed by descriptions of hundreds 
of different (non-commercial) networks around the 
world—organized by region and country, each with 
name, address, telephone, and network address of a 
contact person. Sequel to an influential 1986 survey 
in the Communications of the ACM. Soon an elec- 
tronic version of the book and its continuous updates 
will be available. LAS 


Applications, P. Selected Papers of S. Chan- 
drasekhar, Volume 4: Plasma Physics, Hydrody- 
namic and Hydromagnetic Stability, and Applica- 
tions of the Tensor-Virial Theorem. S. Chan- 
drasekhar. University of Chicago Pr, 1989, xiii + 
585 pp, $29.95 (P); $79.95. (ISBN: 0-226-10097-9; 0- 
226-10096-0] Thirty-five papers by author and co- 
authors from 1954 through 1971. In three parts: 
plasma physics, hydrodynamic and hydromagnetic 
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stability, and tensor virial theorem and its applica- 
tions. Volume four of six. JDEK 


Applications, P. Selected Papers of S. Chan- 
drasekhar, Volume 3: Stochastic, Statistical, and Hy- 
dromagnetic Problems in Physics and Astronomy. S. 
Chandrasekhar. University of Chicago Pr, 1989, xiv 
+ 642 pp, $34.50 (P); $91.95. [ISBN: 0-226-10095- 
2; 0-226-10094-4] Forty-six papers by author and 
co-authors from 1941 through 1958. In four parts: 
dynamical friction and Brownian motion, statistical 
problems in astronomy, statistical theory of hydro- 
dynamic and hydromagnetic turbulence, and hydro- 
magnetic problems in astrophysics. Volume three of 
six. JDEK 


Applications (Economics), P. Recursive Methods 
in Economic Dynamics. Nancy L. Stokey, Robert 
E. Lucas, Jr. Harvard U Pr, 1989, xviii + 588 
pp, $37.50. [ISBN: 0-674-75096-9] The idea of the 
authors is to develop the substantial mathematical 
tools of recursive methods: metric space concepts 
for optimization, measure theory and Markov the- 
ory for probabilistic techniques, fixed point theorems 
for the study of stability. After presentation of the 
mathematical tools, applications are taken up. This 
is a high-powered book that comes with some high- 
powered endorsements that will be tough to live up 
to. Only time will tell if it achieves the stature of 
Hick’s Value and Capital, but it does break some new 
ground in its full development of recursive methods 
as a unifying thread. AWR 


Applications (Engineering), P. Computational 
Aspects of VLSI Design with an Emphasis on Semi- 
conductor Device Simulation. Ed: Randolph E. 
Bank. Lect. in Appl. Math., V. 25. AMS, 1990, 
viii + 190 pp, $53. [ISBN: 0-8218-1132-0] Mono- 
graph contains reprints of the twelve invited papers 
presented at the 18th SIAM Summer Seminar on 
Applied Mathematics. The papers present advanced 
research on the use of numerical simulation for the 
design of VLSI chips, allowing engineers to test and 
evaluate chip design before going into the more costly 
and time-consuming prototyping phase. This short- 
ens the overall design cycle considerably. GMS 


Applications (Engineering), P. Lecture Notes in 
Engineering-52: Slope Analysis Using Boundary El- 
ements. Y.S. Jiang. Springer-Verlag, 1989, iv + 176 
pp, $27 (P). (ISBN: 0-387-51625-5] Slope stability 
is a classical topic in soil and rock mechanics. These 
notes study slope failure due to slip failure: a mass 
of soil or rock sliding over a surface. The boundary 
element method is studied as a tool in slope failure 
analysis. LC 


Reviewers 


DFA: David F. Appleyard, Carleton; SB: Steve Benson, 
St. Olaf; LC: Laura Chihara, St. Olaf; TH: Timothy Hester- 
berg, St. Olaf; JDEK: Joseph D.E. Konhauser, Macalester; 
GN: Gail Nelson, Carleton; JO: Jeff Ondich, St. Olaf; 
SP: Samuel Patterson, Carleton; FR: Flauren Ricketts, 
Macalester; AWR: A. Wayne Roberts, Macalester; GMS: G. 
Michael Schneider, Macalester; JS: John Schue, Macalester; 
LAS: Lynn Arthur Steen, St. Olaf; PZ: Paul Zorn, St. Olaf. 
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This book is not just for numerical an- 
alysts, it is accessible to anyone with 
some mathematical foundation who 
wants to taste selections from a nu- 
merical analysis menu. The papers do 
a good job in motivating and introduc- 
ing their subjects, and some provide 
a nice historical background. | recom- 
mend the articles in this book for any- 
one interested in learning more about 
the current research interests in numer- 
ical analysis. They may excite their 
readers to investigate further this rel- 
evant field of mathematics. 
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STUDIES IN NUMERICAL ANALYSIS 
Gene H. Golub, Editor 


Golub set high standards in the lead arti- 
cle on numerical solution of polynomials by 
J.H. Wilkinson which won the Chauvenet 
Prize as the best piece of mathematical ex- 
position to appear in 1985. These papers 
describe the wide range of research in nu- 
merical analysis. A variety of problems and 
of computational tools is treated. Some-of 
the computations require only a hand-held 
calculator, while others require a powerful 
computer. 

422 pp., 1985, ISBN 0-88385-126-1 

List: $46.50 MAA Member: $34.50 


Catalog Number MAS-24 


The Mathematical Association of America 


1529 Eighteenth Street, N. W. 
Washington, D. C. 20036 
(202) 387-5200 


Cambridge University Press 


Volterra Integral and Functional 
Equations | 

G. Gripenberg, S.O. Londen, and 

O. Staffans 


Convincingly demonstrates that the theory of 
Volterra equations exhibits a rich variety of 
features not present in the theory of ordinary 
differential equations. 


Encyclopedia of Mathematics and its Applications 34 
1990 701 pp. 37289-5 Hardcover $99.50 


The Mathematics of Projectiles 
in Sport 

Neville de Mestre 

A comprehensive and rigorous account of the 
techniques applied to the motion of projectiles. 
The final section of the book is an enlightening 
collection of sporting applications, ranging from 
the high jump to frisbees and soccer to table 
tennis. 


Australian Mathematical Society Lecture Series 6 
1990 174 pp. 39857-6 Paper $22.95 


Introduction to Lattices 

and Order 

B.A. Davey and H.A. Priestley 
Addresses the importance of ordered structures 
such as sets and lattices. Recognizes the in- 
creased importance of ordered structures in 
theoretical computer science and all areas of 
discrete mathematics. 


1990 150 pp. 36584-8 Hardcover $49.50 
36766-2 Paper $18.95 


Boojums All the Way Through 
Communicating Science in a Prosaic Age 
N. David Mermin 
This collection of nontechnical writings in and 
about phyics deals in a variety of ways with the 
process of communicating modern physics to both 
physicists and nonphysicists. With wry humor and 
a style for liveliness, Mermin demonstrates his 
ability to convey complex ideas with startling 
simplicity. 
1990 308 pp. 38231-9 Hardcover $49.50 

38880-5 Paper $16.95 


Basic Hypergeomeftric Series 
George Gasper and Mizan Rahman 
An authoritative account of basic hypergeometric 
series and their applications. Provides explicit and 
detailed information about summation, transfor- 
mation and expansion formulas, and contour 
integrals. 


Encyclopedia of Mathematics and its Applications 34 
1989 350 pp. 35049-2 Hardcover $59.50 


Mathematics and Cognition 

A Research Synthesis by the Inter- 
national Group for the Psychology 

of Mathematics Education 

Pearla Nesher and Jeremy Kilpatrick 
A distinguished collection describing research 
findings that shed light on the learning of mathe- 
matics from early arithmetic to high levels of 
algebra and geometry. 

International Commission on Mathematical Instruction 
Study Series 


1990 180 pp. 36608-9 Hardcover $39.50 
36787-5 Paper $14.95 


Continuous and Discrete Modules 
Saad M. Mohamed and Bruno J. Muller 
A complete and up-to-date account of the subject 
of continuous and discrete modules that at the 
same time gives a unified picture of the theory. 


London Mathematical Society Lecture Note Series 147 
1990 126 pp. 39975-0 Paper $22.95 


Lectures on Minimal Surfaces 
Volume 1 

Johannes C.C. Nitsche 

A broad survey of parametric minimal surface in 
Euclidean space that extends from the classical 


beginnings to the current situation. 
1989 563 pp. 24427-7 Hardcover $135.00 


Representations of Nilpotent Lie 
Groups and Their Applications 
Part | 

Frederick P. Greenleaf and 

Laurence Corwin 

A survey of harmonic theory developed since the 
pioneering work of Kirillov in 1958. Provides not 
only a modern account of all topics necessary for 
current research, but also includes many com- 
puted examples. 


Cambridge Studies in Advanced Mathematics 
1990 269 pp. 36034-X Hardcover $69.50 


Now in paperback... 


Winner of the Outstanding 
Academic Books/Nonprint Materials 
1988-89 Award 


— Choice Magazine 


Fourier Analysis 
T.W. Korner 


“...an extraordinary and very attractive book... 
| would like to see the book on the desk of every 
pure mathematician with an interest in analysis, 
and of every teacher of applied mathematics 
whose work involves analysis.” — SIAM Review 
1988 591 pp. 38991-7 Paper $34.50 


At bookstores or from 


CAMBRIDGE UNIVERSITY PRESS 
40 West 20th Street, New York, NY 10011. Call toll-free 800-872-7423 outside NY State. 800-227-0247, NY State only. 
MasterCard/VISA accepted. Prices subject to change. 


GEOMETRY & TOPOLOGY 


FIRST CONCEPTS OF TOPOLOGY: 
The Geometry of Mappings of Seg- 
ments, Curves, Circles, and Disks 


W.G. Chinn and N.E. Steenrod 


This clear and winning little book, for 
readers willing to come to genuine 
grips with the idea of a mathematical 
proof, presents topology . . . as math- 
ematicians see it. The parlor tricks are 
gracefully alluded to here and there, 
but they are distinctly for after hours. 
The center of interest is the stuff itself: 
powerful notions of set theory are... 
exploited to define open sets and their 
coverings, and from them [to prove] 
the key theorems. .. One cannot any 
longer doubt that a single stroke of a 
knife exists that divides any irregular 
ham sandwich so that ham and both 
bread slices can be shared with perfect 
fairness by two consumers. 


Philip Morrison in Scientific American 
Contains over one hundred and fifty prob- 
lems and solutions. 

160 pp., 1966, ISBN 0-88385-618-2 

List: $10.00 MAA Member: $8.00 

Catalog Number NML-18 


FROM PYTHAGORAS TO EINSTEIN 
K.O. Friedrichs 


Starting with area and dissection proofs of 
the Pythagorean theorem, Friedrichs pro- 
ceeds gently, step by step, to cover vectors, 
coordinates, elastic and inelastic impacts, 
and relativistic space-time—ending with a 
derivation of a contemporary formula rival- 
ing the Pythagorean theorem in fame, & = 
mec. 


88 pp., 1965, ISBN 0-88385-616-6 
List: $9.50 MAA Member: $7.60 
Catalog Number NML-16 


GEOMETRY REVISITED 
H.S.M. Coxeter and S.L. Greitzer 


After seeing a proof that a segment has 
a unique midpoint, students may wonder 
whether there are any interesting theorems 
in geometry. If either they or their teachers 
get a hold of this book, they will learn that 
there are many beautiful and nontrivial theo- 
rems in geometry. Among those found here 
are the theorems of Ceva, Menelaus, Pap- 
pus, Desargues, Pascal, and Brianchon. A 
nice proof is given of Morley’s remarkable 
theorem on angle trisectors. The transfor- 
mational point of view is emphasized: re- 
flections, rotations, translations, similarities, 
inversions, and affine and projective trans- 
formations. Many fascinating properties of 
circles, triangles, quadrilaterals, and conics 
are developed. 


193 pp., 1967, ISBN 0-88385-619-0 
List: $11.75 MAA Member: $9.50 
Catalog Number NML-19 


The very lucid presentation takes the 
reader from the elementary problems 
of plane Euclidean geometry to the fun- 
damental concepts of non-Euclidean 
geometry . The book Is rich in 
remarkable facts and thereby is very 
effective in promoting the significance 
and the value of geometry in mathe- 
matical teaching, a promotion which is 
very necessary . . . The always origi- 
nal developments use very simple tools 

. and soon proceed to higher con- 
figurations. 


K. Strubecker in Mathematical Reviews 


Order from: 


The Mathematical 
Association of America 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 
(202) 387-5200 


ee ee ee 


When was the last 
time a computer 
program helped 
you think about 
mathematics? 


Call or write for our latest software catalog. 
For PC or Macintosh. 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


Several Complex Variables 


Proceedings of the Mittag-Leffler Institute, 1987-1988 
Edited by John E. Fornaess 


There was a special year devoted to the topic of several complex variables at the 
Mittag-Leffler Institute in Stockholm, Sweden, and this volume contains the resulting 
survey papers and research papers. The work covers a broad spectrum of recent 
developments in the field. 

The contributors to the volume are H. Alexander; F. Almgren; E. Almar; M. Andersson; 
E. Bedford; J. Belanger; S. Bell; B. Berndtsson; U. Cegrell; C.H. Chang and H.P. Lee; 

J.P. Demailley; P. Dolbeault; A. Dor; F. Forstneric; B. Gaveau, M. Okada, and T. Okada; 

R. Greene and S. Krantz; A. Iordan, C. Laurent-Thiebaut and J. Leiterer; L. Lempert; I. Lieb 
and M. Range’ L. Qi-King; P. Manne; A. Noell; M. Passare; J. Riihentaus; J.P. Rosay and 
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INTERNATIONAL MATHEMATICAL OLYMPIADS 


The International Mathematical Olympiads, 
1959-1977 and 1978-1985 provide a com- 
pilation of problems and solutions of arrest- 
ing ingenuity; all accessible to secondary 
school students. The alternative solutions 
are particularly interesting because they 
show that there are many ways to solve a 
problem. 


INTERNATIONAL MATHEMATICAL 
OLYMPIADS, 1959-1977 


Compilation and solutions by S.L. Greitzer 


116 problems 

204 pp., 1978, ISBN 0-88385-627-1 
List: $11.50 MAA Member: $9.50 
Catalog Number NML-27 


INTERNATIONAL 
MATHEMATICAL 
OLYMPIADS 1959.1¢ 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, N. W. 
Washington, D. C. 20036 

(202) 387-5200 


INTERNATIONAL MATHEMATICAL 
OLYMPIADS , AND FORTY 
SUPPLEMENTARY PROBLEMS, 
1978-1985 


Compilation, solutions, and 40 additional 
problems by Murray S. Klamkin 


88 problems in all 

150 pp., 1986, ISBN 0-88385-631-X 
List: $12.95 MAA Member: $10.50 
Catalog Number NML-31 


This sequel of problems used in the In- 
ternational Mathematical Olympiad cer- 
tainly matches the description ’meant 
to challenge.’ The problems contained 
in this publication are the most chal- 
lenging | have seen. For the seri- 
ous problem solver, this is an excellent 
collection of truly challenging problems 
designed to give instruction in the use 
of sophisticated methods and modes of 
attack. 


Richard J. Wylie in The Mathematics Teacher 


Both international Olympiad books avail- 
able as a package: 


List: $21.00 MAA Member: $17.50 
Catalog Number IMO 
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Now there's a better way to teach 
algebra and calculus in the clags- 
room. In fact, two ways. 


First, introduce your students to 
the HP-28S,. It’s the only calcula- 
tor that offers symbolic algebra 
and calculus, 


of teaching by raising 


your overhead. 


Then, introduce yourself to the 
overhead display for the HP-28S. 
It allows you to project your 
calculations on an HP-28S for 
everyone in the classroom to see. 


A scholastic offer for you. 


If your department or students 
purchase a total of 30 HP-28S 
calculators, we'll give you a class- 
room overhead display for the 
HP-28S absolutely free. (A $500 
retail value.) Plus, your very own 


HP-288 calculator free. (A $235 
retail value.) 


To learn more, and get free cur- 
riculum materials, call (503) 
757-2004 between 8am and 3 pm, 
PT. Offer ends October 31, 1989, 


There is a better way. 
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Calculator Support, Hewlett-Packard, 1000 NE Circle Bivd., Corvallis, OR 97330 


USA MATHEMATICAL OLYMPIADS 


Every year 100 of the most mathemati- 
cally talented high school students in the 
country compete in the USA Mathemati- 
cal Olympiad (USAMO). The USAMO is 
the third stage of a three-tiered mathe- 
matical competition for high school stu- 
dents in the United States and Canada 
that begins with the AHSME taken by over 
400,000 students, continues with the Amer- 
ican Invitational Mathematics Exam involv- 
ing 2,000 students, and culminates in the 
100-contestant USAMO. 


USA MATHEMATICAL OLYMPIADS 
1972-1986, PROBLEMS AND 
SOLUTIONS 


Compiled by Murray S. Klamkin 


People delight in working on problems “be- 
cause they are there, ” for the sheer plea- 
sure of meeting a challenge. This is a 
book full of such delights. In it, Murray 
S. Klamkin brings together 75 original USA 
Mathematical Olympiad (USAMO) problems 
for years 1972-1986, with many improve- 
ments, extensions, finger exercises, open 
problems, references and solutions, often 
showing alternative approaches. The prob- 
lems are coded by subject and solutions 
are arranged by subject as an aid to those 
interested in a particular field. Contains 
a glossary of frequently used terms and 
theorems, and a comprehensive bibliogra- 
phy with items numbered and referred to 
in brackets in the text. The problems are 
intriguing and the solutions elegant and in- 
formative. Students and teachers will en- 
joy working these challenging problems. In- 
deed all those those who are mathemat- 
ically inclined will find many delights and 
pleasant challenges in this book. 


180 pp., 1988, ISBN-0-88385-634-4 
List: $13.50 MAA Member: $12.50 
Catalog Number NML-33 
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NEW FOR 1991 


Being first is almost second nature at McGraw-Hill, Whether your students are enrolled in developmental 
math, precalculus, or calculus, McGraw-Hill's texts will inspire your students to learn. 


DISCOVERING CALCULUS WITH THE HP-28s AND 


THE HP-48sx 
Smith/Minton 

a OMETRY SKILLS WITH DISCRETE MATH AND ITS APPLICATIONS, 2/e 
Streeter /Hutchison/Hoelzle Rosen 
BEGINNING ALGEBRA, Form B PPLE ANS ee 
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INTERMEDIATE ALGEBRA, Form B 

INTRODUCTION TO PARTIAL DIFFERENTIAL 
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COLLEGE ALGEBRA WITH TRIGONOMETRY Pinsky 
Rees/Sparks/Rees FUNCTIONAL ANALYSIS, 2/e 
PRECALCULUS: A Bridge to Calculus Rudin 
Runyan/Runyan MATRIX THEORY 
EXPLORATIONS IN CALCULUS WITH A Goldberg 
COMPUTER ALGEBRA SYSTEM BRIDGE TO ABSTRACT MATHEMATICS, 2/e 
Small /Hosack Morash 


For more infermation, please contact yeur local McGraw-Hill 


representative or write: McGraw-Hill College Division, Comp Processing 
and Control, P.O. Box 448, Hightstown, NJ 08520. 


New... 


Writing Mathematics Well, by Leonard Gillman 


64 pp., 


Catalog Number - WMW 


1987, ISBN-0-88385-443-0 


Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put into 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
the opportunities for better and more 


List: $6.00 MAA Member: $4.50 


forceful writing. It does this both by pre- 
cept and by example. 

A book to be read for its sharpness 
and wit as well as for enlightenment, Writ- 
ing Mathematics Well should be on the 
shelf of anyone who writes or intends to 
write mathematics. It will amuse and 
delight the already careful writer and it will 
help reform and refine the sensibilities of 
those who may be somewhat careless 
about their writing. 


Order from 

The Mathematical Association 
of America 

1529 Eighteenth St. NW 
Washington, D.C. 20036 
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Carus Mathematical Monograph No. 20 


THE GENERALIZED 
RIEMANN INTEGRAL 


Robert M. McLeod 
275 + xiii pages. Hardbound. 


List: $24.95 MAA Member: $22.00 


Vues 


Until recently the most powerful and beautiful tools of integration theory have been 
accessible only to the privileged few whose studies extended through the Lebesgue 
integral. Now a new integral, a generalization of the familiar Riemann integral, has been 
discovered which has all the power and range of the Lebesgue integral but which can 
be readily understood by anyone who has studied calculus through the sophomore level. 

The Generalized Riemann Integral is the first book-length presentation of this 
exciting new development in integration theory. Because of the clarity and organiza- 
tion of the exposition and the inclusion of exercises designed to actively engage the 
reader in the material, it is eminently suitable for use as a textbook at the advanced 
undergraduate or beginning graduate level. Furthermore, because it presents within 
a single volume results which were previously scattered throughout many research 
publications, it will undoubtedly also be of considerable interest to specialists in inte- 
gration theory. 
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List of Contents: Definition of the Generalized Riemann Integral ¢ Basic Properties of the Inte- 
gral ¢ Absolute Integrability and Convergence Theorems ¢ Integration of Subsets of Intervals ¢ 
Measurable Functions ¢ Multiple and Iterated Integrals ¢ Integrals of Stieltjes Type « Compari- 
son of Integrals « Appendix: Solutions of In-Text Exercises 
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When you asked for a 
sensible, no-frills, 
mainstream calculus 
textbook at a reduced 
cost for your students, 
we heard you. 


CALCULUS 
One and Several Variables 


ROBERT ELLIS and DENNY GULICK 
both of University of Maryland 
Hardcover, 1991, Just Published! 


SINGLE-VARIABLE CALCULUS 
WITH DISCRETE MATHEMATICS 
JOHN A. FEROE and 

CHARLES STEINHORN 

both of Vassar College 

Hardcover, 1991 


ELEMENTARY LINEAR 
ALGEBRA WITH APPLICATIONS 
Second Edition 

RICHARD O. HILL, JR. 


Michigan State University 
Hardcover, 1991 
Available in November for spring classes! 


ABSTRACT ALGEBRA 


DENNIS KLETZING, Stetson University 
Hardcover, 1991 


FINITE MATHEMATICS 


WILLIAM OWEN and FRED CUTLIP 
both of Central Washington University 
Hardcover, 1991 
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CALCULATOR ENHANCEMENT 
FOR SINGLE-VARIABLE 
CALCULUS 


JOHN KENNELLY and 
JAMES H. NICHOLSON 
both of Clemson University 
1991, Just Published! 


CALCULATOR ENHANCEMENT 


‘FOR MULTIVARIABLE 


CALCULUS 


JAMES A. RENEKE, Clemson University 
1991, Just Published! 


CALCULATOR ENHANCEMENT 
FOR DIFFERENTIAL EQUATIONS 
THOMAS G. PROCTOR 


Clemson University 


1991, Just Published! 


CALCULATOR ENHANCEMENT 
FOR LINEAR ALGEBRA 


DON LA TORRE, Clemson University 
1991, Just Published! 


CALCULATOR ENHANCEMENT 
FOR INTRODUCTORY 
STATISTICS 


IRIS FETTA, Clemson University 
1991, Just Published! 


HARCOURT BRACE JOVANOVICH, INC. 


College Sales Office 
7555 Caldwell Avenue, Chicago, IL 60648 
(708) 647-8822 
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Springer for Mathematics 
Celebrating the 75th Anniversary of the MAA 


G.A. Edgar, The Ohio State University, 
Columbus, OH 


Measure, Topology, and 
Fractal Geometry 


Intended to provide the mathematics necessary for 
the study of fractal geometry. It includes back- 
ground material on metric topology and measure 
theory. It covers topological dimension and fractal 
dimension (in particular Hausdorff dimension). It 
contains a complete discussion of self-similarity, 
and the more general “graph self-similarity.” 
1990/248 pp., 96 illus./Hardcover/$29.95 

ISBN 0-387-97272-2 

Undergraduate Texts in Mathematics 


J. Stillwell, Monash University, Victoria, 
Australia 


Mathematics and Its History 


This concise, unified view of mathematics and its 
historical development identifies reasons for the 
emergence of the main fields of modern mathe- 
matics, and explains connections between them by 
tracing the course of a few mathematical themes 
from ancient times through the 20th century. 
Students can expect to add to their mathematical 
knowledge as well as gain a new perspective on 
what they already know, and those wishing to 
explore the classics for themselves will benefit 
from Stillwell’s use of modern terms to express his 
interpretation of classical mathematics. 

1989/371 pp., 163 illus./Hardcover/$49.80 
ISBN 0-387-96981-0 

Undergraduate Texts in Mathematics 


To Order: 

Call Toll-Free: 1-800-SPRINGER (in NJ call 201- 
348-4033), or send payment plus $2.50 for postage 
and handling to: Springer-Verlag New York, Inc., 
Attn: S. Klamkin - Dept. S563, 175 Fifth Avenue, 
New York, NY 10010. NY, NJ and CA residents 
please include state sales tax. 


Call or write for information on textbook 
examination copies! 
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S. Lang, Yale University, New Haven, CT 
Undergraduate Algebra 


Second Edition 


Written in the author’s clear style, this new second 
edition includes some additional topics, such as 
Sylow groups, Mason’s theorem on polynomials 
and the analogous abc conjecture over the integers, 
and expands on other topics, including symmetric 
polynomials and field theory. Many new exercises 
and concrete examples of major unsolved problems 
in algebra, written in a language accessible to 
undergraduates, have been added to illuminate the 
vitality of mathematics. 

1990/app. 368 pp., 3 illus./Hardcover/$39.80 
ISBN 0-387-97279-X 

Undergraduate Texts in Mathematics 


K.L. Chung, Stanford University, Stanford, CA 
Elementary Probability Theory 


with Stochastic Processes 
Third Edition, 3rd Printing, 1990 


Here is an introductory text to probability and 
some of its applications. The content is adequate 
for a self-contained elementary course for 
undergraduates or aS a prerequisite to more ad- 
vanced courses on the subject and allied fields such 
as Statistics, operations research, and engineering. 
1979/325 pp., 36 illus./Hardcover/$39.95 

ISBN 0-387-90362-3 

Undergraduate Texts in Mathematics 


M.A. Armstrong, University of Durham, 
England 


Basic Topology 
3rd Printing, 1990 


From the reviews: 
‘The book is very good, its material sensibly cho- 
sen for an undergraduate course, and with a clear 
exposition. It has good and plentiful illustrations - 
for the author, topology is above all a geometrical 
subject - and many exercises.” 

— Mathematical Gazette 
1983/250 pp./Hardcover/$34.00 
ISBN 0-387-90839-0 
Undergraduate Texts in Mathematics 
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Special Geometry Issue 


The American Mathematical Monthly is pleased to present, in this issue, a 
tribute to geometry, one of the oldest-newest parts of our subject, in the form of 
six articles by some of its leading practitioners. 

The poem below, which was part of Professor Chern’s article “What is 
geometry?,” is displayed here because it beautifully sets the tone of the issue. 


—Herbert S. Wilf, for the editors 


A Poem 


Physics and geometry are one family 

Together and holding hands they roam to the 
limits of outer space. 

Black hole and monopole exhaust the secret of myths; 

Fiber and connection weave to interlace 
the roseate clouds. 

Evolution equations describe solitons; 

Dual curvature defines instantons. 

Surprisingly, Math. has earned its rightful place 

for man and in sky; 

Fondling flowers with a smile—just wish 

nothing is said! 


(translated by Harry Wang, from the original on p. 686.) 
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Convexity 


MARCEL BERGER, IHES, Bures-sur-Yvette, France 


MarceL BerGeEr: Born Paris, France, 1927. After Ph.D. on “Holonomy 
groups of Riemannian manifolds” under A. Lichnerowicz, was professor at 
the universities of Strasbourg, Nice and Paris 1955-1974. From 1974-1985 
Director of Research with the CNRS (Centre National de la Recherche 
Scientifique). Since 1985, Director of the IHES (Institut des Hautes Etudes 
Scientifiques). Visited a full year MIT 1956-57 and University of California, 
Berkeley 1961-62. President of the French Mathematical Society 1979-81. 
Corresponding member of the French Academy of Sciences since 1982. 
Rademacher lecturer (Univ. of Pennsylvania) 1981. Editor and managing 
editor of various mathematical journals, managing editor of the yellow 
Springer Grundlehren des Mathematischen Wissenschaften. Published around 
45 papers on Riemannian geometry and three books (all with Springer): 
Geometry I and IT, (with Gostiaux) Differential Geometry: Manifolds, Curves 
and Surfaces, and (with Berry, Pansu and St. Raymond) Problems in 
Geometry. 


Convexity, as we shall see, is a very old topic which can be traced at very least to 
Archimedes. It has more or less always been in favor, and now it is making a very 
strong comeback. This can be attributed in part to the rise of linear programming 
and the computer era starting from the ’60s. But the geometric method in analysis 
has come up with wonderful results including some spectacular ones on convex 
bodies. 

At the same time, convexity is an extremely simple and natural notion. So we 
think the reader will readily appreciate what follows. Interesting in itself, it will 
also illustrate some facts about mathematics, facts that are more or less classical, 
but always important to realize, paradoxical though they may be. First, questions 
or problems arise that are very simple to formulate (as in number theory) but to 
which the answers are either still unknown or have been found only very recently, 
often using very hard techniques from other parts of mathematics. The second fact 
is that for elementary geometric problems formulated in our ordinary 2 or 
3-dimensional space, one is forced to use abstraction and, among other things, to 
“so to the infinite” or to higher dimensional spaces. Finally, intuition is sometimes 
quite misleading. 

Because of the restricted length of this article, I cannot be exhaustive. Indeed I 
had to select only a few topics. Selection was based on naturalness and simplicity, 
my own taste, and illustration of the facts I have just mentioned. Standard topics 
which really could not be included receive a passing mention in the last section. 
The material has been organized as follows. 


Convexity is a natural notion; historical examples 

Rigorous definitions; examples 

The John-Loewner ellipsoid; applications 

Convex functions; examples and applications 

. Polytopes: four “elementary” problems 

. Two algebraic operations on the set of all convex bodies: duality and addition 


ANRYWNE 
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7. Topology in the set of all convex bodies: intuition is dangerous 
8. A brief look at some topics in convexity 


To prepare the present text I received help from, among others, Peter Gruber 
and Vitali Milman. It is a pleasure for me to thank them here. 


1. Convexity is a natural notion; historical examples 
In art, the words concave and convex are commonly used, as shown in the 
following comment on a modern sculpture: 


This sculpture reflects the influence of Cubism in its optical juxtaposition of 
interchangeable concave and convex forms, and in its use of a void to express 
a mass (from Treasures of the Israel Museum, Jerusalem, Geneva, 1985). 


eG 
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Woman Combing Hair (Bronze)— Alexander Archipenko (1887-1964) 
(Courtesy the Israel Museum, Jerusalem) 


The same applies to anatomy textbooks: 


Menisci or half-moon fibro-cartilages. Arranged in this manner, the glenoid 
cavities do not adapt to the femur condyles. They fit together by means of meniscl 
or half-moon fibro-cartilages placed between the tibia and the femur. 
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Fibro-cartilages, like glenoid cavities, are external or internal. Each is a 
triangular prismatic blade curved into a crescent. 

They are seen to possess: a concave upper face, connected with the femur 
condyles; an underside, in contact with the periphery of the corresponding glenoid 
cavity; an outer or peripheral face (the base of the prism), which is convex, very 
thick, adhering to the articular capsule; an inner or central edge, which is 
concave, trenchant, and whose concavity faces the centre of the glenoid cavity. 
... (translation from a classic French book on anatomy by Rouviére). 


Note that convexity and concavity appear to be such commonsense notions that 
neither is defined in texts on art or anatomy! 

Archimedes (circa 250 B.c.) explicitly stated that the inner curve of the figure 
below is shorter than the outer one, if the inner one is convex. This is obviously 
false if it is not. 


Poinsot (circa 1800) looked into convexity when he studied statics, stating, for 
example, that to ensure stability for the table below one needs the vertical line 
through the center of gravity to intersect the supporting plane inside the convex 
envelope (see section 2) of the set made up by the legs of the table. 


Fourier also studied statics around the same time and was led to study 
simultaneous linear inequalities like these: 


useless 


He was smart enough to realize the need to determine which ones were really 
relevant (see section 5). This was the origin of linear programming which began to 
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develop in the ’60s. It consists for example in asking: 
Find the maximum of y = x, — x, + 2x, if 


X, + X_ + 3x, +%, <5 
X, +X, —- 4x, <2 
x, => 0. 


A large variety of computer programs are available to solve such problems. The 
remark announced above is that most of the interest in linear programming 
(through computers) lies in working with very large numbers of inequalities and 
variables. Hence the need (by no means a mathematician’s luxury) to work and 
develop intuition in high-dimensional spaces. 

Back in the 1720s Newton was already using convexity in a basic manner to 
solve the problem of finding the local shape of a real algebraic plane curve at a 
singular point, however complicated. His solution was quite complete, as the 
following example shows: 


2y° + y4x3 — Ty4x? + 3y3x? — y2x4 — Sy2x + wet + 2x° = 0. 
The singularity to be studied is at the origin (0, 0). If this is not the case adjust it by 
suitable translation of coordinates. Now put a dot at (m, n) in the lattice N x N of 


integral points in the plane R* for every Am nx” y” in the equation defining the 
curve, for which a,, , # 0: 


Then draw the convex envelope (paying attention only to the west and south parts) 
of this set of dots, thereby obtaining a number of segments. For each segment, 
extract the two terms that correspond to the end points of that segment from the 
original equation of the curve. Draw each of these two-term curves locally. 
Newton’s theorem is that, whatever the other dots, the union of the preceding 
curves drawn gives the local shape of the total curve. See [14], [36] for more details 
and the general theory. 


2y° — 5y*x =0 x? — 5xy? =0 
4 4 
2y? —5x=0 x4 — Sy? =0 
4 4 
5x = 2y? x? = JSy and x2 = —ySy 


(x? — 5xy? + 2y> = 0) 


three branches 
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2. Rigorous definitions; examples 


In chapters 2 to 6 we will work exclusively in the linear space R% = 
{x = (x,,...,X,): x; © R}. According to the nature of the problem that we will be 
studying, we consider it as endowed or not with its canonical Euclidean metric 
d(x, y) = V(x, _ yi) ++ + (xy - Ya) 

A subset K of R@ is said to be convex if for every x and y in K the segment 
[x, y] of R¢ whose ends are x and y is all contained in K:[x, y] C K. The pictures 
below show convex and nonconvex sets. Because the open disk x? + y* < 1, plus 
any part of its boundary (the circle x* + y” = 1) is convex, we will always work 
with open or closed convex sets. Note that things are different for the square. 
Recall, if need be, that a subset K of R® is said to be open if any of its points x is 
the center of an open ball B(x, r) = {y: d(x, y) < r} (r > 0) entirely contained in 
K. It is said to be closed if its complement R% \ K is open. It is equivalent to say 
that the limit x of every convergent sequence {x;} is still in K. 


nonconvex convex nonconvex 


- | 
> E 
LD ~ 7% 


Three immediate properties of convex sets are the following: 

The first was already known to Archimedes: if K is a plate in R? or a body in 
R°, of some material, then it contains its center of mass (center of gravity). Note 
that constant density is not required. 

The second property: look at the two following metrics on a subset K of R%. 
The first, dp is the so-called induced one (from the Euclidean structure of R2) 
dpa(x, y) = d(x, y) for any x,y © K and the Euclidean metric d on R*%. The 
second, called intrinsic, denoted by d,, is defined as the infimum d,(x, y) of the 
length in R® of all curves from x to y which lie entirely within K. Then it is easy 
to see that K is convex if and only if d, is identical with dp. 

The third property belongs to the realm of algebraic topology, which studies the 
properties of objects that depend only on maps preserving the topology, in 
particular those properties that are invariant by continuous deformation. The fact 
is that all open convex sets are equivalent, in particular equivalent to R@ itself. So 
for an algebraic topologist convex sets are of no interest. The proof is simple: build 
a continuous map K — R?@ by picking any x € K and stretching any non-infinite 
ray from x in K to the infinite associated ray in R“. 


K 


N 
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The following explicit examples of convex sets are basic: 
(i) Each of the two half-spaces (open or closed) defined by an affine hyperplane 
{namely the set given by an equation like 


d 
, 4;x; = b, 
i=1 
where the a,’s are not all 0) is convex. Half-spaces are the building blocks of 
convexity, since it is classical and not hard to prove that every closed convex set is 


the intersection of the closed half-spaces which contain it (see section 6). Moreover 
one can always achieve it with a denumerable family. 


i 
pom 


(ii) The full (closed) ellipsoids 


Hl 


It is very important to remark that—when no Euclidean structure is imposed on 
R4—all ellipsoids are equivalent (affinely). They are equivalent, in particular, to 
the standard closed ball D4_ ,x? < 1. 


Gii) A closed parallelepiped is a set which can be written (after translation) as 
{(x,,...,Xq): [x,| < 1 Wi = 1,..., d} in a suitable basis of R“. They are all equiva- 
lent—affinely—to the standard cube |x,|< 1 Gi = 1,...,d) (the coordinates are 
the standard ones of R%). But as soon as metric considerations are imposed, then 
parallelepipeds can be very different. 


(iv) Any intersection (not even necessarily countable) of convex sets is convex. 
Hence, it makes sense to speak of the smallest convex set containing a given set 
A CR?@ and to call it the convex hull of A. It will be denoted by conv(A) in the 
sequel. Polytopes are the convex hulls of finite sets. Note that not all points are 
necessary; the really necessary ones are called the vertices, or extreme points, of 
the polytope. 
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(v) The sets |x|* + |y|* < 1 in R? are convex for every real number A > 1. The 
proof is not obvious, but see section 4. 


3. The John-Loewner ellipsoid; applications 


We present here a result that is both simple—though nontrivial—and extremely 
powerful. It was found independently by F. John (studying mechanics) and 
C. Loewner (studying complex variable maps), both in the 40s. (The case of d = 2 
goes back to Behrend in the ’30s.) It says that given any bounded set A (with 
nonvoid interior) in R%, there exists one and only one ellipsoid E of minimal 
volume containing A (recall that ellipsoids are centered at the origin). 


UT 


U) 


The volume of an ellipsoid 04_,(x?7/a?) < 1 is to be understood in the elementary 
sense, namely as the canonical measure on R“@ (with the Euclidean standard 
metric). It is given by 


Vol 


I 


» x < 1 = [TT vor Tx? < 1} 


i-1 4i i=l 


This will follow for example from the determinant change of variable rule in 
integration theory. We denote by B(d) the volume of the standard ball DL“ ,x? < 1. 
The value of this important function can be found in some calculus books (see [6]) 
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namely 
ak 
B(d = 2k) = aT 
Qk+1ak 
d=2k +1) = -—-———_ 
BC 1°3-5-++ =(2k +1) 
or in one shot 
gt /2 
B(d) = —~—~ 
r(“ + | 


if you know the [’-function. 

If you have never thought about it, study the behavior of B(d) as d increases. 
Compute in particular the first value of d for which B(d) < 1 (the volume of the 
cube of side equal to 1). An asymptotic evaluation of B(d) is important (see 
section 6). From Stirling’s formula one gets 


ore (d+2)/2 
| asd > o, 


B(d) ~ constant - | 


It also permits one to compare B(d) with the volume, equal to 2%, of the cube 
circumscribed by the unit ball. See an interesting application at the end of 
section 5.A. 

Let us come back to the John-Loewner assertion. Existence is an easy compact- 
ness argument. Just be careful to avoid the degeneracy of ellipsoids. This is 
guaranteed precisely by the non-void interior condition. To prove uniqueness 
argue by contradiction. Show that, given two distinct ellipsoids of the same volume, 
there exists a third one of smaller volume which contains their intersection. 
Simplify the computation by using the reduction of positive quadratic forms to 
simultaneous diagonal forms. Another proof is given in the next section. 


The John-Loewner ellipsoid is widely used nowadays in the study of convex sets, 
for both pure and applied mathematics, see [19]. For our purpose let us mention 
three applications to various fields. 

The first application is to the theory of quadrics (conics for the plane). An 
ellipsoid E has the property (called diametrical) of admitting an affine hyperplane 
symmetry for any direction 6 of lines in R%. That is to say there exists a hyperplane 
H associated to 6 such that the hyperplane symmetry defined by the pair (6, H;) 
leaves E invariant. Are the ellipsoids the only subsets A of R% enjoying this 
property for any direction of line? The answer is easily seen to be yes if we use the 


658 MARCEL BERGER [October 


John-Loewner ellipsoid E of the set A. A bare-handed proof—you should try 
it—is already extremely involved (and not very enlightening) when d = 2. For 
d = 3, it was done by Bertrand and by Brunn. 


tp 
WEF 


The second application is in geometric-group theory: it states that every compact 
subgroup G of the full linear group GL(d;R) of R*% leaves invariant some 
Euclidean structure on R%. Take any point x # 0 in R¢% and introduce the John 
ellipsoid of its orbit G(x). This ellipsoid yields the desired quadratic form; the 
uniqueness is of course basic. Note more generally that the compact group we are 
studying can be taken in the full group Aff(d; R) of all affine transformations of R@ 
(we permit translations). For by the Archimedes result of our introduction, the 
center of mass of the orbit G(x) is invariant under G so we can take it to be the 
origin 0. Properly rephrased, the above existence result is group theoretical: two 
maximal compact subgroups G,G’' of GL(d;R) are necessarily conjugate, i.e., 
there exists g in GL(d;R) such that G’ = gGg™!. This is actually a special case of 
a general result of Elie Cartan to the effect that this conjugacy of compact maximal 
subgroups is valid in any Lie group. 

The third application is in fact what F. John had in mind. Call a convex body a 
non-void-interior compact set of R*. Then for every convex body K of R¢% which is 
symmetric around the origin (Vx € K then —x € K) there exists an ellipsoid E 
such that E C K CAK and A < yd. The bound is clearly optimal as shown by the 


Oo 


Again the proof works by contradiction. Using duality (see section 6 if necessary) 
introduce the ellipsoid of maximal volume contained in K and think of it (after a 
suitable linear transformation—remember (ii) in section 2) as the unit sphere 


in R2. 
x 
K 
—_—— —_——F 


1990] CONVEXITY 659 


Assume there exists an x € K with ||x|| > /2. By the symmetry assumption K 
will contain the domain shaded above. Elementary calculus (again use a reduction 
to the standard cube) shows that the sphere inscribed in that cube is contained in 
K and has a larger volume than that of any inscribed ellipsoid. 

It is worth mentioning the yd -squeezing property for two reasons. 

The first is the Banach-Mazur metric structure on the set of all symmetric convex 
bodies in R¢ which is defined as follows: for two convex bodies K, H (symmetric in 
the origin) call A the smallest number for which there exists a linear transforma- 
tion f of R? such that f(H) CK Cf(AH). Then d(K,H) =A is called the 
Banach-Mazur distance (it is in fact a so-called multiplicative distance and strictly 
speaking after dividing the set of convex bodies by the set of linear isomorphisms). 
John’s result is now simply that the distance between any symmetric convex body 
and the unit ball is always < logyd . It says that a Banach space structure on R@ is 


never too far from a Euclidean structure! 
aN fH) 


iL 


The second reason has to do with the very simple question: what is the greatest 
Banach-Mazur distance to the unit cube? This is a case (see the introduction) 
where the answer is not known today. The aim is to squeeze a symmetric convex 
body between two homothetic parallelepipeds as close together as possible. The 
value Vd is wrong. Szarek proved in 1987 that one needs at least Vd log d. The 
optimal bounds are still unknown. Two more comments. First, Szarek’s construc- 
tion of examples is not really explicit but based on probability theory. This 
technique is now widely used in convexity theory. The idea is that for a “general” 
convex body K the points of contact of K with the optimal cube are far from an 
orthonormal basis. On the contrary the angles are quite small. Second—a very 
general remark—for practical applications to theoretical harmonic analysis or to 
numerical analysis, it is more important to have asymptotic estimation when the 
dimension d goes to infinity than explicit values. In that direction, Szarek and 
Talagrand proved in 1988 that asymptotically the Banach-Mazur distance between 
the cube and any symmetric convex body is always of the order of d’/*. The 
conjectured order is Vd log d (up to some universal constant). See [33]. 


4. Convex functions; examples and applications 


The simplest notion to start with is that of a convex real valued function defined 
on an interval—say closed [a, b]—of the real line: 
f: [a,b] — R is said to be convex (resp. strictly convex) if: 
f(Ax + (1 —A)y) sAf(x) + (1 — A) f(y) (resp. <) 
Va<x<y<b, 0O<A <1. 
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Ax+(1-—A)y 


Otherwise stated: the graph of f is always below the chord segment joining every 
pair of its points. More simply: f convex is equivalent to requiring that the epigraph 
of f (namely the set {(x,y): x €[a,b], y = f(x)} CR’) is convex. A trivial 
induction yields 


i > A < A, f(x;) Vn Vx; € [a,b] WA, >0 with )) A, = 1. 
i=1 i=1 i=1 
Convex functions are necessarily quite regular: they everywhere admit a right and a 
left derivative which need not coincide, though the set at which they do not 
coincide can be at most countable. In particular they are almost everywhere (i.e., 
up to a set of measure 0) differentiable, in fact continuously differentiable. They 
also admit almost everywhere a second derivative f” => 0. Most important is the 
converse: if f” exists and is non-negative (resp., positive) everywhere then f is 
convex (resp. strictly convex). This easy result is tremendously powerful. Two 
standard and basic examples are 

Gi) f(x) = —logx on [1, o yields the inequality 


>», A,a; = [ [a> [2,20 yA; = 7 
i=1 i=] 


i=1 
in particular 


a, +++ +a,\" 
| |. 
n 
This is easy for n = 2, but from n = 3 on it is not so obvious. 
Gi) f(x) =x”, p = 1. After tricky (but not deep) manipulations, one gets the 
Holder inequality: 
1/p 1/q 1 1 
Div < (Lx?} (Ly? with - + —=1 
i i i Pp q 


and the Minkowski inequality: 
1/p 1/p 
[Dlx + yl?) < (Llu?) + (Lil?) 


The latter says that the set |x|* + |y|* <1 is convex in R’. This is not obvious, 
even for A = 4. 

The notion of convexity can be extended to numerical functions defined on a 
subset K of R*: 


1/p 


fle + (1—-Dy) <if(x) + A —-)fly) 
so that an obvious necessary condition for this condition to make sense is that K 
should be a convex subset of R“. General convex functions are also quite regular. 
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They always admit a differential almost everywhere, in fact they are almost 
everywhere of class C'. More: almost everywhere of class C’, i.e., they admit a 
second differential f’(x): R4 x R¢ — R which is there a positive quadratic form. 
The converse—analogous to the interval case—is also true, 

Convex functions have two basic properties, widely used in theoretical and 
applied mathematics. Their maximum is attained only at the boundary of their 
domain of definition, A strictly convex function admits at most one minimum. 
John-Loewner ellipsoid uniqueness might also be proved using the latter property. 
Identify ellipsoids in R* with quadratic forms (positive definite) and consider the 
set of all of them whose seat is in R“¢*!/?, This is done, for example, with the 
map 


td | 


q ji 

It is typical of how mathematicians build up successive levels of abstraction. The 

volume Vol(E(q)) of the ellipsoid associated with q is equal to det(q)~'/* - B(d), 

where det stands for determinant. Then check that the function g > det(q)~'/” is 
strictly convex. 

We will give ample consideration to the regularity of convex functions in section 

7, A basic example is given in sections 5 and 6, the Brunn-Minkowski inequality. 


5. Polytopes: Four “elementary” problems 


Recall that a polytope is the convex hull of finite points in some R‘; if d = 2 we 
call it a polygon (convex), and if d = 3 a polyhedron (convex), Its vertices are the 
really necessary points; any that are not useful should be thrown out. It is not hard 
to see (in section 6, for example) that a polytope is also a finite intersection of 
half-spaces, Conversely, one should add compactness. The faces (d — 1)- 
dimensional faces, to be exact) are the intersections of the polytope with the really 
necessary hyperplanes which define it. Such a face is a polytope in its hyperplane. 

By induction one defines the i-faces of a polytope (i = 0,1,...,d —-1). The 
(d — 1)-faces are the faces, the 0-faces are the vertices, the 1-faces are the edges. 
In dimension 3 this exhausts them. 


<> 


“the most popular 
polyhedron” 
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Examples of polytopes are parallelepipeds. The simplest are the simplices. A 
simplex is a polytope generated by the minimum number of points in R@ for it to be 
a convex body (in particular not contained in some hyperplane). This number is 
d+ 1. All simplices are affinely the same. Thus, not surprisingly, simplices are of 
basic utility (e.g., the simplex method of Dantzig in linear analysis). 


d = 2: triangle d = 3: tetrahedron 


Because of their simple definition and the everyday look they have about them, 
it is natural to expect that: 

a) everything about polytopes has been known for a long time 

b) everything about polytopes is easy to prove. 
In the light of our introduction, the reader will already have guessed that both 
statements are false. We will illustrate this with four topics. 


A. Hyperplane sections of the cube 


Consider the unit cube C = [— 3, $]@ in R® and cut it by hyperplanes. Which 


hyperplane cuts C with maximum volume ((d — 1)-dimensional measure, that is to 
say its volume for standard measure in the hyperplane where it stands, for the 
natural Euclidean structure)? 


For d = 2 the answer is ¥2, attained by either of the two diagonals. For d = 3 the 
smart reader will guess that the most wonderful section of the cube is the regular 
hexagon obtained by cutting it through the origin by a hyperplane orthogonal to a 
diagonal. But he will be wrong because its area is equal to 3V3 /4, whereas the 
section through two opposite diagonals of parallel faces has area V2 > 373 /4. It 
was conjectured for quite a while that V2 is the optimum (clearly attained) for any 
d. This was proved by K. Ball only in 1986, see [2]. The proof is by no means 
elementary. It is based on probability theory and, at the root of it lies the Fourier 
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transform, applied twice to get estimates of the volume of sections by hyperplanes. 
One then gets the following beautiful formula for the section of C by the 
hyperplane H given by the equation L¢_,a,x; = 0(0“,a? = 1): 


© sina,t sin a jt 


1 
volume(C 1 H) = — [ a 
TT “—o at A yt 
Then one finishes the proof with Hdlder inequality (under the integral form of the 
finite form given in section 4) and the fact that 
? v2 
dt< — ifp2>2 


1.0 
rl. ip 
and equality only if p = 2. This latter inequality is quite subtle to prove. 

An interesting consequence of Ball’s result is the very easy disproof of a (quite 
intuitive) conjecture of Herbert Busemann: if two convex bodies K and H (say 
symmetric in the origin) are such that the volume of the section of K by any 
hyperplane P through the origin is always smaller than the volume of the section 
of H by P, then the volume of K is smaller than the volume of H. This is trivially 
true when d = 2; it is open for d = 3,...,7. A quite involved counterexample was 
given by Larman and Rogers in 1975 for d = 12. But from Ball’s result it follows 
immediately that the conjecture is false for any d > 10 when K is the unit cube in 
R‘ and H the sphere in R® of radius so adjusted that its volume is equal to one. 


sin ¢ 
t 


Here is a small remark in the same spirit. Cut any symmetric convex body by 
parallel hyperplanes. You expect the volumes of these sections to be a monotone 
function, starting at a minimum when the hyperplane just touches the body and 
reaching a maximum through the origin. This is easy to prove with the Brunn- 
Minkowski inequality (see section 6), but however obvious it may seem, no 
elementary proof is known. Further: the Brunn-Minkowski inequality is equivalent 
to the fact that the 1/(d — 1) power of the volume of the parallel hyperplane 
sections of any convex body (symmetric or not) in R% is a concave function. It is 
only in dimension two that this is completely elementary. 

Note also that, although apparently obvious, it is not easy to prove that any 
hyperplane section through the center of a unit cube has a volume greater than or 
equal to one. It was proven by Vaaler only in 1979! 
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B. Combinatorics 


Let us denote by f, the number of the i-dimensional faces of a polytope T of R%. 
Here i runs from 0 to d — 1. Are there necessary (and sufficient) conditions for a 
given sequence (fy, f,,..., f,_,) of integers to be that of some polytope in R4? 
The case d = 2 is trivial: fy =f, is both necessary and sufficient. When d = 3 
Euler found in 1750 the famous necessary condition f, — f, + f, = 2 (which seems 
also to have been found—though none too explicitly—by Descartes). It is not hard 
to extend this relation to any d in the form f, —f, + °°: +(-1)7"'f,_, = (-1)%. 

It was only in the 1920’s that Steinitz found the set of necessary and sufficient 
conditions when d = 3, It reads: 


The inequalities were known to Descartes and Euler as necessary ones. For d > 4 
such a set is unknown. But in the 1980’s two basic results were obtained. They 
concern generic polytopes, namely the so-called simplicial polytopes. These are 
polytopes (in R“%) all of whose (d — 1)-faces are ((d — 1) — 1) simplices, Equiva- 
lently, their vertices are in general position, that is to say that there are never more 
than d of them belonging to the one affine hyperplane. They are completely 
flexible in the sense that every vertex can be moved a little bit when all the others 
are kept fixed. 

For simplicial polytopes a necessary and sufficient set of conditions is known, 
though only since 1980. It is quite complicated. The set of conjectured necessary 
and sufficient conditions was discovered by McMullen. The way the proofs go is 
most interesting. The sufficiency—due to Billera and Lee—is obtained by exhibit- 
ing the desired polytope from the very involved commutative algebra construction 
of a Cohen-Macaulay ring. The proof of the necessity, due to Stanley, is even more 
surprising. It is obtained (following an initial idea of Demazure) by associating with 
a convex polytope a complex projective algebraic variety and applying to it the 
so-called “hard Lefschetz theorem.” This story again illustrates the need to use 
complex number geometry even in order to solve real number problems. See 
[29], [32]. 


C. Can you move a polytope? Can you put it into a computer? 


Consider a polyhedron in R° and try to move (slightly) its vertices under the 
condition that the polyhedron remains of the same combinatorial type, i.e. it keeps 
the same number of faces, each face keeping the same number of vertices. More 
algebraically, during the move you want to respect the linear relation which exists 
between the vertices of the initial polytope. Of course you discard the trivial 
solutions such as an affine linear transformation of the whole space. The end of 
the last section shows that, for a simplicial polyhedron (i.e., one of which all the 
faces are triangles) the movability is maximum, every vertex can be moved (slightly) 
independently of the others. The problem begins to be interesting when one has 
faces with four or more vertices. In the figures below it will be seen that you 
cannot move only one vertex, like a, because this will force new faces into 
existence. For the dodecahedron, one way is to move the plane of a face slightly. 
But this is possible only because a dodecahedron is cosimplicial (see section 7), 
that is that when considered as intersection of half-spaces, only three planes meet 
at one vertex. But this trick does not work for polyhedron No. 3. It can be done by 
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turning the plane of the face abcd around a and c. Then the general situation 
seems a very complicated problem for a general polyhedron. In 1920 Steinitz 
proved a more general theorem about the general polyhedron, to the effect that 
some graphs can always be realized in R°® by a convex polyhedron. As a corollary, 
any convex polyhedron has sufficient freedom of movement to be approximated as 
well as required by a polyhedron all of whose vertices have rational number 
coordinates (call it a rational polyhedron). 


#2 


# 3 #4 


This is important because, once a coordinate system is fixed, a computer can 
recognize only rational polyhedrons when they are entered by giving the three 
coordinates of the vertices. This means that, for example, the affine linear 
relations between vertices have to be truly respected, not only approximated. To 
end with dimension 3, let us mention that we still do not know how to compute the 
effective degree of freedom when the combinatorial type is given. 

How about higher dimensions? The situation here is dramatically different 
because in 1967 Perles found a polytope in R® with twelve vertices which cannot be 
approximated by a rational one. The idea is roughly as follows. The arrangement 
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of faces will force, in the diagram below, 


the dotted points to be on lines as indicated. Now an elementary computation in 
the projective plane (using, if you know it, the notion of cross-ratio) shows that 
whatever the Euclidean metric is, we should have 


EA FB io 32 v5 34+ 75 
EB FA equa to Or . 


2 
But neither of these numbers is rational! See [22]. 

In 1987 Sturmfels found a polytope in R° with the non-rational approximation 
property. Whether this can be done in R° or even R’% is still an open question. The 
above shows, in particular, that you cannot implement in a computer every 
diagram exactly respecting the alignments. For the aficionados, a computer expert 
will tell you that there are other “theoretical” ways to enter such a diagram into a 
computer. 

A comment on rigidity is in order. For the above, the movability was combinato- 
rial, not metric. It was permitted to change the lengths of the edges (and the 
congruence of the faces). If, moreover, we insist, when moving a polyhedron in R°, 
that all the faces remain equal polygons (this is automatically true if all faces are 
triangles and if the edges are kept at fixed lengths) then Cauchy proved rigidity 
back in 1812, namely that only a global Euclidean displacement can achieve it. In 
fact there was a small gap in Cauchy’s proof that Hadamard and Steinitz filled in 
independently. Note that it is false for plane polygons as soon as they have at least 
four vertices. 

With a little bit of spherical geometry and induction on dimension, Cauchy’s 
theorem implies rigidity in any dimension larger than three. Cauchy’s proof is very 
subtle. See, for example [6]. 


D. Fillers 


Call a polygon P a filler of the Euclidean plane R’ if the whole plane can be 
filled up (‘tiled’) with congruent (equal) copies of P; the copies should moreover 
meet edge to edge. 

We first want to find the shape of all possible fillers. Any triangle and any 
quadrilateral will do. Then there are examples with pentagons and hexagons. The 
hexagonal case is completely understood, the pentagonal one not yet ((24)). 
Geometers (in fact crystallographers) have known since the end of the 19th century 
that no convex k-gon with k = 7 can be a filler. Convexity is of course required. It 
would seem that there is no completely elementary proof. The known proofs use 
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not only Euler’s formula but also a “going to infinity” argument. For fillers of the 
plane see Thurston’s text in the present issue, and the basic reference [24]. 


same for 
quadrilaterals 


no limit on the 
, number of edges! 
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CY 7 but not congruent tiles 
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We now look at R°. Call a filler a polytope P such that the whole space R? can 
be filled with congruent copies of P, still matching face to face. 

Here two surprises await us. The first is that there do exist extremely compli- 
cated fillers. Engel found one in 1980 with 38 faces and 70 vertices. Some faces are 
extremely small. This is surprising at first: any filler is surrounded by 38 others and 
some of the contacts are very tiny. The construction used a computer because of 
these small faces. It is based on the classical and important notion of what is 
known as the Dirichlet-Voronoi domains. We start with a lattice A in R° and 
define P by P = {x € R°®: WA € A: d(x, 0) < d(x, A)}. Then, by the very construc- 
tion, the polytopes {P + A: A € A} fill R°. Now Engel’s example is obtained by 
extending the Dirichlet-Voronoi technique as follows. We consider discrete groups 
G of isometries of R? which contain a lattice of translations, but in fact G can also 
contain rotations and be much bigger than only a lattice group. Then the Engel 
filler P is of the type 


P = {x € R°: Vg € G: d(x,0) < d(x, g(0))}. 
This P is in general called the fundamental domain of the group G. 


But there is another surprise: there is no known bound for the number of 
vertices or faces of an R° filler when it is not obtained by a discrete group of 
isometries techniques. We do not even know that it is bounded! See [23]. 


Two views of a 38-faced polytope with 70 vertices, discovered by Engel (1980) 
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6. Two algebraic operations on the set of all convex bodies: 
Duality and addition 


The typical evolution of mathematical understanding consists in reaching higher 
and higher levels of abstraction, by the building of structures. In order to study 
individual convex bodies, we are going, here and in the next section, to study the 
set of all of them (in a given dimension, say d) and to introduce algebraic and 
topological structures on this set. 

The duality notion, a very useful one in mathematics (think of the Fourier 
transform analysis, homology—cohomology, ...), will be introduced through the 
expedient of a practical problem, that of calculating the volume of a convex body 
by computer. Looking at polygons for curves in the plane it looks simple: 


Archimedes on 7r 


But in large dimensions we will meet surprises. 
A. Evaluating the volume of a convex body by computer 


Let us assume that a convex body K is described, in the computer, by a 
membership oracle: input a point x, and the oracle announces whether or not 
x EK. 


Lovasz found, in 1985, a polynomial time algorithm that will give for vol(K) a 
lower bound denoted by vol(K). The crucial problem lies in how good that 
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approximation is. Barany and Furedi, in 1987, show it to be a complete disaster. 
They prove that for an input which is polynomial in d, say of n = d* points, one 
cannot in general do better than 


vol( K ) d ay? 
vol(K) * | | 


2ac log d 


For the proof we consider the simplest convex body, the unit ball B%. It can be 
proved that the maximal volume of a polytope with n = d“ points contained in B%, 
if denoted by V(d, n), obeys 


V(d,n) 2ac log d 
ae < ——— 
vol(B“) — | d | 


The idea is roughly as follows. By Caratheodory’s theorem (see section 8.E) 
every point of the convex hull of n = d% points belongs to a simplex of R%. If the 
points are within distance 1 from the origin, subtle estimates can be obtained for 
the volume of such a simplex from some of its k-faces. Then one will have at most 
(") such situations to fill out the polytope and a 1/k! factor coming from the 
volume of a k-dimensional simplex. The subtlety now lies in a smart choice for k. 
The answer is to take k as the integral part of d/2log n. Then, because of the 
formulas given at the beginning of section 3 for the volume of the unit ball B? and 
(") = n!/k\(n — k)!, one concludes, after computations using the Stirling formula 
for evaluating p! when p is large, which asserts that 


d/2 


p! 
V2p7 (=| 
e 


A comment: ellipsoids are really the worst convex sets for the above inequality. 
This was proved in 1951 by Macbeath. When the number of points n is given, for 
convex bodies K in a given R%, the bound 


Sup( Volume of a polytope contained in K with n vertices) 
vol( K ) 
is minimum for ellipsoids. 
There is still 6ne hope. Try to estimate vol(K) from below by vol(K) with a 


membership oracle, and try to get an upper bound vol(K) with a membership and 
a separation oracle together, which are easy to implement. When the computer 
answers x ¢ K it will at the same time give you a half-space containing K: 
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Still according to Lovadsz, there is also a polynomial algorithm giving vol(K) = 
vol( K ). Moreover, 


vol( K) ; 
vol(K) <d*. 


This is bad for large ds. But one might hope to be smart enough in choosing points 
to get a good 


vol( K) 
vol( K ) 


This amounts to being able to play with points and half-spaces (or, say, affine 
hyperplanes). This is precisely what the classical duality in Euclidean spaces 
achieves for us. The final result will be given in C below (after this duality has 
been explained in B). 

But before we end with a great classic of convexity, let us recall that Minkowski 
showed just before 1900 that any convex body admits at least one supporting 
hyperplane H (affine here) for any point x of its boundary: 


K 


H 


namely, K lies entirely in one of the two closed half-spaces determined by H. In 
most books this is called the Hahn-Banach theorem because Hahn and Banach 
proved it, much later, in the infinite-dimensional case. This basic result for the 
study of convex sets shows in particular what we announced in section 2: a closed 
convex set is the intersection of the closed half-spaces which contain it. 


B. The Euclidean duality 


We consider R¢ with its standard Euclidean structure. The duality between 
points of R@ and its affine hyperplanes is geometrically defined as follows: 


H(x) 


To x # 0 associate the line 0x and call the dual of x the hyperplane H(x) which 
is orthogonal to 0x and cut it at the point x’ such that 0x - Ox’ = 1. Conversely, x, 
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clearly unique for any hyperplane H not containing 0, is called the dual (the pole) 
of H. Check for yourself in the plane what the dual of the intersection of two lines 
is in R?. Algebraically, things are much simpler: 

H(x) = {y © R*%: (xly) = 1} 
where (-|- ) denotes the usual scalar product. 

The polar (reciprocal) convex body K* of a given convex body K is either the 
convex hull of the poles of the supporting hyperplanes of H or the intersection of 
the half spaces bounded by the hyperplanes which are the dual of the points of K 
(of course you need to use the boundary points of K). This duality is excellent if K 
is a convex body that contains 0 in its interior. From now on only such K’s will be 
considered. Then (K*)* = K. 

Examples are: 


(i) the unit ball B? is its own dual 
Gi) ellipsoids have ellipsoids as duals; check that then 


vol( E)vol( E*) = (vol( B4))° = B*(d) 


(iii) the dual of a cube is called a cross polytope. Precisely the dual of [—1, 1]* = 
K is K* = convex hull of the point +e, (where {e,} is the standard base). 
Note that a cube has 2% vertices, the cross polytope only 2d. Note here 

d d 


2 
*\ — 9d _— 
vol( K )vol( K*) = 2 a a 


(iv) more generally the dual of a polytope is a polytope, the duality exchanging 
vertices of one with faces of the other. 
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C. Back to estimating volumes: the invariant vol( K ) vol( K*) 


Here we work with convex sets K symmetric in the origin. The picture and the 
way Lovasz obtains vol(K ) and vol(K ) show that if we want to compute vol( B“) by 
his method, we have to estimate — 


vol(B¢) — vol(K*) 


x —_~ > —___, 
vol(B“) ~ vol(K) 


where K is one of the polytopes contained in B“ as in the figure in subsection A. 
We rewrite the right-hand quotient of ® as 


vol( B“) \* 


vol( K ) 


ft 
(vol(B*))” 


The last two terms are estimated above. The plan is to work with a smart K, that is 
to approximate B% by K such that vol(K)vol(K*) is very small. 

Note that vol(K )vol(K *) is an invariant of the linear shape of K only because 
linear transformations will affect vol(K) by det f when it affects vol(K*) by 
(det f)~'. It was conjectured by Mahler that for any d and any K: 

4d 
B?(d) > vol( K )vol( K*) > 77h 


vol( K )vol( K*) 


the left inequality being characteristic of ellipsoids and the right one of cubes or 
cross polytopes. 

Assume Mahler’s conjecture on the right side. Then the hopes mentioned above 
are dashed. Because then always 


vol( K*) d d 
vol( K ) = | | 


ae log d 


using the above results. 

As to Mahler’s double conjecture, the situation today is the following. The left 
bound is true and characterizes the ellipsoid. The bound was proved by Blaschke 
for d = 2,3 and by L. Santalo (with some restrictions) for any d. The fact that 
equality characterizes the ellipsoid for any d was settled only by Saint Raymond in 
1981. The exact right-hand bound is unknown today even for d = 3. It is known 
only for d = 2 (try it, yourself to feel the difficulty there already is for d = 2). In 
1985, J. Bourgain and V. Milman showed that the order of magnitude conjectured 
by Mahler is the right one. There exists c > 0 

x: 
vol( K )vol( K*) = 7 VK Vd. 


So the error on computing volumes will have to be as bad as 


d \4 
licea| 


Thus “it is impossible to compute the volume with a decent error in polynomial 
time.” In other words a computation of volume applicable to any convex body 
should be of another type. 
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The Bourgain-Milman proof is quite involved. It uses probability theory and fine 
estimates, see [8], [5]. 

Note that the Mahler conjecture is useful in a completely different context, that 
of simultaneous approximations of real numbers by rationals, see [17], page 31. 


D. The Minkowski addition 


For K, H convex sets of R%, the set defined as 
K+H={x+y:x€K, ye H} 


is still convex. 


disk K 
A 
radius ¢ 


A+ K = {x: d(x, A) < &} 


Note that if you change the origin in R¢ the new K + H will differ from the old 
one by only a translation, so it can be considered as well-defined in shape. The 
notion is then affine. More generally, AK + wH = {Ax + py: x € K,y © HA} is 
still convex for any A, uw > 0. The picture below shows 

A+2B 1 2 

—.— = ~A+ 7B 

3 3 3 

for two simplices in R° and also that the sum of some K with a ball of radius « is 
nothing but the set of points within a distance « of K. We have no time to 
elaborate on this. Suffice it to note that it is the starting point of one of the first 
rigorous proofs of the standard isoperimetric inequality in R? which states that 
among all convex bodies of given volume, balls are those with the smallest 
boundary volume (here (d — 1)-volume which is length when d = 2 and area when 
d = 3), see section 8. Let us mention here the basic Brunn-Minkowski inequality 


(vol(A + B))'/“ > (vol(A))'/% + (vol(B))'". 


This was essentially discovered by Brunn in 1888. The reader will easily see that 
this is exactly equivalent to the claim that the (d — 1)~'th power of the (d — 1)- 
dimensional volume of the sections of a given convex body in R®@ is a concave 
function when the cutting hyperplane moves from the left to the right: see sec- 
tion 5.A. 
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In both of the above figures, the Minkowski addition has a regularization effect; 
in both cases K + H has a smoother boundary than K, H. This is naive for the 
general case but was discovered only in 1988 by Kieselman. Even when K and H 
have the most regular boundary possible—namely, C”, or real analytic or even 
more: algebraic—the boundary of K + H can be non-smooth. The results are of 
local nature. The negative one consists simply of the epigraphs K, H (see section 
4) of x*/4 and x°/6 for which 

x® 3 


K + H is the epigraph of an qi 


+ something of class C’. 

So it is only six times differentiable. But it is the worst possible. The positive result 
is: if the boundary of K and the boundary of H are of class C”, then the boundary 
of K + H is always of class C*°/? = C°t+@/9), See [26]. 


K 
QTY 


x x 


— ‘ 3 
6 x =lxP”? +(e C7). 


K+H 


Gj tip 


x 


> 
an 


So the Minkowski addition has no regularizing effect when regularity means local 
smoothness (differentiability properties). But it does have a global regularizing 
effect when regularity means being close to ellipsoids. It was indeed proved by 
Vitali Milman that for any convex body in R®@ there exist two affine isomorphisms 
A and B of R? such that if one sets T = K + A(K) then the dual T* of T has the 
advantage that 7* + B(T*) has a Banach-Mazur distance to the sphere always 
bounded by a number independent of the dimension d. 


7. Topology on the set of all convex bodies: intuition is dangerous 
A. Topology 


We are going to see with the help of topology that intuition can again be 
misleading. Namely that if any convex body has a boundary which is almost 
everywhere C' and C? differentiable, in the set of all convex bodies almost every 
convex body has a boundary which is never everywhere C’. For even worse, see 
below. Topology is needed to give a precise meaning to “almost every convex 
body.” 

For the sake of simplicity we work in the set % of all convex bodies symmetric 
in the origin of a given R%. We recall that these are convex sets which are also 
compact and with a non-void interior. We specify .# in R% only when needed. 
Topology on the convex body set is best given by the Banach-Mazur metric. 

We have already seen in section 3 the Banach-Mazur distance on .%. It has the 
advantage of being linear-invariant. But, stricto sensu, it is a distance in fact only 
when convex sets which differ by some linear transformation are identified. A 
non-linear-invariant distance is the Hausdorff one. Fix on R* some Euclidean 
metric. Then d(K, H) is defined by the smallest « such that K CH + ¢B% and 
HcK+eB%, i., K is within distance « from H and conversely. It is easy to 
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prove that the Hausdorff distance gives the same topology on .% as the Banach- 
Mazur one (just beware of the quotient to be done). 

Remark that no reasonable measure on % is known. To define the analogue of 
“almost every” without measure but when one has a topology, we use Baire’s idea: 
negligible sets will be by definition those which are a denumerable union of sets 
with void interior (these latter are also called meager). 


B. Probable regularity of the boundary of a convex set 


Consider a convex body K and its boundary 0K. We want to study the regularity 
of this set which is a curve when d = 2 and a surface when d = 3. In general a 
hypersurface. This is a local question. In particular, taking x € 0K and a support- 
ing hyperplane H at x we can locally consider K as the epigraph of a convex 
numerical function f: U — R where U is a neighborhood of x in H. Then 0K will 
locally coincide with the graph of f. 

We can apply the results quoted in section 4. So if we build the totality of 0K 
with such graphs we know that 0K is almost everywhere (this for the natural 
measure on 0K obtained from the Lebesgue measures on the H’s through the f’s) 
a differentiable manifold of class C? (in particular almost everywhere of class C’). 
The question we address now is: what are the chances for 0K to be everywhere of 
class C' when we pick K at random in .™. A nice answer was provided by Klee in 
1959. He proved that, up to a negligible set in K, 0K is always C! everywhere, 
hence a nice C'-submanifold of R@ (of codimension one). 

In contrast to this the C? situation is dramatically different: Gruber in 1977 
proved that the set of K in .% whose 0K is almost everywhere C* is negligible. 
Worse: Zamfirescu in 1980 proved that, except in a negligible set, 0K has horrible 
curvature properties: curvature does not exist on a non-denumerable set and 
where it does exist it is equal to zero! The reason is that that is exactly how the 
curvature of a polytope behaves and it so happens that in .% polytopes are dense. 
See [20], [37]. 


8. Brief survey of other important topics in convexity 


For lack of space we cannot treat these in detail but only briefly mention what 
they are about and provide some references. 


A. Inequalities 


The most important one is Brunn-Minkowski’s mention in section 6. It yields 
the classical isoperimetric inequality in R*. For other proofs, other inequalities, 
especially the mixed volume inequalities, which are quite deep and hard to prove, 
see [11], [27]. 


B. Almost spherical sections, concentration phenomena, applications 


The isoperimetric inequality on spheres was used by Paul Lévy back in the ’30s 
to prove concentration properties for functions. It was used by Milman in 1971 to 
prove the inspiring theorem of Dvoretsky (1961) that any convex body admits 
almost ellipsoidal sections for suitable codimension. This was the starting point of 
a whole series of recent works in the same vein. Its importance lies, besides its 
strong geometrical appeal, in its application (through the so-called asymptotic 
methods) to Banach spaces of infinite dimension. For this see [28]. 
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C. Convexity in analysis 


Recently discoveries have been made of various functions which are log-con- 
cave, like vol!”¢ in the Brunn-Minkowski inequality. It seems to have started with 
the important Brascamp-Lieb inequality which solved, in particular, the long-stand- 
ing conjecture: the level lines of the first eigenfunction of the Dirichlet problem of 
a convex domain are convex hypersurfaces. In fact the complete heat kernel has a 
convexity property. See [3], [9]. 


D. Packing and covering 


These two notions are very important. They have been connected, since 
Minkowski, with number geometry, rational approximations. But also with error 
detecting codes. Interesting references are [13], [17]. 

They are also connected with notions of entropy and infinite-dimensional 
Banach geometry: see [28]. 


E. Caratheodory, Helly, Radon 
Three elementary (and very visualizable) statements are: 


the Caratheodory theorem: the convex hull of a set in R% is generated by the 
positive barycenters of its (d + 1)-uples 

the Helly theorem: a family of compact convex sets has a non-void 
intersection as soon as any d+ 1 of its sets has a 
non-void intersection 

the Radon theorem: let T be a subset of at least d + 2 points in R%. 
Then there always exists a disjoint decomposition 
T =T, UT, such that conv(T,) N conv(T,) = ©. 


None of these theorems are very hard to prove. Nor are they too elementary. The 
very interesting fact is that any one of them easily implies the two others, 
Moreover they have an enormous number of applications and are linked a great 
deal with combinatorics. See, for example [15], [18]. 


F. Convexity in other spaces 


Convexity can be defined in more general spaces than the Euclidean ones. In 
particular, it is widely used nowadays in Riemannian Geometry. See [4], [31]. 


G. Last but not least: The moment map 


This map was introduced very recently. The fact that it is convex in certain cases 
is very powerful and turns out to be a surprising link between mechanics, Kahler 
geometry, Lie group theory and eigenvalues of matrices, See [1], [25]. 
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To avoid misunderstanding I will not give a definition of geometry as in the 
customary mathematical treatment of a topic. I will only try to discuss its major 
historical developments. 


1. Geometry as a logical system; Euclid. Euclid’s “Elements of Geometry” (ca. 
300 B.c.) is one of the great achievements of the human mind. It makes geometry 
into a deductive science and the geometrical phenomena as the logical conclusions 
of a system of axioms and postulates. The content is not restricted to geometry as 
we now understand the term. Its main geometrical results are: 

a) Pythagoras’ Theorem. 


c* =a*+ b2 


b 
Fic. 1 
b) Angle-sum of a triangle. 
QS 
+ B+ y = 180° 
~ at+tBpty 
LO 
Fic. 2 


The result b) is derived using the fifth, or the last, postulate, which says: “And 
that, if a straight line falling on two straight lines make the angles, internal and on 
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the same side, less than two right angles, the two straight lines, being produced 
indefinitely, meet on the side on which are the angles less than two right angles.” 


a +B < 180° 


Fic. 3 


Euclid realized that the parallel postulate was not as transparent as his other 
axioms and postulates. Efforts were made to prove it as a consequence. Their 
failure led to the discovery of non-Euclidean geometry by C. F. Gauss, John 
Bolyai, and N. I. Lobachevski in the early 19th century. 

The “Elements” treated rectilinear figures and the circle. The last three of its 
thirteen Books were devoted to solid geometry. 


2. Coordinatization of space; Descartes. The introduction of coordinates by 
Descartes (1596-1650) was a revolution in geometry. In the plane it can be 
described by the following figure: 


4) a 


Fic. 4 


where the role of the two coordinates x, y is not symmetric. Descartes’ work was 
published in 1637 as an appendix, entitled “La géométrie”’, to his famous book on 
philosophy [6]. At about the same time Fermat (1601-1665) also found the concept 
of coordinates and used them to treat successfully geometric problems by algebraic 
methods. But Fermat’s work was published only posthumously [7]. 

One immediate ‘consequence was the study of curves defined by arbitrary 
equations 


F(x,y) = 0, (1) 


thus enlarging the scope of the figures. 

Fermat went on to introduce some of the fundamental concepts of the calculus, 
such as the tangent line and the maxima and minima. 

From two dimensions one goes to n dimensions, and to an infinite number of 
dimensions. In these spaces one studies loci defined by arbitrary systems of 
equations. Thus a great vista was opened, and geometry and algebra became 
inseparable. 
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A mystery is the role of differentiation. The analytic method is most effective 
when the functions involved are smooth. Hence I wish to quote a philosophical 
question posed by Clifford Taubes [15]: Do humans really take derivatives? Can 
they tell the difference? 

Coordinate geometry paved the way to applications to physics. An example was 
Newton’s derivation of the Kepler laws from his law of gravitation. Kepler’s first 
law says that the planetary orbits are ellipses with the sun as their common focus. 
The proof was possible only after an analytic theory of conics had been estab- 
lished. 


3. Space based on the group concept; Klein’s Erlanger Programm. Works on 
geometry led to the development of projective geometry, among whose founders 
were: J. V. Poncelet (1788-1867), A. F. Md6bius (1790-1868), M. Chasles 
(1793-1880), and J. Steiner (1796-1863). Projective geometry studies the geometri- 
cal properties arising from the linear subspaces of a space and the transformations 
generated by projections and sections. Other geometries resulted, the most notable 
ones being affine geometry and conformal geometry. 

In 1872 Felix Klein formulated his Erlanger Programm [1], [11], which defines 
geometry as the study of the properties of a space that are invariant under a group 
of transformations. Thus there is a geometry corresponding to every group of 
transformations acting on a space. The basic notion is “group” and the notion of a 
space is now greatly expanded. In a certain sense the group of projective 
collineations is the most encompassing group and projective geometry occupies a 
dominant position. 

The most important application of the Erlanger Programm was the treatment of 
non-Euclidean geometry by the so-called Cayley-Klein projective metric [12]. The 
hyperbolic space can be identified with the interior of a hypersphere and the 
non-euclidean motions with the group of projective collineations leaving invariant 
the hypersphere. The same group may appear as a group of transformations in 
different spaces. As a result the same algebraic argument could give entirely 
different geometric theorems. For example, everybody knows that the three 
medians of a triangle meet in a point. By using Study’s dual numbers this translates 
into the following theorem of J. Petersen and F. Morley: Let ABCDEF be a skew 
hexagon such that consecutive sides are perpendicular. The three common perpen- 
diculars of the pairs of opposite sides AB, DE; BC, EF; CD, FA have a common 
perpendicular. See [13]. 

Sophus Lie founded a theory of general transformation groups, which became a 
fundamental tool of all geometry. 


4. Localization of geometry; Gauss and Riemann. In his monograph on surface 
theory published in 1827 [8], Gauss (1777-1855) developed the geometry on a 
surface based on its fundamental form. This was generalized by B. Riemann 
(1826-1866) to nm dimensions in his Habilitationschrift in 1854 [14]. Riemannian 
geometry is the geometry based on the quadratic differential form 


ds? = )'g,,(u) dui du*, Sin = Shir 1<i,k<n (2) 


in the space of the coordinates u',..., 4”, where the form is positive definite, or at 
least non-degenerate. Given ds*, one can define the arc length of a curve, the 
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angle between two intersecting curves, the volume of a domain, and other geomet- 
rical concepts. 

The main characteristic of this geometry is that it is local: it is valid in a 
neighborhood of the u-space. Because of this feature it fits well with field theory in 
physics. Einstein’s general theory of relativity interprets the physical universe as a 
four-dimensional Lorentzian space (with a ds? of signature + + + — ) satisfying 
the field equations 


1 
Rix — 758i = 8rxT;,, (3) 


where R,, is the Ricci curvature tensor, R is the scalar curvature, « is a constant, 
and T;, is the energy-stress tensor. 

It is soon observed that most properties of Riemannian geometry derive from its 
Levi-Civita parallelism, an infinitesimal transport of the tangent spaces. In other 
words, Riemannian geometry studies the tangent bundle of a Riemannian space 
with the Levi-Civita connection. 


5. Globalization; topology. Riemannian geometry and its generalizations in 
differential geometry are local in character. It seems a mystery to me that we do 
need a whole space to piece the neighborhoods togetHer. This is achieved by 
topology. The notion of a differentiable manifold is one of the most sophisticated 
concepts in mathematics. The idea was clear to Riemann. The first mathematical 
formulation of a topological manifold was made by D. Hilbert in 1902 [10], [17]. 
Hermann Weyl identified the Riemann surfaces with one-dimensional complex 
manifolds and used it as the central theme of his epoch-making book Die Idee der 
Riemannschen Fldche (16]. On the topological side “neighborhood” became the 
basic concept in Hausdorff’s topology [9]. 

Hassler Whitney saw the merit of establishing an imbedding theorem on 
differentiable manifolds (1936), thus beginning the serious study of differential 
topology. That derivatives play a role in topology came as a shock when J. Milnor 
discovered the exotic differentiable structures on the seven-dimensional sphere 
(1956). By studying the Yang-Mills equations on a four-dimensional manifold, 
S. Donaldson found in 1983 a remarkable theorem on the intersection-form, which 
led to the existence of an infinite number of differentiable structures on R*. 

With the foundation of differentiable manifolds laid, geometrical structures can 
now be defined on them, such as the Riemannian structure, the complex structure, 
the conformal structure, the projective structure based on a system of paths, etc. 
Tools are developed for their treatment, of which the most important are the 
exterior differential calculus and the tensor analysis. 

A fundamental notion is “curvature,” in its different forms. Its simplest mani- 
festation is the circle in plane Euclidean geometry. It could also be the force of a 
physical system or the strength of a gravitational or electro-magnetic field. In 
mathematical terms it measures the non-commutativity of covariant differentiation. 

It is remarkable that suitable algebraic combinations of curvature give topologi- 
cal invariants. To illustrate this we wish to state the Gauss-Bonnet theorem. Let D 
be a domain with a sectionally smooth boundary on a two-dimensional Riemannian 
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manifold. Then the Gauss-Bonnet theorem is the formula 


Yi (7 - a) +f keds + f [Kad = 27yx(D), (4) 


where the first term is the sum of the exterior angles at the corners, the second 
term is the integral of the geodesic curvature along the sides, the third term is the 
integral of the Gaussian curvature over D, and y(D) is the Euler characteristic of 
D. For a rectilinear triangle in the Euclidean plane this is the theorem on the 
angle-sum stated in §1. For higher dimensions we will only give, for the sake of 
simplicity, the theorem for a compact oriented Riemannian manifold M of 
dimension 2 without boundary. Let R;,,, be the Riemann-Christoffel tensor and 
let 


0;; = ye Rite du® A du! (5) 
be the “‘curvature form’’. Let 
Pf = Le, ve inf di, i, Ast" A Oi tion (6) 


be the pfaffian, where e; ..;, is +1 or —1 according as its indices form an even or 
odd permutation of 1,...,2n, and is otherwise zero, and the sum is extended over 
all indices from 1 to 2m. Then the Gauss-Bonnet theorem says 


] 
(—1)" Saag J PP = x), (7) 


aw”’n! 


where 7(M) is the Euler-Poincaré characteristic of M. 


6. Connections in a fiber bundle; Elie Cartan. A notion which includes both 
Klein’s homogeneous spaces and Riemann’s local geometry is Cartan’s generalized 
spaces (espaces généralisés). In modern terms it is called “a connection in a fiber 
bundle.” It is a straightforward generalization of the Levi-Civita parallelism, which 
is a connection in the tangent bundle of a Riemannian manifold. In general, we 
have a fiber bundle 7: E — M, whose fibers 7~'(x), x € M, are homogeneous 
spaces acted on by a Lie group G. A connection is an infinitesimal transport of the 
fibers compatible with the group action by G. 

I wish to illustrate this more precisely in the case of a complex vector bundle, 
where the fibers are complex vector spaces C, of dimension g and G = GL (q;C) 
[4]. The importance of complex numbers in geometry is a mystery to me. It is well 
organized and complete. One manifestation is the simple behaviour of the group 
GL(q;C): its maximal compact subgroup U(q) has no torsion and has as Weyl 
group the group of all permutations on q letters. 

We shall call a frame an ordered set of linearly independent vectors e,,...,e, € 
a '(x), x € M. In a neighborhood U where a frame field e,(x),..., ex), x € U, 
is defined, a connection is given by the infinitesimal displacement 


De, = diwkeg, 1<a,B <a, (8) 
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where w® are linear differential forms in U. We call w® the connection forms and 
the matrix 


w = (w®) (9) 
the connection matrix. Under a change of the frame field 
e, = Yiabe,, A =(a8), (10) 
the connection matrix is changed as follows: 
w'A =dA+ Aw. (11) 
We introduce the curvature matrix 
Q=dw-wra, (12) 
which is a matrix of exterior two-forms. By exterior differentiation of (11) we get 
0’ = ADAM. (13) 


It follows that the exterior polynomial 
i 
det{ 1 + 50 =l1+ c,( 0) + ::: +¢,(Q), (14) 


in which c,(Q) is a 2a-form, is independent of the choice of the frame field and is 
hence globally defined in M. Moreover, each c, is closed, i.e., 


dc, = 0. (15) 


The form c,(Q) has been called the ath Chern form of the connection and its 
cohomology class {c,(Q)} in the sense of de Rham cohomology is an element of 
the cohomology group H2°(M; Z) and is called the ath Chern class of the bundle 
E. These characteristic classes are the simplest and most fundamental global 
invariants of a complex vector bundle. They have the advantage of possessing a 
local representation, by curvature. 

As in the Gauss-Bonnet formula such a representation is of great importance, 
because the forms c,(Q) themselves have a geometrical significance. Moreover, let 
qm’: P —M be the bundle of frames of the complex vector bundle. Then the 
pull-back 7’°c, becomes a derived form, i.e., 


w'*c, = dTc,; (16) 


where Tc,, a form of degree 2a — 1 in P, is uniquely determined by certain 
properties. This operation is called transgression and Tc, have been called the 
Chern-Simons forms [5]. These forms have played a role in three-dimensional 
topology and in recent works of E. Witten on quantum field theory [20]. 

This theory can be developed for any fiber bundle; see [3]. The above provides 
the geometrical basis of gauge field theory in physics. Here M is a four-dimen- 
sional Lorentzian manifold, so that the Hodge *-operator is defined, and we 
define the codifferential 


6 = *d¥* (17) 
There is a discrepancy of terminology and notation, as given by the following table: 
mathematics physics 
connection w gauge potential A 


curvature 20, strength F 
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Maxwell’s theory is based on a U (1)-bundle over M, and his field equations can be 
written 


dA=F, 6F=J, (18) 
where J is the current vector. Actually, Maxwell wrote the first equation as 
dF = 0, (19) 


which is a consequence. For most applications (19) is sufficient. But a critical study 
of an experiment proposed by Boehm and Aharanov and performed by Chambers 
shows that (18) are the correct equations [21]. A generalization of (18) to an SU(2) 
bundle over M gives the Yang-Mills equations 


DA=F, 85F=J. (20) 


It is indeed remarkable that developments in geometry have been consistently 
parallel to those in physics. 


7. An application to biology. So far the most far-reaching applications of 
geometry are to physics, from which it is indeed inseparable. I wish to mention an 
application to biology, namely, to the structures of DNA molecules. This is known 
to be a “double helix”, which geometrically means a pair of closed curves. Their 
geometrical invariants will clearly be of significance in biology. The following three 
are most important: 1) The linking number introduced by Gauss; 2) the total twist, 
which is essentially the integral of the torsion; 3) the writhing number. 

James White proved that between these invariants there is the relation [18] 


Lk = Tw + Wr. (21) 
This formula is of fundamental importance in molecular biology. 


8. Conclusion. Contemporary geometry is thus a far cry from Euclid. To 
summarize, I would like to consider the following as the major developments in the 
history of geometry: 


1) Axioms (Euclid); 

2) Coordinates (Descartes, Fermat); 

3) Calculus (Newton, Leibniz); 

4) Groups (Klein, Lie); 

5) Manifolds (Riemann); 

6) Fiber bundles (Elie Cartan, Whitney). 


A property is geometric, if it does not deal directly with numbers or if it 
happens on a manifold, where the coordinates themselves have no meaning. Going 
to several variables, algebra and analysis have a tendency to be involved with 
geometry. 

This story is clearly one-sided and incomplete, representing only my personal 
viewpoint, and my limitations. It is clear that the story will not end here. Recent 
developments in theoretical physics, such as geometric quantum field theory, string 
theory, etc, are pushing for a much more general definition of geometry [19]. 

It is satisfying to note that so far almost all the sophisticated notions introduced 
in geometry have been found useful. 

Finally, I wish to call attention to an early paper of mine [2], which could be 
read as a companion to this one. 
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Computational geometry is concerned with finding efficient algorithms, or 
computational procedures, for solving a wide spectrum of geometric problems. 
Such problems can arise from computer graphics, robotics and motion planning, 
computer-aided design and manufacturing, or one of the many other applied areas 
where geometry comes into play. In spite of the practical flavor of these problems, 
results from classical and modern geometry quite often can play key roles in the 
design of efficient methods for solving them. Indeed, Euclidean geometry could be 
thought of as perhaps the earliest systematic study of algorithms: the ruler-and- 
compass constructions in Euclidean geometry are just algorithms based on a well 
defined set of elementary operations. In this venerable subject also occur some of 
the first provably unsolvable problems, such as squaring the circle, trisecting the 
angle and doubling the cube, much in the same spirit as the more recent negative 
solution of Hilbert’s tenth problem (implying the nonexistence of an algorithm for 
solving general diophantine equations) [9] and the celebrated (but still unresolved) 
P vs. NP question [13]. 

We begin our discussion of computational geometry with several basic concepts. 


Voronoi Diagrams 


Consider the following problem, known as the post office problem: We are given 
a set S of n points in the plane (considered as post offices or sites). When an 
arbitrary new point (x, y) (say, a residence) is given, we must find out which post 
office is closest to (x, y). 


687 
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In the example above, one expects many queries, with different values of (x, y), 
to be asked with respect to the same set of post offices. Thus it pays to 
“preprocess” the set §, that is, to have some useful computation about S done in 
advance, so that when a query arrives, the answer can be determined quickly. In 
this particular case, precomputing a map of “postal regions” would seem to be a 
good idea. For each post office p, the locus of points (x, y) that are closer to p 
than to any other post office in § is a convex region V(p), called the Voronoi region 
or Voronoi polygon associated with p. The n polygons V(p) form a partition of the 
plane, called the Voronoi diagram of S, denoted by Vor(S). (Figure 1(b).) 


(a) (b) (c) 
A point set S Vor(S) DT(S) 


The earliest significant use of Voronoi regions seems to have occurred in the 
work of Gauss, Dirichlet and Voronoi [29] in their investigations on the reducibility 
of positive definite quadratic forms (see [21], [17] for a summary). However, the 
same concept has evolved independently in a variety of other scientific disciplines. 
For example, in crystallography, one simple model of crystal growth starts with a 
fixed collection of sites in 2- or 3-space, and allows crystals to begin growing from 
each site, spreading out at a uniform rate in all directions, until all space is filled 
(see [14]). The “crystals” (called “Wirkungsbereiche” or “domains of action”) then 
consist of all points nearest to a particular site, and consequently are just the 
Voronoi regions for the original set of points (FIGURE 2). 

Similarly, the statistical analysis of meteorological data led to the formulation 
(around 1910) of Voronoi regions under the name “Thiessen polygons.” The 
reader is referred to [17] for a survey of these applications, and to [2] for a more 
complete discussion of the uses of Voronoi regions in statistics, geography, 
interpolation, physical chemistry, and many other fields. 

The Voronoi diagram enjoys many interesting properties. First of all, the 
straight-line dual of the Voronoi diagram is a triangulation of S, called the 
Delaunay triangulation and denoted by DT(S) (Figure 1(c).) It is easy to see that: 

P1. If p is a nearest neighbor of q, then polygons V(p) and V(q) are adjacent 
in the Voronoi diagram. This means that p and gq are connected by an edge in 
DT(S). 

P2. Points p,q,r form a triangle in DT(S) if and only if the circle through p, 
q, and r has no point of S in its interior. 
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(a) (b) (c) 


(d) (e) 


Fic. 2. Model of 2-dimensional crystal growth. 


It follows from property P1 that, to find a closest pair of points occurring in S,” 
we need only consider the O(n) pairs which are joined by edges in the Delaunay 
triangulation for S. 

Some further useful properties of these diagrams can be established without 
much difficulty. For example, the Delaunay triangulation contains all minimum 
spanning trees of the point set. A minimum spanning tree (MST) is a tree (i.e., a 
connected graph with no cycles) that connects all points of V, such that the sum of 
the edge lengths in the tree is as small as possible. The MST of a point set S may 
not be unique, but any such tree must use only edges present in D7T(S). We state 
this as: 

P3. MST(S) c DT(S). 


Proof. Let e = (p,q) € MST(S). If e € DT(S), then by property P2, the circle 
with (p,q) as diameter must contain some point of S, say r, in its interior. When 
we remove the edge e from MST(S), the set S breaks up into two connected 
components. But the two components can be rejoined by using one of the edges 
(p,r) or (q, r); since either edge is shorter than the original edge (p, q), we have a 
contradiction. m 


Property P3 implies that we can find a MST for a set of points by first 
constructing its Delaunay triangulation DT(S) and then, since the triangulation is 
a planar graph, apply a particularly efficient (e.g., linear-time) MST algorithm for 
planar graphs to DT(S). 

For a set S of n points in the plane, there may be exponentially many different 
triangulations. Depending on the application, one may be interested in triangula- 
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tions that have certain special characteristics. Not surprisingly, the Delaunay 
triangulation DT(S) satisfies a number of extremal properties, one of which is that 
it maximizes the minimum angle among all triangulations of S. In other words, 
DT(S) does the best possible job in avoiding triangles with small angles (see [11] 
for a proof). 


Construction of Voronoi Diagrams 


How difficult is it to construct the Voronoi diagram Vor(S) from S$? A 
straightforward approach is to compute the polygons V(p) one by one. Write 
S = {p,,...,p,}. To find V(p,), say, we can compute, for successively larger values 
of j, the polygon U; which is the locus of all points that are closer to p, than to any 
point in {p,,..., Dj. Once U; is known, we can obtain U;,, by intersecting the 
convex polygon U; with the perpendicular bisector between p, and p,,,. This step 
can be accomplished by a fast binary (or logarithmic) search, because the vertices 
of a convex polygon admit a nice ordering (first increasing and then decreasing in 
their x-coordinates), and a line can intersect this polygon in at most two places. 
Thus, the polygon V(p,) = U, can be computed in O(n log n) steps, and therefore 
the whole diagram Vor(S) in a total of O(n? log n) steps. 

To compute Vor(S) more efficiently, we must try to construct several polygons 
V(p;) simultaneously. Here we can invoke a general principle, called the sweep 
technique, which is often useful in the design of geometric algorithms. Imagine that 
a horizontal line is being swept up the plane from y = —% to y= +o, Any 
cross-section of the two-dimensional diagram Vor(S) as defined by the sweep-line 
is a simple one-dimensional configuration. Furthermore, the configuration under- 
goes essential (i.e., topological) changes rather infrequently—only when the 
sweep-line first enters some new region V(p;) , or has just left such a region; in 
either case, the change is local. The difficult part, of course, is in knowing when a 
change should occur, since the sweep-line will enter a region V(p;) before it 
actually encounters the point p,. It turns out that this difficulty can be overcome by 
a clever idea due to Fortune [12]. One first performs a non-linear transformation T 
of the plane, with 7 depending on the set S. Under this transformation, the new 
diagram 7(Vor(S)) has the property that the lowest point of the Voronoi region 
T(V(p;)) is always at the site T(p,) itself (Figure 3). Thus, we will create a 
Voronoi region only when the sweep-line reaches a new site, and delete that 
region when its topmost point is reached. Finally, the real Voronoi diagram can be 
easily reconstructed from its transformed image. The total running time of this 
algorithm is O(n log n), since each insertion/deletion operation can be done in 
O(log n) steps with a standard one-dimensional search structure. 


(a) (b) 


Fic. 3. Fortune’s transformation. 
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The transformation used by Fortune maps a point r in the region V(p) to 
r+ (0, y) where y = d(r, p), the distance from r to p; thus, lines are transformed 
into hyperbolic arcs. This transformation can be intuitively understood as follows. 
Think of the ordinary Voronoi diagram as being generated by a physical process in 
which an obstacle is placed at each site, and an expandable disk is moved around 
so that at any instant, the disk has the maximum possible radius permitted by the 
obstacles. The locus of the center of the disk, then, defines the Voronoi diagram. 
If, instead of tracing the center, we trace out the locus of the topmost point of the 
disk at all times, the result would be precisely the transformed Voronoi diagram 
defined above. 


Convex Hulls 


It is known that the Voronoi diagram for a finite convex planar set, that is, a set 
of n points forming the vertices of a convex polygon, can be computed in only 
O(n) time [1]. Indeed, convexity is a strong geometric condition that often leads to 
particularly efficient solutions to computational problems. Consequently, a com- 
mon strategy used in practice for solving problems on a general point set may 
involve either first decomposing the set into convex pieces, or approximating the 
set with its convex hull, i.c., the smallest convex set containing the given set. This 
makes the construction of the convex hull of a set a basic problem in computa- 
tional geometry. 

The first O(n log n) convex hull algorithm in two dimensions is due to Graham 
[15]. The algorithm first sorts the points by polar angle about an interior point, and 
then scans the sorted list to discard any point that would cause a reflex angle to 
occur (FiGuRE 4(a)). Note that one may occasionally have to back up a long way 
and discard many points in order to restore convexity. However, as can be easily 
argued, the overall cost of the scanning step is still only linear in the total number 
of points. The convex hull can also be constructed by a divide-and-conquer 
algorithm ([25]). In this approach, we partition the points into two equal subsets P, 
and P, by a vertical line, find their convex hulls recursively, and merge them to 
form the desired convex hull. The merging step is accomplished by finding the 
upper bridge b, and the lower bridge b, between P, and Pp (FiGurE 4(b)). To find 
the upper bridge, we take the line segment / connecting the rightmost point p of 
P,, and the leftmost point g of Pp, and move its endpoints upward iteratively along 
the boundaries of.P, and Pr, until / reaches the highest possible position, which 
now defines b,. The lower bridge b, is found analogously. The time T(1) required 
satisfies T(n) < 2T(n/2) + O(n) which implies T(n) = O(n log n). If the input 
points are sorted, or more generally, form the (ordered) vertices of a simple 
polygon (i.e., a polygon without self-intersections), the convex hull can in fact be 
found in linear time (Graham and Yao [16], Lee [20]). 

How fast can we compute the convex hull of a set in three dimensions? The 
divide-and-conquer algorithm described above, as it turns out, can be made to 
work in three dimensions within the same time bound O(n log n). In the crucial 
merging step of the algorithm, the hull of the union of two non-intersecting 
polyhedra can be formed by “gift wrapping” the two objects in one package, so to 
speak. The merging cost is proportional to the total size of the incidence graphs of 
the two polyhedra (i.e., the graphs induced by the vertices and edges of the 
polyhedra), and is O(n) since these incidence graphs are planar [24]. 

For dimensions d > 4, the situation becomes more complex since a d-polytope 
with n vertices can have as many as Cn!4/?! faces ((17]). There are two types of 
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Vertex p, is eliminated 
because Z p; Py Pz; is reflex; 
then p, is eliminated because 
ZDP1P3D, is reflex. 


(a) 


Graham’s scan 


—_— | 
—_—_ 
—_—— 


(b) 


Divide and conquer 


Fic. 4 


convex hull algorithms, depending on whether the algorithm only enumerates the 
facets, i.c., the (d — 1)-faces, of the convex hull or produces the complete facial 
lattice, i.e., a description of all faces and incidence relationships of the convex 
polytope. Studies of these convex hull algorithms can be found in Chand and 
Kapur [6] and Seidel [26]. We note that, through duality, the problem of enumerat- 
ing the facets of the convex hull of a point set is equivalent to the problem of 
enumerating the vertices of a polytope defined by a set of linear inequalities; in the 
latter form this problem has also been widely explored in the optimization 
literature (See Dyer [10] for a survey). 
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Connection between Voronoi Diagrams and Convex Hulls 


Brown [5] was the first to observe that, through an appropriate transform the 
Voronoi diagram of a set S in R*% corresponds to the convex hull of 
the transformed set in R“*'. We illustrate this for the simple case d = 2. Consider 
the mapping 

a: (x,y) (x,y, x* + y7) 

which “lifts” a point (x, y) in the plane onto the paraboloid z = x” + y”. Note that 
cocircular points in the plane are mapped by a into coplanar points in space 
(FIGURE 5(a)). Given n sites S in the plane, their image points under a form the 
vertices of a convex polyhedron P (since the paraboloid is convex). This means, for 
any “downward-looking” face f of the polyhedron P, its defining plane must lie 
“below” all other vertices of P. Therefore, by our earlier observation, the projec- 
tion of f corresponds to a circle with no points of S in its interior. This implies 
that the projection of all the downward-looking faces of the polyhedron P gives 
precisely the faces of the Delaunay triangulation of the planar set S (FIGURE 5(b)). 
This relationship readily generalizes to higher dimensions, and enables one to use 
convex hull algorithms in d+ 1 dimensions to obtain Voronoi diagrams in d 
dimensions! 


i _ 
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(a) (b) 


Fic. 5. Voronoi diagrams and convex hulls. 


Combinatorial Geometry 


The analysis of geometric algorithms often requires detailed combinatorial 
knowledge of the geometric structures involved. The book by Edelsbrunner [11] 
discusses many topics which overlap both computational and combinatorial geome- 
try. We give one such example here. 

Many algorithms in computational geometry involve calculation of the lower 
envelope (i.e., pointwise minimum) of a collection of functions. For example, 
suppose we are to render the 2-dimensional image of a 3-dimensional terrain, 
which is a polyhedral surface with the property that any line parallel to the z-axis 
intersects the surface at most once. From any viewpoint v, the “upper rim” (also 
called the silhouette) of the terrain as seen from v can be expressed as the lower 
envelope of a set of line segments in two dimensions (FIGuRE 6). 
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Fic. 6. The lower envelope of a set of line segments. 


Lower envelopes are closely related to Davenport-Schinzel sequences (or DS 
sequences, for short). Specifically, an (, s)-Davenport-Schinzel sequence is a 
sequence composed of n symbols satisfying: (i) no two adjacent elements are equal, 
and (ii) the sequence does not contain as a (not necessarily consecutive) subse- 
quence of length s + 2 any alternation of two distinct symbols. The parameter s is 
called the order of the DS sequence. In the above example, the lower envelope of 
a set of n segments {/,,...,/,} in the plane is a DS sequence of order 3, since no 
alternation of the form ---/; °°: J, +++ d; ++: 1, +++ 2; +++ can occur in the 
lower envelope. It has recently been shown that the maximum length A,(1) of an 
(n,3)-DS sequence is A,(n) = @(na(n)), where a(n) is the extremely slowly 
growing functional inverse of the Ackermann function ((19]). In fact, this maximum 
value can be realized by a rather intricate arrangement of n line segments [30]. 
Thus the silhouette of a general polyhedral surface has complexity @(na(n)), and 
can be computed in O(na(n) log n) time [8]. Other geometric problems where 
Davenport-Schinzel sequences naturally arise include the calculation of Euclidean 
shortest paths in 3-space amidst polyhedral obstacles, hidden-surface removal from 
a varying viewpoint, and the computation of other time-varying geometric configu- 
rations ((4], [27]). 


Space Partitions and Range Search 


The post office problem mentioned earlier belongs to a class of problems 
studied in computational geometry known as “range search problems”. In the most 
general setting, the database $ to be searched may contain higher order objects 
than points, and the queries can be of a more complex nature. Still, let us confine 
our discussion here to the case when S is a set of points in R®. An important type 
of query is the “half-space” query, in which an arbitrary linear inequality is 
specified, and all the points in S that satisfy the inequality should be identified 
quickly. As an example of a half-space query, consider a database where the first 
two coordinates contain, respectively, the income and the number of children of a 
household, and a query asks for all households with monthly income exceeding 
$1000 plus $200 for each child. 

The first nontrivial solution to this problem was proposed by Willard [31]. It 
makes use of the following well-known fact: a density function over a bounded 
region can be partitioned by two straight lines /,,/, into four quadrants with equal 
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mass. Translating this result into the discrete case, it says that a set S of n points 
in the plane can be partitioned by two lines into four (open) quadrants such that at 
most n/4 points are contained in the interior of each quadrant. One can then 
further partition the points in each quadrant, and repeat this process until all 
points are separated. This gives a tree structure for representing the set S (see 
FiGurE 7). For a query half-plane Q, since the line defining Q can intersect at 
most three of the four quadrants in each partition, one of the quadrants is either 
entirely inside or outside of Q, and hence can be exempted from further search. 
The search time 7(n) thus satisfies a recurrence relation T(n) = 3T(n/4) + O(), 
which yields T(n) = O(n*) with a = log, 3 = 0.774. 

Can we extend such 4-partitions in R* to 8-partitions in R*? The answer turns 
out to be yes. Given a set S of n points in R°, one can always find three planes 


Is 


Le 


ST 


Lio 


(a) 
A 4-partition for 14 points 


(b) 


The corresponding tree structure 


Fic. 7 
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that form an 8-partition in the sense that at most n /8 points of S lie in each of the 
eight open regions. Thus, we can represent a point set in R° by recursive 
8-partitions. Since an arbitrary plane in R° can intersect at most seven of the eight 
open regions defined by an 8-partition, this leads to a retrieval time of O(n%) with 
a = log,7 = 0.936 for half-space queries. 

The 8-partition proposition stated above is considerably more difficult to prove 
than the two-dimensional analogue [34]. However, we note that a weaker version 
of it actually suffices to guarantee the same query time O(n!°88’). In this version, 
the partition consists of a bisecting plane H for S, together with two separate 
4-partitions for the sets S*,S~ above and below H, with the condition that the 
half-axes of the two partitions are lined up exactly (FiGuRE 8). 


A 


Fic. 8. An 8-partition with axis A and center C. 


To show that such a partition exists, we invoke the well known Borsuk-Ulam 
theorem. 


THEOREM (Borsuk-Ulam, [23]. Let f: S¢ +» R® be a continuous, antipodal map, 
i.e., f(—v) = —f(v) ‘for v € S4%, the unit d-sphere. Then there is a point v € S® such 


that f(v) = 0. 


To obtain the desired partition, consider p,(S*), the projection of S* onto H 
along direction v, where v is any vector in the southern hemisphere of S*. We find 
a (unique) 4-partition (L,, L,) for the planar set p,(S*) with the constraint that 
L, is parallel to the x-axis. Call the intersection point of L, and L, the center of 
p,(S*), denoted by C(v). Similarly define the center C’(v) for p_,(S~), which is 
the projection of S~ along the direction —v. Applying the Borsuk-Ulam theorem 
for d = 2, it follows that the function f(v) = C(v) — C’(v) must vanish for some 
vector vy. This gives us what we want: separate 4-partitions for S* and S~ which 
share a common axis A, where A has direction v, and goes through the point 
C(v,) = Cv). See [32] for complete details. 
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It is easy to see that the resulting 8-partition does have the property that any 
query plane can intersect at most 7 out of the 8 regions. Furthermore, this version 
can be generalized to higher dimensions by an inductive argument, thus allowing 
us to solve half-space queries in any number of dimensions. 

It is an interesting open question whether the (strong) 8-partition proposition 
can be generalized to four dimensions; that is, given a density function over a 
bounded region in four dimensions, whether it can always be partitioned by 4 
hyperplanes into 16 orthants with equal mass. However, it is not hard to show that 
(strong) 2%-partitions are not always possible in dimensions d > 5. For example, 
take thirteen small balls of equal mass in R°, and let the balls be in general 
position, so that no hyperplane can intersect more than five of them. Then, in any 
2°-partition, each ball must be cut by at least two hyperplanes, since otherwise 
some orthant would contain at least half of a ball, and thus at least 1/26 of the 
total mass, which is more than the required fraction 1/32. Therefore, for all 
thirteen balls, at least 26 instances of hyperplane-ball intersections are needed. 
Since the balls are in general position, five hyperplanes can provide at most 25 
such instances and we have a contradiction. 


Lower Bounds 


An important, and often difficult, challenge in algorithm design is to know 
whether an algorithm can still be improved upon, or in fact one has already found 
the fastest algorithm possible. To resolve such questions,-we must establish lower 
bounds to the computational complexity of a problem in some fairly general model. 

For geometric problems, a natural model for studying computational complexity 
is the algebraic computation tree model. In this model, an algorithm can in one step 
either perform an arithmetic operation, or make a comparison and then branch 
according to the result of the comparison. (Thus, each comparison in fact corrre- 
sponds to a polynomial test of arbitrary degree evaluated at the input.) Figure 9 
gives an example of an algebraic computation tree for testing whether a point 
(x, y) lies in the region 


W = {(x,y)lyy > 1, -2.5<x+y < 3}. 


Let us consider the problem of finding a closest pair among n points in the plane. 
As we saw before, this problem can be solved in O(n log n) time by first construct- 
ing the Voronoi diagram and then performing a search in linear time. It turns out 
that (n log n) is also a lower bound for this problem in the algebraic computa- 
tion tree model. In fact, the simpler problem of deciding whether the input set 
contains a repeated point, i.e., whether p; = p; for some i #j, has complexity 
Q(n log n) even in one dimension! This result is due to Ben-Or [3], and the main 
idea of the proof is the following. The upper bound follows simply by sorting the n 
points and testing adjacent pairs for equality. The lower bound is obtained by 
mapping the polynomial inequalities to a suitable algebraic variety and then 
associating the number of computational steps with the Betti numbers of the 
algebraic variety. Let x = (x,, X,,...,x,) © R” denote the input, and let T be an 
algebraic computation tree that decides whether x € W where W is defined by 


W= (Cts tones) [1 (x, -~x,;) # 0} CR". 


L+¥]J 
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Fic. 9. An algebraic computation tree for testing “Is (x, y) € W.” 


It is easy to see that W has n! connected components. At the same time, by 
invoking a theorem of Milnor-Thom ((22], [28]), one can show that the algebraic 
variety V associated with a leaf of T has at most 3”*” components, if the path to 
that leaf involves h operations. For a leaf associated with a “yes” answer, each 
connected component of V must be completely contained in some connected 
component of W. We thus conclude that the maximum path length h has to satisfy 
3°3"*" > n!, which implies h = O(n log n). 


Common Techniques 


As we have seen, computational geometry uses concepts and results from 
classical geometry, topology, combinatorics, as well as standard algorithmic tech- 
niques such as sorting and searching, graph manipulations, and linear program- 
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ming. In addition, certain special techniques and paradigms have emerged and 
found repeated application in computational geometry. We briefly summarize 
some of the most common ones below. Much more comprehensive surveys can be 
found in [11], [25] and [33]. 


1. Divide and conquer. The basic idea is to divide the problem into two or 
more subproblems, solve them recursively and then merge the results to obtain a 
solution to the original problem. Although this technique is common in algorithm 
design, it seems particularly well suited for solving geometric problems: the 
dividing step can naturally be accomplished by splitting along a line (or a hyper- 
plane in d dimensions), and the merging step can take advantage of reduced 
problem complexity across the dividing hyperplane. The convex hull algorithms in 
two and three dimensions are good examples. 


2. The sweep technique. This is a technique designed to reduce the dimension 
of a (static) geometric problem at the cost of changing its mode from static to 
dynamic. For example, in the construction of Voronoi diagrams, the sweep-line 
induces one-dimensional cross-sections which change dynamically (but only locally) 
when the sweep-line moves from one (discrete) position to the next. The sweep 
technique is also quite useful in higher dimensions. 


3. Geometric transformations. Through a geometric transformation, one can 
often state a given problem in an alternative form which, though equivalent, may 
shed new light on the problem. One of the transforms used most often in 
computational geometry is the dual transform between points and hyperplanes. 
Other examples include the hyperbolic transform of Fortune for Voronoi diagrams 
and the correspondence between Voronoi diagrams and convex hulls (both dis- 
cussed earlier), the use of Pliicker coordinates to represent a line in three 
dimensions as a point in five dimensions, and the skewed projection determined by 
a pair of lines in three dimensions. 


4. The locus approach. This approach applies to geometric query problems and 
it works as follows. We interpret the query as a point in some space (through an 
initial transformation, if necessary), and then partition that space into a set of 
non-overlapping regions so that the answer is invariant for all query points that fall 
in the same region. Several ingredients are involved in this approach: transforma- 
tion of the query into a point, definition and construction of a suitable space 
partition, and an algorithm for point-location. Using the Voronoi diagram to solve 
the nearest-neighbor query is a basic example of this approach. 


5. Random sampling. The idea is to use random sampling of the input objects 
to carry out divide-and-conquer efficiently, that is, to split up the original problem 
into subproblems each guaranteed to be of small size [7]. The expected perfor- 
mance of the resulting algorithm is with respect to the built-in randomization 
process, and does not depend on any assumption about the input distribution. This 
technique often yields simple algorithms with good expected performance. 


Conclusion 


We have tried here to give a brief sketch of some of the main ideas underlying 
the dynamically growing field of computational geometry. It is the natural conver- 
gence of ideas from many areas of mathematics such as topology, combinatorics, 
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algebra, probability and, of course, geometry, with those from computer science, 
such as graph algorithms, data structures, and optimization. We feel confident that 
the current trend of studying computational questions on geometric objects will 
continue to suggest new classes of problems which in turn will enrich and become 
a permanent part of the glorious tradition of geometry. 
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In this article we will focus on the subject of embedded minimal surfaces. We 
will try to illustrate how some of the modern-day work in the subject—both in 
construction of examples and in development of a global theory—is dependent 
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upon understanding the catenoid. As we will see in the next section, the catenoid 
is the unique nonplanar minimal surface of revolution, and so is really the simplest 
complete minimal surface, after the plane. Nonetheless, we will try to show that it 
is the most important example. 

A surface S in R° is said to be minimal if each point has a neighborhood @c S 
that is the surface of least area for its boundary 0@. Such a surface may have 
infinite area, the flat plane for example. Alternatively, a minimal surface is one for 
which every compact subdomain @ is a critical point for area among surfaces with 
boundary equal to 0@. From the point of view of local geometry, this is equivalent 
to the requirement that the mean curvature H is identically zero. The mean 
curvature is the average of the principal curvatures, so H = 0 means that at each 
point the average normal curvature is zero. From a physical perspective, loosely 
speaking, one imagines the surface as made up of very many rubber bands, 
stretched in all directions; on a minimal surface the forces due to the rubber bands 
balance out, and the surface does not need to move to reduce tension. This is not 
to say that a minimal surface is necessarily stable’. In fact this is not the case for 
most of the surfaces. For example, among complete immersed minimal surfaces in 
R°, the only stable orientable example is the flat plane [18, 22]. 

One of the most important facts about minimal surfaces is that their coordinate 
functions are harmonic*. Harmonic coordinates actually characterize minimal 
surfaces, as can be seen from the following general relation 


A,X = 2HN, 


where X: M > R? is an isometric immersion and N is the oriented unit-normal 
vector field associated to the immersion. When H = 0, it follows that any linear 
function on R°, when restricted to M, is harmonic: 


A,X, = 0, 


where X,(p) =U: p. 
In fact, notice that if a surface has a single coordinate function X, that is 
harmonic, then combining the previous two formulae in this special case gives 


0=A,X,=A,X-+v =2HN -v. 


a 


"By stable, we mean that each compact subdomain is a local minimum for area among surfaces 
sharing the same boundary. 
“A function f: S — R” on a surface is harmonic if 


Apf = 0 
in the induced metric, where A, is the Laplace-Beltrami operator. In conformal coordinates, 
ds* = d( du? + du3) A, =A7'A, 


where 


is the usual Laplacian. 
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In a neighborhood of any point where the Gauss normal mapping is nondegener- 
ate, we must have H = 0. Thus on the open set where the normal mapping is 
nondegenerate, H vanishes. Assuming the surface is connected, this means that 
either the surface is minimal or the Gauss map is of rank less than 2 everywhere, 
which implies it is flat (K = 0). Simple analysis will show that this latter situation 
can only happen if the surface is a cylinder over a plane curve. Thus, among 
surfaces with nondegenerate Gauss mappings, minimal surfaces are characterized 
by the existence of a single harmonic coordinate function. 

The power of this analytical characterization becomes evident if we work with 
conformal coordinates (u,,u,); that is, coordinates in terms of which the metric 
looks like ds* = A(du? + du3). Let 


0 1/0 0 4 0 1{ 0a 0 
=u, +l), — = ~|— —i-——] and ~=-7{[—— +1— ]}. 
a 2 Z Ou, 


Then for any f: M > R” 


of 
Oz0z- 


AA, f=Af=4 


For a minimal surface, this means that 
0 (0X 
e(it)-° 
Oz \ 0z 
° . 0X ° 
1.e., the locally defined map op from M to C? is holomorphic.’ But 
Zz 


aX 1[aX ax 
Oz 2 


l 
Ou, du, 


a complex 3-vector whose real and imaginary parts consist of real 3-vectors forming 
an orthogonal basis for the tangent space of X(M) at each point. Since 


we may use this information to prove that the Gauss mapping of X is a 
meromorphic mapping to S*. Let a: S* > C U {} be stereographic projection. If 
we let g = a °N be the stereographic projection of the Gauss map, it follows that 
g is conformal if and only if X is minimal. 

We have seen that on a minimal surface M every linear function on R? restricts 
to a harmonic function on M. Their conjugate harmonic functions are not, in 
general, globally defined. But the differentials of the conjugate harmonic functions 
are, which means that every linear function gives rise to a holomorphic one-form 


>We may consider M as a Riemann surface, with complex coordinates given by local conformal 
coordinates. 
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on M. One of the most important tools in the classical study of minimal surfaces, 
the Enneper-Weierstrass representation, makes essential use of this analytic data. 
(See Figure EW.) In fact, given a meromorphic function and a holomorphic 
one-form on a Riemann surface they determine, locally, a minimal] surface. The 
global use of the Enneper-Weierstrass representation will be central in our later 
discussion. 


1. The flux formula, the catenoid and embedded minimal annuli 


A simple but very useful consequence of this analytical characterization of 
minimal surfaces by harmonic coordinate functions is the flux formula. According 
to Stoke’s Theorem, for any C?-function f: S > R” 


[Anfda = [ aa) ds, (1.1) 


where dA is the element of area on S, ds is arc-length on 0S, n is the outward 
pointing conormal to S along 0S and df(n) is the directional derivative of f in the 
direction 7. Applying (1.1) to an isometric immersion X: S — R°, we have that 
dX(n) is the image in R° of the outward-pointing conormal; writing n* = dX(7) 
we have [,A,fdA = f,,.n* ds. (See Ficure 1.) If X is minimal we have 


[ nv ds =0. (1.2) 
os 


S with the conormal n* 
along its boundary 


een en eens 
rr hi. ee 
e . 


Fic. 1 


In particular, if 7 is any fixed vector in R° 
i n* -ods = 0. (1.3) 
as 


The integral in (1.3) can be thought of as the flux through 0S of the flow in R° 
with constant velocity vector v. While (1.2) and (1.3) are quite simple and were 
undoubtedly known in the 19th century, they have only recently come into 
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widespread use in the study of minimal and constant mean curvature surfaces 
[39, 40]. 

As a sample application of the flux formula, we consider the catenoid. It was 
Euler who discovered the catenoid, which was the first nonplanar example of a 
minimal surface. He did this by finding the surface of revolution that was a critical 
point for area. Consider a surface of revolution about the z-axis with profile curve 
(r(t), t) in the (x, z)-plane. Let S be the compact portion S§ of the surface that is 
between z =f, and z =t,. S is bounded by two circles of radii r(t,) and r(t,), 
respectively. Let a(t) be the angle between the tangent plane to S at level z =t 
and the plane {z = t}. Then computing the flux in the z-direction (v’ = (0,0, 1)), 
gives by (1.3) 


i sin a(t,) ds = i sin a(t,) ds 
SO{z=t} SO{z=ty} 


or 27r(t,)sin a(t,) = 27r(t,)sin a(t,). But t, and t, are arbitrary, so r(t)sin a(t) 
=, or 
1 


OTP 


This ordinary differential equation is satisfied by the functions 


=. (1.4) 


cosh( At + B) 
r(t) = —_ A #0, (1.5) 


and these are all the solutions. Thus, nonplanar minimal surfaces of revolution are 
all catenoids*. Note that two catenoids corresponding to different solutions of (1.5) 
differ by a homothety and translation, and so there is really only one nonplanar 
minimal surface of revolution. The catenoid has the following properties: 

C1) It meets each plane {z = ¢} in a circle; 

C2) Topologically it is an annulus; conformally it is a sphere minus two points; 

C3) Its Gauss map N is one-to-one and onto the sphere minus the poles; 

C4) The portion of the catenoid outside of a sufficiently large compact set of R° 
consists of two unbounded components. 

When one tries to look at minimal surfaces similar to but more general than the 
catenoid, things get:very much harder. For example, J. C. C. Nitsche [49] has 
proved that any minimal annulus that meets every {z = t} plane in a single 
star-shaped curve must be the catenoid. He asked whether “star-shaped curve” 
could be weakened to “Jordan curve.” 

Q1) Is there a minimal annulus other than the catenoid that meets every 
horizontal plane in a single Jordan curve? 

The affirmative answer to this question is known as the “Nitsche Conjecture.” 
However, it is still an open problem. More generally, the catenoid is the only 


*The name derives from the fact that these profile curves are also the solutions to the differential 
equation for the shape of a hanging chain, a catenary, namely (1.4). 


1990] 


Enneper-Weierstrass Representation 


Given a Riemann surface M, a meromorphic 
function g and a holomorphic one-form 7 on M, 
there exists a conformal minimal mapping 


X = (Xj, X2, X3) 
whose Gauss map WN has stereographic projection 


g and whose third coordinate function satisfies 


ONS 5 
rz ¢ =2¢g7. 


Specifically 


X(p)= Ref '®, where 


0 
® = (1 -g7,i(1 +g7),2g)n. 
The metric induced by this mapping is given by 
d\df|*, where 


A = 2Inl(1 + lel’). 


Thus X is regular if and only if the zeros of 


Fic. 
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(order 2m) occur precisely at the poles (order m) 
of g. Isolated branch points may occur where this 
condition is violated. 


The metric is complete provided 
i] Ads =” 
5 


for any divergent curve 6 on M. 


For any nontrivial closed curve y C M, there is an 
associated period vector 


T, = Ref ®. 


Y 


If any 7, #0, then the mapping X is not well 
defined on M, but only on some covering of M. 
To produce a well-defined immersion of M, it is 
required that 


T, =0 
for all closed cycles on M. 
Perodic surfaces may be constructed by having 


one or more nonvanishing periods. 


EW 


known complete, properly> embedded minimal surface that is topologically an 
annulus. 

One would like to know all the properly embedded minimal surfaces of finite 
topology. A surface is said to have finite topology if it is homeomorphic to a 
compact surface from which a finite number of points have been removed. If such 
a surface is properly embedded in R°, then each of these points corresponds to a 
distinct way to diverge in R°. The image of a small disk neighborhood of a 
puncture point is called an end of the surface. The catenoid (see properties C2 
and C4) has two ends. Until recently, the only known examples of such surfaces 
were the plane, the helicoid and the catenoid. The helicoid is intimately related to 
the catenoid. (See Figure CH.) 


2. Complete embedded minimal surfaces with finite total curvature 


Finite topology and finite total curvature® are intimately related. If we take the 
example of the catenoid, we can get a feeling for this relationship. It can be 


“A map F: M - N is proper if every divergent sequence in M is mapped into a divergent sequence 
in N. 
°A surface is said to have finite total curvature if [|K|dA < ©. 
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The Catenoid /Helicoid The period vector 
{0} T, = [© = (0,0,— 4rsin 9) 
& =Z y 
n = cz * dz, Case 1. Catenoid: 6 = 0; T, = (0,0, 0) 
c =e’? 
Choose py = 1, 


®(z) =c((z-?- 1), i(z7 + 1),2z~') dz, 
X(z) =X), X2, X3 
= —Re(c(z7! +2), 


i. 
| 


X, = +Im(c(z7!-2z), 


X, = 2(cos 6 In|z| — sin 6 - arg z) 


Rays emanating from 0 go to meridians. Circles 
about 0 go to circles in planes {x3 = const.} 


. Sa 


Wenn 
SSS 
Nig 


T 
Case 2. Helicoid: 6 = >? 
T, = (0,0,- 47) 


A = 2A\z|~* + 1) and is independent of 0. 


Completeness: 


[ras = 2 for? + 1)|dt| = ~ 
1 1 Rays go to horizontal straight lines 
Circles go to helices 


0, 0, _» _ 
i} Ads = f (°° + I) Idel = & y goes to the x-axis. 


Fic. CH 


guessed from the pictures, or directly computed from the formula for the catenoid, 
that the image under the Gauss normal map of each circle on the catenoid is a 
latitude circle on the sphere. Each meridian is covered, exactly once, so the entire 
sphere is covered, except for the poles. According to C3, we may identify the entire 
sphere with the catenoid by adding two points at infinity. Thus the meromorphic 
Gauss map is used to identify the compactified surface with the Riemann sphere, 
and the total curvature is just the area of the Gaussian image, counted alge- 
braically; i.e., —47r. This relationship between conformal type and total curvature 
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was established in general by Osserman [52, 53] who proved that a complete 
minimal surface of finite total curvature: 

K1) is conformally equivalent to a compact Riemann surface that has been 
punctured in a finite number of points and; 

K2) has a Gauss normal mapping, defined and meromorphic on the punctured 
surface, which extends to a meromorphic function on the compactification. Also 
the holomorphic one-form associated to any coordinate function extends meromor- 
phically to the conformal compactification. 

Intuitively, the Gauss mapping provides a finite-to-one covering of the Riemann 
sphere. Because the curvature must go to zero quickly as one diverges, the 
principal curvatures, which are equal in absolute value on a minimal surface, must 
also go to zero quickly. This, together with the assumption of completeness, forces 
the normal to have a limiting value at the ends. 

According to K1, finite total curvature implies finite topology. But the converse 
is not true, as is evident from the helicoid. The helicoid is topologically a once 
punctured sphere (i.e., it is simply-connected), but since it is periodic and not flat, 
it has infinite total curvature. (See Figure CH for a picture of the helicoid.) 

Considering the fact that the helicoid was discovered in 1776 by Meusnier [47], 
it is surprising that it is at the present time the only known example of a properly 
embedded minimal surface of finite topology but infinite total curvature. 

The authors have recently proved that on a properly embedded minimal surface 
only two annular ends, at most, can have infinite total curvature: all the rest must 
have finite total curvature [33]. This has interesting geometric consequences. It is 
well known [59] that: 

K3) if an end of a properly embedded minimal surface has finite total curva- 
ture, it is asymptotic to either the end of a plane or the end of a catenoid. 
Moreover all these ends are parallel in the sense that their limit normals are 
parallel. 

Specifically, 

K3') each end will eventually be a graph, over the exterior of a compact set in a 
plane (perpendicular to the limit normal at the end), of the form: 

X, + Cox 
u(x,,x7) = cy talnp + ee te(p),  p =lx,.x9)| 


If a = 0, the end is asymptotic to a plane, otherwise it is asymptotic to a catenoid. 

If the end has finite total curvature, it is forced to be annular by K3. If we 
consider properly embedded minimal surfaces with finite topology, all the ends are 
annular by definition. The result mentioned above implies that, at most, two of the 
ends have infinite total curvature. Thus, according to K3, all the other ends look 
like the ends of planes or of catenoids. 

Two important questions to be answered are the following. Given a properly 
embedded minimal surface: 

Q2) If it is simply connected, must it be the plane or the helicoid?; 

Q3) If it is not simply connected but has finite topology, must it have finite total 
curvature? 

The question Q3 is a strong generalization of Q1, as we will now explain. 
Consider first the general case of a complete embedded minimal surface with finite 
total curvature. We know from K1 that it is a compact surface of finite genus, say 
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k, all of whose ends, say it has r of them, are annular. Assuming that our surface is 
not simply connected, each end possesses a distinguished nontrivial homotopy 
class, represented by a closed curve. This curve wraps once around the puncture 
point corresponding to that end. If we look for a length-minimizing curve in this 
homotopy class, we will always find one. The intuitive reason is this: the only thing 
that can happen to prevent this is that a sequence of curves, whose length goes to 
the minimum, will diverge in space. But by K3, this end ultimately looks like the 
end of a plane or of a catenoid, so it turns out that such a diverging sequence of 
loops will have to have unbounded length. It can be shown quite easily that the 
minimum-length curve, certainly a closed geodesic, exists [10, 25]. 

We observe that this closed geodesic is unique. If not, one of two things will 
happen: (i) either there are two closed geodesics in this homotopy class that do not 
intersect and bound an annulus; or (ii) there are two closed geodesics in the class 
that intersect, from which we can choose two subarcs that bound a simply 
connected region. We will apply the Gauss-Bonnet Formula, which states that 


[ Kaa =| k,ds + )) B; + 27y(R) (2.1) 


on any compact subregion R of a Riemannian surface, where x, is the geodesic 
curvature of dR, the B, are the exterior angles at the corners (if any) of OR, and 
y(R) is the Euler characteristic of R. Since K <0 almost everywhere on the 
surface, and our boundaries are geodesics: 


SB, + 2ry(R) <0. (2.2) 


In case (i), there are no boundary angles and the geodesics must bound an 
annulus. Since the Euler characteristic of an annulus is equal to zero, we arrive at 
a contradiction in (2.2). In case (ii), two geodesic arcs form a geodesic 2-gon, 
bounding a disk-type region with Euler characteristic equal to one. Again a 
contradiction arises from (2.2), since the exterior angles are no bigger than 7 in 
absolute value. 

By the same reasoning, it can be shown that closed-geodesic representatives of 
two distinct ends cannot cross. Let us assume for the moment that they do not 
meet tangentially. By removing each of the r distinct ends that they bound, we are 
left with a surface ~“ of genus k, bounded by r closed geodesics. By the Gauss- 
Bonnet Formula (2.1), the total curvature of this surface is 


Qay(.Y) = 2n(2 -2k-71). (2.3) 


Moreover, each closed geodesic bounds an end which is an annulus. 

What is the total curvature of an embedded annular end that is bounded by 
each one of these geodesics? By K3’ each of these ends is ultimately the graph, 
over a plane P orthogonal to the limit normal of the end, of a function that is 
asymptotic to a constant (planar end) or to a: In(p) + constant, where p is 
distance from the origin in P, and a is a real constant. In both cases, for large p, 
the graph C, over the circle of radius p is very close to a circle of radius p, and the 
graph itself for large p is very, very flat. Thus it is easy to believe and not hard to 
prove that the geodesic curvature of C, on the surface is asymptotically equal to 
the geodesic curvature of the circle of radius p in P. But this is +27. Again we 
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apply the Gauss-Bonnet Formula (2.1) to the annulus bounded by the distin- 
guished geodesic and C,,. Using the fact that the Euler characteristic of an annulus 
is zero and taking the limit as p — ©, we get 


[KaA = -2r, (2.4) 


for any embedded end of finite total curvature. Since we have r ends, they 
contribute —27rr to the total curvature. It follows from this and (2.3) that 

K4) on a complete embedded minimal surface of finite total curvature with 
embedded ends of genus k and r ends is exactly 


[Kaa = 2n(2 — 2k — 2r) (2.5) 


(or, equivalently, the degree of the Gauss map is r + k — 1). 

Remember we had to assume that the distinct closed geodesics corresponding to 
the ends did not touch. But what if they do? In general, if two geodesics meet at a 
point and are tangent there, they coincide. Thus if two closed-geodesic boundaries 
for distinct ends touch, they coincide, which implies that the entire surface is the 
union of two annuli, whose intersection is a closed geodesic. This means that the 
surface itself is a complete, embedded, minimal annulus, with total curvature equal 
to —47r, according to 2.4. (In particular, formula K4 is true in general.) One can 
now argue in various ways to conclude that this minimal surface must be the 
catenoid. For example, we invoke a theorem of Osserman [53, 54], which states 
that 

C5) The only complete nonsimply-connected minimal surface with total curva- 
ture —47r is the catenoid. 

Or, we could quote a more recent theorem of Schoen [59]: 

C6) The only complete embedded minimal surface of finite total curvature with 
two ends is the catenoid. 

Or, finally we could appeal to a still more recent theorem of Lopez and Ros 
[43]: 

C7) The only complete nonflat embedded minimal surface with finite total 
curvature and genus zero is the catenoid. 

From this discussion, we see that once one knows that a complete embedded 
minimal annulus has finite total curvature it is the catenoid. In this sense Q3, 
considered as a conjecture, is a strong generalization of QI, the Nitsche Conjec- 
ture. 


2.1. New finite total curvature examples. The reader will notice that no exam- 
ples of complete embedded minimal surfaces of finite topology have been men- 
tioned, other than the plane, the helicoid and the catenoid. Questions Q2 and Q3 
being open at the present time, any other examples would have to have finite total 
curvature. How would one construct such an example? 

Notice that, according to C6 and C7, any example will have to have genus k > 1 
and at least 3 ends (r = 3). 

Using properties K1—K4, combined with the Enneper-Weierstrass representa- 
tion, one can follow this recipe to try to produce a new example: 

S1) Choose a compact Riemann surface & of genus k, removing from it a 
finite number r of points. On &, specify a meromorphic function g and a 
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holomorphic one-form 7. To satisfy K4 make sure that the degree of g is 
k +r-— 1. To satisfy the second part of K3, g should be chosen to have zeros and 
poles at the ends. 

S2) The Enneper-Weierstrass representation will produce a multivalued map of 
& into R*, which is a conformal minimal immersion of & minus the r punctures. 
Make sure the map is regular and complete. This can be done easily by looking at 
the poles and zeros of g and 7. (See Figure EW.) 

In general, the integration of the Enneper-Weierstrass one-forms around a 
nontrivial cycle produces a complex 3-vector, whose real part is what we refer to as 
the period associated to this cycle. Geometrically, the period is a translation in R° 
produced by integration around the cycle. In general the Enneper-Weierstrass 
representation produces a periodic surface in R* with the periods generating all 
the basic translations. In general these periods will be neither zero nor linearly 
independent (and there may be many of them). For example, the catenoid has no 
period corresponding to the cycle represented by the unit circle, while the helicoid 
has a period consisting of a vertical translation (see FigurE CH). Usually the 
periods of an immersion result in a very messy surface in R°, not in general as nice 
as the helicoid. (In Sections 3 and 4, we will discuss periodic minimal surfaces 
constructed by trying to make one, two or three of the periods of an immersion 
both nonzero and linearly independent. 

S3) Make sure that the mapping given by the Enneper-Weierstrass integrals has 
no residues at the punctures (the ends are well-defined) and no periods on any 
cycle on &. This will mean that the Enneper-Weierstrass representation produces 
a finite total curvature minimal surface that has embedded, parallel ends. If the 
ends have distinct logarithmic growth (see K3’), this implies that the surface is 
embedded outside of a sufficiently large compact set of R°. Then: 

S4) Show that the surface is embedded. 

In 1981, C. Costa [12, 13] discovered an example that was conformally the 
square torus punctured in the half-periods. He did this by using elliptic functions 
to produce the Weierstrass analytical data; specifically g = cg’, and the one-form 
7 was g ‘| dz where q is the Weierstrass g-function. The surface was constructed 
to be complete and to be asymptotic, outside of a compact set to the union of a 
vertical catenoid ends and a horizontal plane separating them. (The value of c was 
computed algebraically to satisfy $3.) In particular, Costa’s example was embedded 
outside of a compact set. Therefore, this example satisfies S1, S2 and S3. In 1984, 
the authors [31] were able to prove that this surface was embedded. 

We were subsequently able to construct examples of every positive genus k > 2 
which were complete, embedded and possessed three ends [29]. Soon thereafter we 
proved that each of these surfaces, including the genus-one example, lies in a 
one-parameter family of embedded minimal surfaces of the same topological type 
[30]. In the genus-one case, Costa has recently shown that this family constitutes 
the only genus one complete minimal surfaces with three ends, all of which are 
embedded and parallel [14]. 

These surfaces are defined by meromorphic data on compact Riemann surfaces, 
which data must be integrated to produce the coordinates of the surface, as 
specified in S1 and S82. The residue problem S3 must then be solved. The 
construction uses symmetry to annihilate the residues at the ends and to reduce 
the number of independent periods. For example, the surfaces that generalize 
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IMAGE 1. Embedded finite total curvature examples of genus 3 and 4 with 3 ends. 


Costa’s construction may be produced as follows: 
G1) The underlying Riemann surface is the (k + 1)-fold covering of the sphere 
given by 


wktl = zk( 2? ~ 1), 


punctured at the points (1,0), (— 1,0) and (~, ©), The Gauss map and one-form in 
FIGURE EW are given by 


g=c/w, 1 =(z/w) az, 
where c is a real constant to be determined. The genus of this surface is k and it 
has 3 punctures. The degree of g is k + 2 as required. Also, g takes on the values 
0 or © at the puncture points. 
G2) Using the Enneper-Weierstrass representation (see FIGURE EW), we can 
compute that 


1 
z+1 


X;(z) = constant + c{In —In 


z-—1 


In this way these surfaces are similar to the catenoid; the top and bottom ends are 
of catenoid type. (See K3’.) This gives completeness at the ends at (1,0) and 
(—1,0). At (@,), g is not one-to-one, as it would have to be on a catenoid type 
end. Therefore this end is flat. Observing that the first two components of the 
Enneper-Weierstrass integrand have poles at (2, ©) one can show that the surface 
is complete at this end. The one-form 7 has a zero (order = 2k) on the punctured 
surface only at (0,0), which is the only finite point where g has a pole (of 
order = k). This insures regularity (see FigurE EW). 

G3) The fact that there is so much symmetry simplifies the period computation 
enormously. This surface is constructed so that the image of any arc that projects 
onto the real axis and runs between ends is mapped by a branch of the immersion 
given by the Enneper-Weierstrass representation into a planar geodesic in a 
vertical plane. These planes must make an angle of 7j/(k + 1) with the (x, z)- 
plane, 0 <j <k. The image of a closed curve around any of the ends must 
therefore be invariant under reflection in each of these planes, which implies that 


714 DAVID HOFFMAN AND WILLIAM H. MEEKS, III [October 


the residue at the end is orthogonal to each of these planes. Since there are more 
than one of them, this means that the residue is equal to zero. 

For each k, the surface has k + 1 vertical planes of reflective symmetry. Any 
lift of a segment of the real axis is mapped into one of these planes. Thus we can 
divide the underlying Riemann surface into 2(k + 1) regions, corresponding to the 
2(k + 1) distinct components of lifts of C—R. Furthermore, we can choose one 
closed curve in the z-plane whose lifts generate the homology of Riemann surface. 
Killing the period of the Enneper-Weierstrass integrand on this closed curve will 
kill all the periods on the surface. One can then vary the parameter in the 
definition of the g to kill this period: the proof provides an explicit closed formula 
for the value of c. 

At this point, one knows that the surface exists and is minimal and has finite 
total curvature. 

G4) At this point, we have a finite total curvature minimal surface that is 
embedded outside of a compact set. (The ends are embedded and disjoint.) A 
separate argument must be given to prove that the entire surface is embedded. 
Again symmetry is crucial. The surface can be decomposed into disjoint congruent 
pieces, each of which turns out to be a graph. This means the entire surface is 
embedded. 

The authors also proved the existence of these surfaces by minimax methods 
[32]. This approach gives embeddedness automatically along with existence, but 
says nothing about uniqueness which can be proved by the analytic method. Pitts 
and Rubenstein have independently provided a minimax construction of these 
examples [55]. 


2.2. More finite total curvature examples. The authors and Michael Callahan 
[9] have constructed examples of complete embedded minimal surfaces of finite 
total curvature with four ends (two catenoidal, two flat). In principle, such 
examples exist with every even positive genus. Again, the equation of the underly- 
ing Riemann surface is deduced from symmetry considerations; the Gauss map 
and Weierstrass data are given in terms of roots of rational functions on a 
Riemann surface. In this respect, it is the same as the construction of the 
three-ended examples. However, there are in this construction two free variables 
(corresponding to the positioning of the middle flat ends and the height of the 
axial saddle point) and two independent periods. The proofs of existence and 
embeddedness of these surfaces depends heavily on machine computation. The 
computer is used to find the values of the free variables that annihilate the 
periods. Then, the degree of the period mapping is calculated, again with the aid 
of the computer. Finally, although the surface is symmetric, it is sufficiently 
complicated to make all the known methods for proving embeddedness inapplica- 
ble. The proof depends on a mathematical argument that, in essence, shows that 
the method used to produce the accurate coordinates of the surface, a program 
called MESH, devised and refined by J. Hoffman, is accurate [8]. Even this proof 
of embeddedness requires a machine computation, of a very simple sort, to 
complete it. 

What is the general pattern? Karcher observed that these four-ended surfaces 
could probably be made more complicated by the insertion of a handle through 
their waist. This was carried out by M. Wohlgemuth, with the assistance of E. Boix 
in Amherst (Spring 1989). In this problem, the number of periods and free 
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parameters is three. Numerical and graphical searches were essential to the 
discovery of these surfaces, as was the use of MESH, which had been refined by 
J. Hoffman, working with Boix and Wohlgemuth, to deal with singularities and cuts 
in the domain of integration. The resulting surfaces have genus 3k. This construc- 
tion, combined with recent work of Wohlgemuth on constructing doubly-periodic 
surfaces by handle growing, indicates that there are probably a great number of 
different embedded minimal surfaces of finite total curvature and that if they are 
sufficiently symmetric, handles can be added to make them more complex without 
destroying the symmetry or adding any ends. 

An interesting question is whether this handle adding construction can be 
carried out ad infinitum to create a limit surface > with an isolated singular point 
p, where the attaching handles accumulate. It is known [24] that a stable minimal 
variety or a minimal variety with finite topology cannot have an isolated “interior” 
singularity; i.e., such varieties extend smoothly across the potential singularity. 


3. Minimal surfaces foliated by convex curves in parallel planes 


3.1. Minimal annuli. Since it is a surface of revolution, the catenoid is fibred by 
circles in parallel planes. It is natural to try to classify all minimal annuli that are 
so constructed. We can argue in a straightforward fashion that a complete minimal 
annulus that is foliated by circles must in fact be the catenoid. Since any compact 
piece R bounded by parallel circles is an annulus (Euler characterize 0) each of 
whose boundary circles has curvature 27 in absolute value, we may use the 
Gauss-Bonnet Formula (2.1) to conclude that 


[Kda 


<4. 


Thus the entire surface is an embedded complete annulus with finite total 
curvature (less than or equal to 477 in absolute value). We can use C5 to conclude 
that this annulus is the catenoid. 

In the next section, we will describe the classification of compact minimal 
annuli fibred by circles in parallel planes. First, we will describe some interesting 
results, old and new, about minimal annuli foliated by convex curves in parallel 
planes. 


Fic. 2. Al) Minimal annuli bounded by convex curves in parallel planes are fibred by convex curves. 


3.2. Schiffman’s First and Second Theorems and extensions. Schiffman [57] 
proved two beautiful theorems on the geometry of a minimally immersed annulus 
A bounded by a pair [T° of smooth convex Jordan curves in parallel planes. 
Schiffman’s first theorem states that 

Al) A intersects each plane P, parallel to and between these planes, in a 
convex curve. In particular A is embedded. 

Schiffman’s second theorem states that 

A2) when the convex curves [ are circles, then AM P is also a circle. 
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Thus, when [ consists of circles, A is foliated by circles. We will now give a 
proof of A1. (For a proof of A2 different from Schiffman’s theorem see [46].) 

Assume that the boundary curves of A are smooth convex curves in parallel 
planes. By uniformization, any annulus A is conformally equivalent to some 
annulus A(r) = {z € C|1 < |z| < r} for some unique r = r(A) > 1. After rotation 
and homothety, we may assume that A is the image of a conformal minimal 
immersion X: A(r) > R° with X(|z|= 1) C Py and X(\z| =r) C Piy,), where 
P, = {x, = t}. The third coordinate function X,: A(r) > R is harmonic, and In|z| 
is a harmonic function with the same boundary values. Note that both In|z| and 
X,(z) are harmonic as functions on A(r), because we are in conformal coordi- 
nates. We can therefore apply the maximum principle for harmonic functions to 
conclude that X; = In|z|. Since VX, is never zero on A(r), it follows that the 
tangent plane to A is never horizontal. Thus g(z), the stereographic projection of 
the Gauss map, is never 0 or ». This implies that @ = arg g is well-defined, 
mod 277. Notice that the horizontal projection of the normal to A at a point 
p © P,QA is orthogonal to the plane curve P, A at p and it is never zero. 
Hence ¢ is also the angle of the normal to P, A. If we parametrize A(r) by 
polar coordinates (r, 6), then Pi.) A is parametrized by X(ce’’), 0 < @ < 27. 
Observe that the convexity of Pj.) MA is equivalent to (0/00)¢(ce”’) never 
changing sign. But g is holomorphic in z, and for any holomorphic function F, 
(0 /00\arg F) is harmonic. Since both boundary curves of A have the same 
orientation, the harmonic function (0/00)¢ has a fixed sign on 0A. By the 
maximum principle, it is never zero in the interior, and we are done. 

Recently Meeks and White have proven a theorem that augments Schiffman’s 
first theorem [46]. The proof of this theorem depends on stability properties of 
compact minimal surfaces. 

We shall call a compact minimal surface M stable if, with respect to any 
nontrivial normal variation fixing the boundary, the second derivative of area is 
positive. If the second derivative of area is negative for some variation, then M is 
called unstable. If M is neither stable nor unstable, we will call it almost-stable. 

A3) If T is a pair of smooth, convex Jordan curves in distinct parallel planes, 
then exactly one of the following holds: 


1. [' is not the boundary of any connected compact minimal surface, with or 
without branch points; 

2. IT is the boundary of exactly one minimal annulus and this annulus is 
almost-stable. In this case, I’ bounds on other connected compact branched 
minimal surface; 

3. [ is the boundary of exactly two minimal annuli; one stable and one 
unstable. 


What other compact minimal surfaces may have I as their boundary in case 3 is 
unknown. That is: 

Q4) Given two convex curves in parallel planes, can they bound a minimal 
surface of genus greater than zero? 

In certain cases it is known that every connected branched minimal surface with 
boundary I described in A3 is actually an annulus. For example, R. Schoen [59] 
proved that when [ is contained in parallel horizontal planes and is invariant 
under reflection in two vertical planes, then every branched minimal surface with 
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Fic. 3. It is not known whether convex curves in parallel planes can bound a surface of genus greater 
than zero. 


boundary I is actually an annulus. (This was used in the proof of C6.) Thus, in 
certain cases, A3 shows that I is the boundary of 0, 1, or 2 minimal annuli and no 
other branched minimal surfaces. 

As a simple corollary to A3, we can prove Schiffman’s Symmetry Theorem. 

A4) A minimal annulus bounded by convex curves in parallel planes has the 
symmetries of its boundary. 

Suppose A is such an annulus and S: R° > R° is a symmetry of the boundary 
of A. Then S(A) is another minimal annulus with the same boundary and S(A) 
has the same stability properties as A. Hence by A3, SCA) must equal A, which 
proves A4. Statement A4 was proved by Schiffman using a different method [58]. 

The proof of A3 uses some new techniques that come from global analysis. 
Consider the space .4 of smooth embedded minimal annuli in R? with boundary 
curves in two fixed parallel planes. Let p: &Z-— @ be the associated projection 
onto the space of pairs of smooth Jordan curves that are contained in the parallel 
planes. A basic result (see [4, 6, 63, 64, 65,]) is that .@ and @ are smooth infinite 
dimensional manifolds and p is a proper Fredholm map of index zero. In this 
setting one can apply the classical results of global analysis, such as the Sard-Smale 
Transversality Theorem [62], to analyze the geometry of minimal annuli with 
boundary [ € @. 

In order to explain how these techniques might apply to prove A3, we examine a 
special case. Suppose I is a pair of coaxial circles in parallel planes P,, P, and 4 
is the corresponding space of minimal annuli for P,, P,. Since a minimal annulus 
A with dA = IT must have all the symmetry of its boundary, A must be a surface of 
revolution. This means A must be a catenoid, but notice that this does not mean 
A is the only catenoid with boundary I’. One can check directly from the equation 
of a catenoid (1.5) that I’ is the boundary of n catenoids, where n = 0, 1, or 2. If 
n = 1, then the tinique catenoid with boundary I is almost-stable. If m = 2, then I 
is the boundary of one stable and one unstable catenoid. 

Now shrink the circles in T through a one-parameter family ['(¢), 0 < t < 1, of 
coaxial circles in P, U P, so that [(0) = I and the circles in (1) are so small that 
['(1) does not bound a catenoid. Consider I(t) to be a path in @ and we get the 
picture described in FiGure 4. 


Ss 
dM 
p as 
Ss 
€ 0 1 
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That the above picture generalizes to a pair of convex curves, [ Cc P, U P, is 
used to prove A3. One shrinks [ through a one-parameter family I(t), 0 < t < 1, 
of smaller and smaller convex curves in P, U P, so that ['(0) = TF and P'(1) bounds 
no minimal annulus. Then A = p~‘(T[0,1]) <4 is either empty, consists of a 
unique minimal annulus with boundary I’, or is a compact arc. In the last case, the 
projection of this arc onto I[0,1] has one fold point > corresponding to an 
almost-stable minimal annulus and A — {3} consists of two components, each 
consisting entirely of stable or unstable annulli. 


3.3. Minimal surfaces foliated by circles and lines in parallel planes: Riemann’s 
examples. Recall that the catenoid is a complete minimal surface that is foliated 
by circles in parallel planes. By A2, a minimal annulus with circle boundaries on 
parallel planes is foliated by circles in parallel planes. Given two circles, in parallel 
planes, whose centers are sufficiently close together, it is possible to find a minimal 
annulus spanning them’. Thus, there exist compact minimal annuli, other than 
portions of a catenoid, that are foliated by circles in parallel planes. It is a 
surprising fact that these circle-foliated minimal annuli are subsets of complete, 
embedded, periodic minimal surfaces. (This was proved by Enneper [19]. See also 
[51], Sections 94-96 and [35].) These minimal surfaces, foliated by circles and lines 
in parallel planes, were discovered by Riemann [56], who showed that they could 
be described in terms of elliptic functions on rectangular tori. We will now give a 
geometric argument that determines the analytic description of these circle-folia- 
ted surfaces. Along the way we will prove that they are subsets of complete, 
embedded, periodic minimal surfaces. 

First we recall the Schwarz reflection principle for minimal surfaces ((60], also 
see [41] for the proof). This principle states that 

R1) A minimal surface M with a line in its boundary may be extended to a 
larger minimal surface by reflection across that line, and 

R2) A minimal surface M, which meets a plane orthogonally, may be extended 
to a larger minimal surface by reflection in that plane. 

By reflection in a line, we mean rotation by 7 about the line. Minimal surfaces 
are analytic surfaces, as can be deduced from the Enneper-Weierstrass representa- 
tion or from the fact that their coordinate functions are harmonic. It follows from 
this fact and R1 and R2 that if a minimal surface contains a line (resp., a planar 
geodesic) in its interior that surface is invariant under rotation about that line 
(resp. reflection in that plane). The Schwarz Reflection Principles for minimal 
surfaces follow from the Reflection Principle for harmonic and analytic functions. 

In 1951 H. Lewy [42] gave an important generalization of the Schwarz reflection 
principle. He proved that 

R3) a minimal surface with analytic boundary curves, is contained in the 
interior of a larger minimal surface. 

Suppose now that A is a compact minimal annulus that is foliated by circles in 
parallel planes. We first show that A is contained in a larger compact annulus A 


7You can create such a surface by fashioning two circles out of wire and dipping them in a soap 
solution. For advice about details and more discussion of soap films as minimal surfaces, see [7, 15, 27, 
51]. The general problem of finding a minimal surface with a given contour as boundary is called 
Plateau’s Problem after the Belgian physicist J. A. F. Plateau. This problem is a central one in the study 
of minimal surfaces, which we only mention in passing in this article. 
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foliated by circles in parallel planes. Since the boundary of A is analytic and A is 
everywhere transverse to horizontal planes, Lewy’s theorem shows that A Is 
contained in the interior of another compact minimal annulus A, whose boundary 
consists of analytic curves in planes parallel to the planes containing 0A. Given an 
analytic surface and a family of parallel planes, parametrized by their distance 
from a fixed plane, that intersect the surface transversally, the intersection of each 
plane with the surface is an analytic curve. The family of such curves depends 
analytically on the parameter. In the case we are considering, the analytic one- 
parameter family consists of circles for some nonempty closed interval correspond- 
ing to the original annulus A. Hence A must be fibred by circles in parallel planes. 
In particular, A is bounded by circles. 

Let F be the maximal open annulus that contains A and is fibred by closed 
curves in parallel planes. It follows that F is fibred by circles. Any pair of these 
circles is symmetric with respect to some vertical plane P. Either that plane is 
unique, or that pair of circles has centers lying on a vertical line, in which case 
every rotation about that axis is a symmetry. From A4, it follows that the annulus 
bounded by these circles either possesses a unique plane of vertical symmetry or is 
a surface of rotation. In the latter case, we can conclude immediately that we have 
a piece of the catenoid and that F is a catenoid. In the former case, we can 
conclude that every compact subannulus of F bounded by circles has a unique 
fixed plane of symmetry P and hence F has the same property. 

PF consists of two arcs 0,,0,, each a planar geodesic line of curvature, 
meeting each circle in F orthogonally at antipodal points. It follows that the 
tangent vectors to the circles along these arcs are principal directions. Without loss 
of generality, we may presume that P is the (x, z)-plane. Since the surface is 
minimal, the principal curvatures at each point are equal in absolute value, with 
opposite signs. But the curvature vectors of the circles are horizontal and never 
zero and the normal vector to F is never vertical. Thus it follows that the curvature 
vectors to a, and a,, which are vectors in the plane P, are nonzero and oppositely 
oriented. In particular, each go; is a strictly convex curve in P. It is natural to 
parametrize a; by t, letting o,(t) = 0, N {z = t}. Let r(t) = Jlo,(t) — o(t)| be the 
radius of the circle F M {z = th. 


Fic. 5. A minimal annulus foliated by circles in parallel planes. 


Observe that by the Gauss-Bonnet formula (2.1) the total curvature of every 
compact subannulus bounded by circles is not greater than 47 in absolute value. 
Thus the total curvature of F is at most 47 in absolute value. If F is regular and 
complete, then by C5 it must be a catenoid. 

If F is not complete, then one or both of the convex curves o; must have a finite 
endpoint. If they both terminate at distinct endpoints at the same level {z = to}, 
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then F is bounded by a circle at that level. But by R3, this means that F lies in the 
interior of a larger minimal surface containing that circle in its interior. Such a 
surface is still fibred by horizontal curves near the circle at level {z = tp}, and 
hence, as observed above, by circles. This contradicts the assumption that F is the 
maximal such minimal annulus. 


Fic. 6. The maximal annulus F. 


Suppose that o,(t) and o,(t) both terminate at the same point at height fp. 
Without loss of generality, we may assume that F lies in the halfspace {z < tp}. We 
will show that this is impossible by computing the flux, in the direction of the 
vertical unit vector field, for the minimal annulus FN {t, < z < t,}, where ¢, < ty 
and t, is chosen large enough so that FM {z =t,} # ©. The flux through the 
boundary circle at level t, is nonzero. The flux through the boundary circle at level 
t, is bounded above in absolute value by 27r(t,), which goes to zero as t, — fo. 
According to the flux formula (1.3) these two quantities must be equal in absolute 
value, a contradiction. 

We may conclude that if one of the curves o,(t) terminates in a point at level fo, 
then the other o,(t) diverges to infinity as t > t). This means that F has a 
boundary curve at height ¢, consisting of a horizontal line. Suppose that this is the 
only component of dF. Then, according to R2, we may extend F by rotation about 
this line, producing a complete, embedded genus-0 minimal surface with 3 ends. 
We observed above that F had total curvature at most 47 in absolute value, so 
this extended surface has total curvature of at most 877 in absolute value. By C7, 
we conclude that this surface must be the catenoid, which is impossible because it 
contains a line (and has three ends!). Thus there must be another boundary 
component of F,, and the arguments above imply that it, too, is a straight line. 

Finally, we consider the case that o, and o, have no endpoints at all. In this 
case, F is a complete,embedded minimal annulus with total curvature at most 477, 
in absolute value, and so must be the catenoid. We have thus determined that 
either F is the catenoid or it is an annulus bounded by two lines L,, L,, each 
orthogonal to the plane of symmetry P, lying in parallel planes. 

From now on, we will assume F is not the catenoid. The function r(t) is thus 
defined on an interval and diverges to infinity at the endpoints. We may therefore 
find two distinct values of t say t, <t,, where r(t,) =r(t,). The corresponding 
circles at heights ¢, and ¢t, are invariant under rotation about a line / at height 
(t, + t,)/2 which is perpendicular to P and placed halfway between their centers. 

By A4, the annulus F/O {t, < z < t,} is invariant under rotation about /, which 
implies that F has the same property. This rotation interchanges o, and o,, so 
they are congruent. In particular, each one has one endpoint on one of the 
lines L,. 
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By R1, F can be extended by rotations, R,, R, across line boundaries L,, L,, 
respectively. Let G be the group of symmetries generated by R, and R, and let & 
denote the G-orbit of F. Note that F is invariant under the translation T = R,° R5 
and &/T is topologically a compact torus # punctured in two points p,, p>, 
corresponding to the points at infinity on the two lines in F U RF) = A/T. 

To determine the conformal type of &/T, we may use the fact that it is highly 
symmetric. Reflection through P produces an orientation-reversing automorphism 
that commutes with the orientation-reversing automorphisms corresponding to 
rotation about L, or L,. Straightforward arguments can be given to show that 
R/T is conformally diffeomorphic to C/A(A), where A(A) = {mA + niln, m € Z}, 
for some A > 0, Le., it is a rectangular torus. Moreover, reflection in P will 
correspond to complex conjugation (fixing the curves t and t+ i/2, 0 <t<A, 
which correspond to a, and a) and rotation about L, corresponds to reflection in 
the imaginary axis. We may translate so that the puncture, P,, corresponding to 
the end into which L, diverges, is at 0 (assuming L, crosses a;.) This will force 
rotation about L, to correspond to reflection in the line A/2+ it,0 <t < 1. Since 
the line L, crosses o, but not o,, it must be that the other end, p,, is at the point 
w3, =(A + i)/2. 


planar geodesics o 


ircle 


Q 


a rotation 


© = intersection of 
line and sigma 


O = flat end 


Fic. 7. ‘Riemann’s example parametrized by a rectangular torus. 


Thus we have determined the underlying Riemann surface structure of &/T. 
Furthermore, we know that the total curvature of F, computed as the limit of 
minimal annuli via the Gauss-Bonnet formula must be — 477. Hence the total 
curvature of #/T is —87, which means that the Gauss map has degree 2. 
Furthermore, the Gauss mapping is vertical at 0 and w3; without loss of generality 
we may assume that N(0) = (0,0, 1) and N(w,) = (0,0, — 1). In terms of g, the 
stereographic projection of N, g(0) = » and g(w,) = 0. Since the ends at 0 and W3 
are flat, not catenoidal, the pole at 0 and the zero at w; must have orders at least 
two. But the degree of g is 2, which implies that g has a pole of order 2 at 0, a 
zero of order 2 at w3, and no other zeroes or poles. From the basic theory of 


1990] MINIMAL SURFACES BASED ON THE CATENOID 723 


elliptic functions, we can conclude that g = clg(z) — »(w3)], where g is the 
Weierstrass g-function. Moreover, the third coordinate function x3 is constant on 
the lines ti, and A/2 + ti, 0 < t < 1. Again we can argue that (d/dz)(x3) must be 
equal to a constant. By using the Enneper-Weierstrass representation formula, we 
see that 1 = dz/g. Rotation about the line / fixes two points of 4&/T, and 
interchanges the lines L, and L,. It follows that we may choose these points to be 
w,/2 and 3w,/2. At these points the normal to #/T is horizontal, so g must be 
unitary. We adjust c, if necessary, to make this happen. We omit the simple 
argument which shows that c may be chosen to be real. 


4. Embedded periodic minimal surfaces and the period problem 


The surfaces of Riemann & are foliated by circles and straight lines and have 
the appearance of an infinite staircase. In fact, after normalizing # to have the 
(x,z)-plane as a symmetry plane and the horizontal lines at integer heights, A can 
be approximated by the union of parallel planes at integer heights joined by 
ascending tubes. Thus we see that & has an infinite number of annular ends, each 
of which is asymptotic to a horizontal plane at an integer height. In particular, # 
has genus zero and an infinite number of ends. Moreover there are infinitely many 
different such surfaces, one for each rectangular torus. 

M. Callahan and the authors [11] have found new examples of periodic minimal 
surfaces .@ that have much in common with &. Quablitatively, they may be 
described as an infinite number of planes connected not by genus-zero tubes but 
by genus-k minimal surfaces, which approximate compact pieces of the surfaces 
described in G1—G4. (See IMAGE 1.) These surfaces .4 are invariant under a 
translation T and .4/T has genus 2k + 1. Recently, new periodic examples ./ 
have been found by Hoffman and Wohlgemuth [34], with the assistance of E. Boix. 
They resemble the surfaces .4 with a handle, parallel to the planes, grown 
through the waist. The quotient by translation of these surfaces can have genus 
3k + 1 for any k > 1. 

In the construction of the Riemann examples #, there was no need to 
annihiliate a period in order to produce a well-defined surface; the surfaces were 
determined by geometric considerations and lie in a one-parameter family. In the 
Enneper-Weierstrass representation, the only period was the one that generated 
the desired translation. For the examples .4, the situation is quite different. The 
examples of a fixed genus are isolated. The Enneper-Weierstrass representation, 
derived from geometric considerations, has a free variable that must be adjusted to 
annihilate a period of the mapping. In [11], it is proved that this is possible. For the 
examples ./, the situation is more complicated still; there are two unwanted 
periods that must be annihilated and two free variables in the Enneper- Weierstrass 
representation. The periods are calculated on a computer. 

Graphics plays an important role in understanding the geometry of these 
complex surfaces as well as in checking that the period computations are accurate. 
The proof of existence of the surfaces -/ requires a degree theory argument, 
which in turn is based on a machine computation. This means that the existence 
proof (analog of S1-S3) must be done genus-by-genus. 

This situation is unsatisfactory. On the one hand, it now appears that we can 
make these surfaces more and more complicated by growing more and more 
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IMAcE 3. The Riemann example (right) and half of the Riemann example (cut by the plane P). 


IMAGE 4. The periodic examples of [11] and their more complex cousins, produced by handle-adding, 
in [34]. 
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handles and solving more and more complicated period problems by computer. 
Current experience with deriving the equations and using graphics tools to help 
solve the period problem indicates that these more complex surfaces are there. At 
present we do not understand the general behavior of the period mapping, nor do 
we have any general method of proving that the period problem can be solved in 
the setting of periodic minimal surfaces with a finite total curvature quotient, or 
even in the finite total curvature setting of Section 2. 

Computation and graphics have indicated that these constructions can be 
carried out in situations where, beforehand, it would have been very hard to even 
guess that the surfaces existed. However, a solution to the problem of periods, 
even for a restricted class of examples, is a theoretical problem that is far from 
being solved. 

Fortunately, we do know a good deal about the qualitative behavior of properly 
embedded periodic minimal surfaces with more than one topological end. If the 
symmetry group is not discrete it is the catenoid. Otherwise, [10] 

P1) A properly embedded periodic minimal surface M with more than one 
topological end and discrete symmetry group: 


i) has an infinite number of ends; 
ii) is invariant under a screw motion S; 
iii) all the annular ends of M are flat ends; 
iv) the total curvature of M/S is finite if and only if it has finite topology. 


(A screw motion is the composition of a translation and a rotation along an axis 
of the translation.) It is thus the case that the analog of Q3 has an affirmative 
answer in this context. See also P3 below. 

As a relatively easy consequence of these and companion results, it can be 
shown [10] that: 

T1) A properly embedded minimal surface that is doubly-periodic must neces- 
sarily have one topological end. 

Certainly, properly embedded triply-periodic minimal surfaces have one topo- 
logical end. There are also singly-periodic examples with one end, Scherk’s 
one-periodic surface for example (see IMAGE 5). Property P1 iv) generalizes to a 
properly embedded periodic minimal surface M invariant under a discrete group 
G, in the case M/G has finite topology [44, 45]: 

P2) Suppose M is a properly embedded minimal surface in a nonsimply-con- 
nected flat three-manifold. Then M has finite topology if and only if it has finite 
total curvature. 

P3) If M is a complete minimally immersed surface of finite total curvature in a 
flat three-manifold, then M is conformally diffeomorphic to a closed Riemann 
surface M punctured in a finite number of points. Furthermore, M can be defined 
analytically in terms of two meromorphic one-forms on M. 

These results are obtained by studying the asymptotic geometry of the quotient 
surface in the flat three-manifold R°/G, and were motivated by earlier construc- 
tions of Karcher [36] using the analytic technique described in P3. 

All known examples, M, of properly embedded minimal surfaces in R° are 
actually invariant under a discrete group G of symmetries that act freely on R°, 
and possess the property that the quotient surface M/G has finite total curvature. 
(When M itself has finite total curvature, the group G is the trivial group. In this 
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case the representation is described in K1—-K4.) It therefore follows from P3 that 
all known M are determined by a generalized Enneper-Weierstrass representa- 
tion, using two meromorphic one-forms on a closed Riemann surface. We believe 
that there exist properly embedded minimal surfaces in R° that cannot be defined 
in terms of meromorphic forms on a closed Riemann surface but we lack a method 
for constructing them. 

As we have observed in Q1, it is unknown whether or not the plane and the 
helicoid are the only properly embedded simply connected minimal surfaces in R?. 
However, using P3 and P4, it has been proved that the helicoid and the plane are 
the only properly embedded simply connected minimal surfaces in R° with an 
infinite symmetry group [44]. 


5. Conclusion 


In this article, we have tried to give a survey of some recent progress in the 
theory of minimal surfaces, while at the same time illustrating some of the 
underlying techniques and stressing the connection with the classical point of view. 
We wish to conclude by discussing briefly two topics—one topological, the other 
structural. 


5.1. The topology of proper periodic minimal embeddings in R°. In the previ- 
ous section, properly embedded minimal surfaces with an infinite symmetry group 
and more than one end were discussed. Except for the catenoid, they all had an 
infinite number of ends and were singly-periodic with a discrete symmetry group. 
Any other periodic minimal surface has one topological end and, of course, infinite 
genus. 

From the viewpoint of a topologist, it is reasonable to ask whether two properly 
embedded minimal surfaces of infinite genus and one end are ambiently isotopic. 
In other words, can one such surface be deformed through embedded surfaces to 
the other surface by a one-parameter family of diffeomorphisms of R°*? The 
answer to this question is yes [23]. 

T2) If M,, M, are two properly embedded minimal surfaces of infinite genus 
and one end, then there is a homeomorphism h: R*® — R° such that h(M,) = M). 

Even for particular examples of minimal surfaces satisfying the hypotheses of 
the above theorem, it is not obvious that the surfaces are ambiently isotopic. For 
example, consider Scherk’s singly-periodic and doubly-periodic minimal surfaces in 
IMAGE 5. Each of these surfaces has infinite genus and one end, but it is difficult to 
imagine a homeomorphism of R° taking one of these surfaces to the other. 


5.2. Periodic interfaces in materials as a source of new minimal surfaces. In 
compound materials, bonded polymers for example, equilibrium structures are 
formed, which tend to minimize the area of the interface between the materials. It 
is therefore not surprising that minimal surfaces and surfaces of constant mean 
curvature are reasonable models of the resulting interface. More surprising is the 
fact that in recent years, new periodic embedded minimal and constant mean 
curvature interfaces have been discovered by material scientists [1, 2, 20, 21, 58, 
61], and these discoveries have led to the discoveries of other new surfaces and to 
interesting theoretical work and collaboration [3, 38, 50]. 

We sketch briefly just one instance of this new input into the subject. In [21], 
Fischer and Koch, crystallographers, took a systematic approach to the construc- 
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IMAGE 5. Scherk’s singly and doubly-periodic surfaces. 


IMAGE 6. Singly-periodic examples related to triply-periodic surfaces with noncubic symmetry. 


tion of triply periodic embedded minimal surfaces. They studied the subgroups of 
the crystallographic groups of R°, looking for those that possessed a fundamental 
domain that contained extremal polygons, which were reasonable candidates for 
boundaries of fundamental pieces of triply-periodic minimal surfaces. One of the 
new surfaces they discovered is illustrated in IMAGE 6. 

On viewing a computer image of this surface, produced by O. Wohlrahb in 
Bonn, it was observed [28] that a similar construction could be used to create 
families of singly-periodic minimal surfaces, with flat ends (like Scherk’s single- 
periodic surface). The quotient of these new examples, by translation, are confor- 
mally rhombic tori. It is curious to note that the Enneper-Weierstrass representa- 
tion for these surfaces was determined by using the images of the surface as a 
guide, a complete reversal of the procedures described in the previous two 
sections. 
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Curvature in the Eighties 


ROBERT OSSERMAN, Stanford University 


ROBERT OSSERMAN wrote a Ph.D. thesis on Riemann surfaces under the 
direction of Lars V. Ahlfors at Harvard University. He gradually moved 
from geometric function theory to minimal surfaces, differential geometry, 
isoperimetric inequalities, and some aspects of partial differential equations 
and ergodic theory. He has been at Stanford University since 1955 and will 
be spending half time as Deputy Director of MSRI in Berkeley for a period 
of three years, starting September 1, 1990. 


The notion of curvature is one of the central concepts of differential geometry; 
one could argue that it is the central one, distinguishing the geometrical core of 
the subject from those aspects that are analytic, algebraic, or topological. In the 
words of Marcel Berger [1, p. 9], curvature is “the N° 1 Riemannian invariant and 
the most natural. Gauss and then Riemann saw it instantly.” 

Curvature also plays a key role in physics. The magnitude of a force required to 
move an object at constant speed along a curved path is, according to Newton’s 
laws, a constant multiple of the curvature of the trajectory. The motion of a body 
in a gravitational field is determined, according to Einstein, by the curvature of 
space-time. All sorts of shapes, from soap bubbles to red blood cells, seem to be 
determined by various curvatures. 

Given the vastness of the subject and the limited space available, I have 
restricted myself to some modest goals. First, to present a few topics that are easily 
accessible to a reader unfamiliar with the field. Second, to concentrate on results 
from the past ten years, to demonstrate the current vitality of the subject. And 
third, to give some feeling for the various forms of curvatures, both intrinsic and 
extrinsic. 

The results we discuss are mainly of two types: inequalities involving integrals of 
curvatures, and implications of one or another type of curvature being constant. A 
huge subject that we shall barely touch on is that of consequences of curvature 
having a given sign or being constrained in other ways. 

On the premise that anything published before 1980 can now be considered 
classic, we shall give explicit references chiefly to recent books and articles, where 
many further references may be found. 

This article is divided into five parts. 


Some background definitions and terminology. 
Curvature inequalities for curves and surfaces. 
Surfaces of constant mean curvature. 

Higher dimensions. 

Related results. 


WRWNP 


The first three sections concentrate on curves and surfaces in euclidean three- 
space, with a particular focus on recent work of Antonio Ros. A famous theorem 
of Alexandrov states that “all soap bubbles are spheres”. In other words, soap 
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bubbles, suitably idealized, must have constant mean curvature, and Alexandrov 
showed that the only compact surfaces of constant mean curvature, embedded in 
R°, are spheres. A new approach to the problem was initiated by Ros [1], and a 
subsequent paper of Montiel and Ros [1] contains a beautiful and elementary 
proof of Alexandrov’s Theorem. It also contains an equally neat proof of a strong 
generalization to higher dimensions: the constancy of any of a whole series of 
different curvatures implies, for a hypersurface in R4, that it must be a hypersphere. 
A first case, for which partial results had been obtained by Cheng and Yau [1], was 
settled by Ros in his first paper. That was followed by different proofs of the 
general theorem by Ros [2] and Korevaar [1]. Montiel and Ros’ proof is given in 
section 4. 

Section 5 includes some further results on hypersurfaces of euclidean space and 
of spheres, as well as some purely Riemannian matters. We describe a recent 
breakthrough of Szab6 [1] on a nearly fifty-year-old conjecture of Lichnerowicz, 
but do not go into the details of the proof. 

I would like to thank particularly Manfredo do Carmo, Jerry Kazdan, Ulrich 
Pinkall, and Harold Rosenberg, for pointing me in useful directions as I was 
preparing this paper. 


1. Background 


For a plane curve one has the familiar definition of curvature: 


da ' 
k= (1) 


where a is the angle made by the tangent vector to the curve (in the direction of 
travel) with a fixed direction, say the positive x-axis, and s measures arc length 
along the curve. 

Note that curvature is assigned not only a magnitude, measuring the rate of 
deviation from “straight-aheadness,” but also a sign, indicating the direction of 
deviation: positive if we veer off to the left, negative to the right. 

For a curve in space, we have no way of attributing a sign to curvature. Again 
using the parameter s of arc length along the curve, and denoting by x the position 
vector on the curve, we have the unit tangent vector 


dx 
T= — 
ds 
and the curvature k defined by 
dT d*x , 
tds} | ds? | (2) 


If the curve happens to lie in a plane, then the two definitions of curvature agree, 
up to sign: 


k = |r|. (3) 
Associated to a surface S§ in R° are a number of curvatures: 
the principal curvatures k,, k 5, (4) 
k, +k, 
mean curvature H = <= (5) 


Gauss curvature K = kk. (6) 
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As in the case of plane curves, the signs of these curvatures play an important 
role. For all except K, the sign depends on the orientation of the surface S, which 
in turn is determined locally by a choice of unit normal N. The intersection of S 
with a normal plane is a plane curve whose curvature, if not zero, is positive when 
it lies locally on the same side of the tangent line as N, and negative in the 
opposite case. The principal curvatures k,,k, are the maximum and minimum of 
the curvatures obtained in this way over all normal planes. Note the following basic 
facts concerning the curvatures at a point p of S: 


K <0 #k, and k, have opposite signs 
= § crosses its tangent plane in any neighborhood of p, (a saddle point); 
(7) 
K =0 @ either (or both) k,, k, is zero (8) 
K > 0 = S lies on one side of its tangent plane at p; 
the same side as N if k,,k, > 0, the opposite side if k,,k, < 0. (9) 


An important characterization of k,,k,, K, comes from the “Gauss map”: the 
map 


g:S>> 
where > is the unit sphere, defined by 


g: p> N(p). (10) 
Then 


Area g(U) 
K| = |det(d = lim ————— 
IK| = |det(dg,)| = lim 


(11) 
where U is a neighborhood of p and the limit is taken as the neighborhood 
contracts down to the point p. If K #0, then the sign of K is positive if 
orientation is preserved, and negative if orientation is reversed. 

This approach to curvature is the one taken by Gauss in his fundamental paper 
on surfaces. (See Dombrowski [1].) The names “Gauss map” and “Gauss curva- 
ture” are not really justified, however, since Rodrigues [1] not only introduced and 
studied both of them considerably before Gauss, but made one very important 
observation that escaped Gauss’ notice: the differential of the “Gauss map” is 
diagonalized by the principal curvature directions. Specifically, if x(t) defines a 
curve C on the surface and M(t) is the unit normal along C, then N’(t) = Ax'(t) 
for some real A if and only if the plane spanned by x(t) and M(t) intersects S ina 
curve whose curvature is k, or k,. The corresponding tangent direction x'(t) is 
then called a principal curvature direction. Said differently, if u,,u, are local 
parameters near a point p of S such that at p, they are in the principal curvature 
directions, then 


ON Ox 
Ou, 7 ; Ou, 

(12) 
ON Ox 
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Note that an immediate consequence is that 
ON ON Ox OX 
— X —— =k,k,-— X ——. (13) 
du, du, du, du, 
Since the area of the portion of S corresponding to a domain D in the u,, u,-plane 
is given by 


du, du, (14) 


ON ON 
_ du, du, (15) 


it follows from (11) and (13) that |K| = |k,k,|. It also follows from (13) that the 
orientation is preserved or reversed according as k,k, is positive or negative. Thus 
the expression (6) for K as k,k, is equivalent to the description following (11) in 
terms of the Gauss map. 

A final note regarding orientation. Given any local parameters u,,u, on a 
surface in §, one has an induced orientation on the surface from the standard 
orientation in the u,u,-plane. That induced orientation on S corresponds to the 
choice of unit normal: 


Ox Ox 
du, but, 

N= T ox] vs) 
du, du, 


These facts and formulas are all that is needed in order to follow the arguments 
in sections 2 and 3 below. Section 4 involves some generalizations to higher 
dimensions. 

First, if S is a hypersurface in R”, p a point of S, N a unit normal at p, then for 
any curve x(s) on S§ with x(0) = 0, if s is the arclength parameter, then the 
curvature vector x"(0) has a component in the direction N that depends only on 
the unit tangent vector T = x’(Q). In particular, if the curve C is the intersection of 
S with the plane through p spanned by WN and a given unit tangent vector T, then 
the quantity obtained in the above fashion is just the ordinary curvature of the 
plane curve C. As the curve x(s) varies, we obtain a set of values 


x"(s)-N=K(T) (17) 


called the normal curvatures of S at p. The right-hand side of (17) is the restriction 
to the unit sphere of a quadratic form Q on the tangent space. It follows that there 
exists an orthonormal basis e,,...e,_, of the tangent space diagonalizing Q. The 
directions e,,...,e@, , are called the principal curvature directions at p, and the 


corresponding quantities 
ky =k(e1),---, Kn = K(€n_1) (18) 


are the principal curvatures of S at p. 
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Just as in R° one can consider the Gauss map 
g: p> N(p), (19) 
whose differential 
dg,:x'(t)>N(t), (x(t) =p) (20) 


again satisfies Rodrigues’ equations 


dg ,(e;) = —k;e;, i=1,...,n—1. (21) 
We have the mean curvature 
1 
H = ki to +h) = ~ 7, (4g,) (22) 


and the Gauss-Kronecker curvature 
ky ++: ky, = (-1)” “det(dg,), (23) 


but we also have the other symmetric functions of the principal curvatures, called 
the higher-order mean curvatures. We denote by H, the jth-order mean curvature, 
normalized so that 


Thus, H, = H, the mean curvature, and H,,_, is the Gauss-Kronecker curvature. 
For n = 3, that is all there is, but in higher dimensions, there are intermediate 
ones. Among those, H, plays a special role. Except for a numerical constant, H, 
equals the scalar curvature, an important Riemannian invariant. Before describing 
it, we note that the remainder of the background material in this section is only 
needed for certain parts of the discussion in section 5 below. 

We now move from the special case of a hypersurface to the general case of an 
m-dimensional submanifold M of R”. The important distinction made by Gauss in 
the case of surfaces in R° also holds in all generality. Intrinsic quantities of M are 
those that are determined by measurements along M itself, while extrinsic ones 
depend on the way M lies in R”. Riemann introduced the notion of a Riemannian 
manifold in which lengths are defined by a formula like the one that holds for a 
submanifold of R”. About one hundred years later, John Nash showed that 
although the idea was new (and extremely fruitful) the objects themselves were 
not, in that no new manifolds arise that are not already among those that occur as 
submanifolds of R”. 

For surfaces in R° it is clear that the principal curvatures and mean curvature 
are not intrinsic, since rolling a sheet of paper into a cylinder alters them while 
leaving distances along the surface unchanged. The big surprise, Gauss’ Theorema 
Egregium, was that the product k,k.,, the Gauss curvature, is intrinsic. 

For an m-dimensional Riemannian manifold M, which we may if we like, by 
Nash’s embedding theorem, consider as lying in some R”, Riemann defined at 
each point p the sectional curvatures as follows. Pick any pair of independent 
tangent vectors to M at p, say v and w. For every unit vector u = Av + pw, there 
is a unique geodesic in M starting at p, with tangent vector u. The set of all such 
geodesics, as u describes the unit circle in the plane spanned by vu and w, sweep 
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out a surface whose Gauss curvature at p is called the sectional curvature 


of the plane II spanned by v and w. 
Let e,,...,e,, be an orthonormal basis of the tangent space to M at p. Then 
the quantity 
T=2 » K[e;, e,| (26) 
1<i<j<m 


is independent of the choice of basis, and is called the scalar curvature of M at p. 

Returning to the special case of a hypersurface in R”, we may choose the e; to 
be principal curvature directions at p. Then a computation very similar to the one 
that Gauss used to prove that K is intrinsic leads to the equation 


K[e;,e,] = k,k,, (27) 


oe 


so that the scalar curvature + may be expressed in terms of the principal 
curvatures k,; by 


T=2 > kjk, (28) 
1<i<j<n-1 
or by (24): 
tT=(n-1)(n - 2)H,. (29) 


With that as background, we can proceed to the results. 


2. Curvature inequalities; curves and surfaces 
We start with a classical theorem of Fenchel for a curve in R* with curvature k. 


THEOREM 2.1. For a simple closed curve C in R°, 
/ kds = 27, (30) 
C 


with equality if and only if C is a convex plane curve. 


We give a slightly modified version of the proof in the book by do Carmo [1]. It 
has the unusual feature of “reducing” a theorem on curves to one on surfaces. 
Let 


K*= max{ K,0}. 


In other words, K* is the function on a surface whose value equals the Gauss 
curvature K wherever K is positive and is zero wherever K is negative or zero. 


LemMMa 2.2. For a compact surface S in R°, 
[K* dA > 47. (31) 
K) 


Proof. The left-hand side of (31) represents the area of the image under the 
Gauss map (counting multiplicities) of the part of S where K > 0. Hence, it is 
sufficient to show that the image covers the whole unit sphere. For that, it suffices 
to take your surface, as Heinz Hopf used to say, and “set it down on the floor.” 
The first point of S where it touches the floor has a normal vector pointing 
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vertically downward (assuming that we are using the orientation of the surface 
corresponding to the outer normal) and has Gauss curvature K > 0, because at a 
point where K <0, the surface is saddle-shaped and crosses its tangent plane, 
by (7). But the “floor” could be in any direction, and thus we get a point with 
K = 0 corresponding to every normal direction. Since the area of the unit sphere is 
47r, the lemma is proved. 

To prove Fenchel’s theorem, we apply the lemma to a thin tube around the 
given curve. In fact the proof is simply a computation showing that for such a tube, 
the left-hand side of (31) is equal to twice the left-hand side of (30) over the 
original curve. 

Let the curve C be given by 


x(s), 0O<s<L, 
where s is the parameter of arc length. At each point x(s) of C, we set 
e(s) =x'(s), 
the unit tangent vector. Let e,(s) be a smooth unit vector field such that 
e(s) -e(s) =0 
and set 
e,(5) =e(s) Xe,(s). 


Thus, e€,, e, span the normal space to the curve, and the tube of radius r around C 
is the surface 


y(s,t) =x(s) +rN(s,t), 


where 
N(s,t) = cos te,(s) + sin te;(s). 
Setting 
3 
el(s) = J a;(s)e,(s), i = 1,2,3, 
j=l 
and using a;; = —a,; (by differentiating the equation es) -e,(s) = 0), we find 
that 
dy oy . 
a as 7 r[1 + r(a,, cost + a3, sint)]N. 


Thus, for small positive r, the surface y(s,t) is a regular surface, and its unit 
normal is precisely the vector N(s, t). Furthermore, 
ON ON 
Or x a5. = —(a), cos tf + a3, Sm t)N. 
Since the definition of K amounts to the equation 
ON ON oy Oy 
— X — =K— x —, 
Ot OS Ot OS 
it follows that 
a,,cost + a3,sint 


K = —-——4 
r{1 + r(a,, cost + a3, sin t)| 
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But the curvature k of C is 
_ ” _ ’ _ __ _ 2 2 
k =|x"(s)| = lei(s)| = |A42€2 + 43€3| = |A21€2 + 31€3| = ya3, + a3; . 


Thus k(s) = 0 implies K(s,t) = 0 for 0 < t < 27. At each point where k(s) > 0, 
we may set a,, = k cos 6, a3, = k sin 9, so that 
k cos(t — @) 
OS r[1+ rk cos(t — @)|- 
Thus K > 0 for 7/2 <t — 6 < 32/2. Since 


°y °y =ril k @ 
— x —/=r[1+ t — 
1 4 r| rk cos( )], 


for any interval a < s < b where k(s) > 0, we have 


[xt aa=[” 


a 


fT kGs) cos(t — 6)] at ds = 2f’k(s) ds, 


and for the entire tube, we get 
[Kt dA = 2 ["k(s) ds. 
0 


Fenchel’s inequality (30) now follows immediately from (31). A closer analysis 
shows that equality holds only in the case indicated. (For details, see do Carmo [1], 
p. 400.) 

We next ask how Fenchel’s inequality extends to surfaces. The answer depends 
on which curvature we adopt in place of k. 


THEOREM 2.3. Let S be a compact surface embedded in R°. Then 
[ikl dA > 4q, (32) 
s 


with equality if and only if S bounds a convex domain in R°. 


Proof. Using Lemma 2.2, 
[\K\d4 > [K* dA > 4z. 
S S 


If K <0 at any point, then K < 0 in a neighborhood, and the first inequality is 
strict. Hence equality implies K > 0 everywhere and by a theorem proved first by 
Hadamard when K > 0, and later by Chern and Lashof [1] in general, S must 
bound a convex domain. 

If, instead of K, we consider the mean curvature H, we find several possible 
formulations. 


THEOREM 2.4 (Willmore). Let S be a compact surface in R°. Then 
/ H*dA> 47 (33) 
s 


with equality if and only if S is a standard sphere. 
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Proof. 


k,+k,\? k,—k,\? 
w= (2) = kyk, + (5) > K, (34) 


so that 
[HP dA > [K dA >4r 
RY RY 


by Lemma 3.2. Furthermore, inequality is strict unless H* = K, in which case 
k, =k, so that every point of S is umbilic. It is then a classical result (for example, 
do Carmo [1], p. 147) that S$ is a standard sphere. 

A question that has received a great deal of attention in the past decade is the 
Willmore Conjecture. If S is topologically a torus, then 


[HP dA > 21. (35) 
S 


This conjecture has been verified for many cases, including when S is a tube about 
some curve, or of certain conformal type. There are also higher-dimensional 
versions, chiefly due to B.-Y. Chen. For these and other results on the conjecture, 
see Bryant [1], and the books by Willmore [1] and Chen [1]. More recently, Langer 
and Singer [1] proved the conjecture for tori of revolution. Leon Simon [1] 
obtained the important result that there exists an immersed torus minimizing the 
left side of (35), and a result of Li and Yau [1] guarantees that such a surface must 
be embedded. Ferus, Pedit, Pinkall, and Sterling [1] provide, in some sense, a 
complete classification of Willmore tori; that is, stationary surfaces for the integral 
in (35). 

An inequality of a somewhat different character is one obtained by Minkowski 
for a surface S bounding a convex domain D of volume V. We may write the 
inequality as 


[Haa <— (36) 


where A is the area of S. Equality holds (for smooth surfaces) only when S is a 
standard sphere. 

We come finally to the recent results of Ros. He proved a new inequality, in a 
way dual to (36). 


THEOREM 2.5 (Ros [1]). Let S be a compact embedded surface in R° bounding a 
domain D of volume V. If the mean curvature H of S is positive everywhere, then 


1 
i 7 aA = BV. (37) 
Equality holds if and only if S is a standard sphere. 


Ros’ original proof uses results of Reilly [1]. A subsequent paper of Montiel and 
Ros [1], based on ideas of Heintze and Karcher [1], uses a more geometric 
approach. 
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Proof. The idea is to use a well-known formula (whose derivation we include at 
the end of the proof) for the volume of a one-sided shell on the inside of S, where 
we allow for a shell of variable thickness (somewhat like an oyster shell). In other 
words, we compute the volume of the domain obtained by starting at each point p 
of S and going a distance h(p) along the inner normal to S. The volume of this 
Shell is given by 


V = [Fa (38) 
S 
where 
F= pra ~k,t)(1 —k5t)| dt = peru — 2Ht + Kt?| dt, (39) 
0 0 


provided there is not overlap; that is, provided no portion of the shell is counted 
twice along normals from different parts of §. To prevent overlap we define h(p) 
as follows: 


h( p) = sup{r : the point p is the unique nearest point on S to the point q 
at distance r from p along the normal to S at p}. (40) 


From the definition of h(p) it follows that the interior of the shell of variable 
width h(p) has no overlap; every point of the interior lies on a unique normal to S. 
On the other hand, every point of D lies in the closed shell. Namely, given q in D, 
let d be the distance from q to S. Then the open ball B,(d) of radius d centered 
at q contains no point of S, but there is at least one point p of S on its boundary. 
For any point q' on the radius from q to p, if r is the distance from q’ to p then 
the closed ball B,,(r) lies in the interior of B,(d) except for the point p. It follows 
that p is the unique point of S realizing the distance r from q’ to S. By the 
definition (40) of h(p), we have d < h(p). Hence gq lies in the closed shell, as 
claimed. 

We conclude that the volume V of D is precisely equal to the volume of the 
shell given by (38), since all points of D are covered and the only points covered 
twice are in the image of the boundary, which is lower dimensional and contributes 
nothing to the volume. 

The key to the argument is now the elementary but crucial observation: 


1 

hp) = max{k,( p),k,(p)} (41) 
for every point p on S. Namely, let q be the point at distance h(p) from p along 
the normal. Then the open ball B,(h(p)) cannot contain any points of S, since if 
there were such a point p’, then its distance to qg would be less than h(p); for all 
points q' on the radius from q to p, sufficiently near q, the distance from q’ to p’ 
would then be less than the distance from q' to p, contradicting the definition of 
h(p). But since the surface S lies outside the sphere S,(h(p)), the surface and the 
sphere are tangent at p, and the normal curvatures of §$ at p are all bounded 
above by the normal curvatures of the sphere, which equal 1/h(p). This proves 
(41). 

It follows from (41) that each of factors 


(1—k,t), (1-&,t) 
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in (39) is non-negative for 0 < t < h(p), so that 


[OPW = et) = kyle = [PC = kt) = kat) a. (42) 
0 0 
By the inequality of the geometric and arithmetic mean, 
(1 —k,t)(1—k,t) < (1 — At)’ (43) 
and by (41) 
1 
—— >H(p 
np) =P) 


so that from (39) 
1/H 5 1 
F 1 — Ht)* dt = — 
< f(- Hy’ dt = = 
and by (38) 
V dA 
< —_——- 
< an 
proving (37). For equality to hold, it must hold in (43), which means that k, = k,. 
But then by the classical result referred to earlier in the proof of Theorem 2.4, S$ 
must be a sphere. This proves the theorem. 

For completeness, we show how one derives equations (38) and (39). It is 
sufficient to carry out the computation locally on S, since one can then add up to 
obtain the integral over the whole surface. So suppose we have S given locally in 
parametric form: 

S: x(u,,U,), (U,,u,) ED CR’. 


At each point we have the unit normal N(u,,u,), and we parametrize our 
“one-sided shell” by 


y(Uj,,Uz,U3) =x(U,,U,) + UzN(uy, Uy), (u,,U,) €D,0 <u; <h(uy,, up). 
The volume of this shell is then given by 
u,,u>)} 1 Y1> Y2>Y 
ff ie 1 U2) ( 1° 42 3) du du, dus, 
D| "0 (Uy, Uz, U3) 


provided the correspondence (u,,u,,uU3) > (y1, yz, y3) is one-to-one (which the 
definition (40) of A(p) is designed to guarantee). To evaluate the integrand, we 
note that if u,;,u, are principal curvature directions, then Rodrigues’ equations 
(12) imply 


Oy Ox ON Ox 
5 = a + u37— = (1 — kyu) -—, 
du, du, du, du 
Oy Ox Oy 
—=(1-k,u;)—, —-=N; 
du, du, du; 

Oy Oy Ox Ox 

a X yo = CU — kyu3)(1 — k2u3)— x -— 

du, du, du, du, 


Ox Ox 
= (1 — 2Hu, + Ku3)— x —, 
du, du, 
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where —H and K are the trace and determinant of the differential of the Gauss 
map g given by (10); that is, if 


then the trace and determinant of (c;,) is independent of choice of parameters. 
The equation above for (dy/du,) X (dy/du,) is, therefore, valid for any parame- 
ters u,,u,. Then 


A(¥1, Y25 Y3) dy ay dy dx Ox 
OCYte Yor Ys) | _|/ OY OY | | Oy = |l — 2Hu, + Ku3\|— x —|, 
0( x1, X>, 3) du, Ou,} du, du, du, 
so that from (14) 
Ox Ox 
h(u,, uy) a) 
V= 1 — 2Au., + Ku4| du.||— XK — | du, du 
[J] fre = eta, + Kid] dus|| =~ x | i, du, 


— pipers — 2Hu, + Kus au,| dA, 


which gives equations (38), (39). 

We have confined ourselves in this section to the case of compact surfaces. 
However there are many interesting results involving integrals of various curva- 
tures over complete, non-compact surfaces. We note in particular a recent paper 
by Brian White [1] and earlier ones by Osserman [1], [2]. 


3. Surfaces of constant mean curvature 


One consequence of Ros’ inequality (37) is a remarkably elementary proof of a 
famous theorem of Alexandrov: the only compact embedded surfaces of constant 
mean curvature in R° are the standard spheres. 

To see that, Ros makes use of a general equation, valid for any oriented 
compact immersed surface S in R?: 


A= — [HN -xdA, (44) 
S 


where A is the area of S, H its mean curvature, N the unit normal and x the 
position vector. Equation (44) can be proved in various ways, the simplest, perhaps, 
being a special case of the first variation formula; if v is any smooth vector field 
along S, let S, be the surface defined by 


y=x+ru (45) 


and let A(r) be the area of S,. Then an elementary computation gives 
A(0) = —2{ HN -udA., (46) 
s 
In this case, we choose v = x, so that (45) reduces to 
y=(1+r)x, 
a uniform stretching of S by the factor (1 + r), so that 
A(r) =(1+r)’A 
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and 
A'(0) = 2A. (47) 


Combining (46) with (47) gives equation (44). 

Now restrict to the case where S is an embedded surface of constant mean 
curvature H bounding a domain D. If we choose N to be the interior normal at 
each point, then H > 0, since S lies inside some large sphere, and contracting the 
sphere until it makes a first point of contact p with S, we find that both principal 
curvatures at p with respect to the inner normal must be positive. Applying the 
divergence theorem we deduce from (44): 


A= -H{N-xdA =H divxdV = 3HV (48) 
S D 
since 
div x = 3. 
Hence, 
A 
H- =, (49) 


which means that equality holds in (37), and by Ros’ theorem, S must be a 
standard sphere. 

For a very pretty elementary proof of Alexandrov’s theorem, of a more analytic 
nature, see Reilly [2]. Incidentally, Harold Rosenberg also noticed that equation 
(49) holds for constant mean curvature surfaces, and pointed out that as a result, 
equality holds in Minkowski’s inequality (36). Since it is known that at least for 
smooth convex surfaces, equality can hold in (36) only for the sphere, this gives 
another proof of the original theorem of Liebmann, the precursor of Alexandrov’s, 
that a convex surface of constant mean curvature must be a sphere, and could 
conceivably yield the full Alexandrov theorem if a suitable extension of Minkowski 
was found. 

A generalization of Liebmann’s theorem in a different direction than 
Alexandrov’s was given by Heinz Hopf. He showed that any immersion of a 
surface, topologically a sphere, with constant mean curvature in R° must be a 
standard sphere. In conjunction with Alexandrov’s theorem, that led to one of the 
most stubborn questions in the field during a period of several decades: must every 
immersed compact surface of constant mean curvature in R° be a standard sphere? 
That question, generally known as “Hopf’s conjecture,” was finally settled in 1986 
by Wente [1] who demonstrated the existence of an immersed torus of constant 
mean curvature. Wente’s work inspired a string of further results on compact tori 
of constant mean curvature (Abresch {2], Spruck [1], Pinkall-Sterling [1]). Using 
radically different methods, Kapouleas [2] has just shown that there also exist 
compact immersed surfaces with constant mean curvature of every genus g > 3. 
The existence of such surfaces with genus 2 remains an open question. 

We may mention just a few further results of interest on constant mean 
curvature surfaces proved in the last few years. 

In the case of compact surfaces, the first-variation formula (46) applied to a 
normal variation 


_U=fN 
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for an arbitrary function f, gives 
A(0) = —2f fHdA. 
s 


If S bounds a domain of volume V, and the varied surfaces S, bound domains of 
volume V(t), then 


V'(0) = | fdA. 
S 
In particular, if the volume is held fixed, so that 
Vit) =V, 


then V’(0) = 0, and the problem of minimizing surface area while fixing the 
enclosed volume leads to the analytic formulation of finding a surface S such that 


/ fHdA =0 whenever / fdA =0. (50) 
S S 


But a standard argument shows that (50) holds if and only if H is constant. A 
surface § satisfying (50) is called stationary with respect to the given class of 
variations. It is called stable if in addition 


A"(0) = 0. 
Note that condition (50) makes sense for any immersed surface, as does stability. 


THEOREM (Barbosa and do Carmo [1]). A compact stable surface of constant 
mean curvature is a standard sphere. 


Thus, Wente’s surfaces and their successors must all be unstable. 

Do Carmo’s theorem was extended by Palmer [1], Silveira [1] and Lopez and 
Ros [1] to complete surfaces, where the stability assumption applies to every 
compact subdomain and the surface is assumed to have non-zero constant mean 
curvature. This result, along with many others on complete surfaces of non-zero 
constant mean curvature, tends to restrict the possibilities for such surfaces. Other 
such theorems are those of Klotz-Osserman and Hoffman-Osserman-Schoen [1] 
stating that a complete surface of non-zero constant mean curvature must be a 
circular cylinder if, in the first case, its Gauss curvature K satisfies K < 0, and in 
the second, its image under the Gauss map lies in a closed hemisphere. The latter 
result was extended by Lopez and Ros [1] who showed that it is sufficient to 
assume that for,some compact set on the surface, the image of its complement 
under the Gauss map lies in a closed hemisphere. These results are sharp, in that, 
as Seaman showed [1], there exist examples whose Gauss map covers an arbitrarily 
narrow strip surrounding the equator: namely certain of the surfaces of revolution, 
called Delaunay surfaces. 

A beautiful very recent result by Korevaar, Kusner and Solomon [1] character- 
izes the Delaunay surfaces as the only doubly-connected surfaces (besides the 
circular cylinder) properly embedded in R°* with non-zero constant mean curva- 
ture. 

In view of the paucity of examples, it represented a major breakthrough when 
Kapouleas [1] proved the existence of a whole new species of complete surfaces of 
non-zero constant mean curvature, including examples of infinite genus, and 
infinitely many topologically distinct ones of every finite genus. Each annular end 
converges to a Delaunay surface. That turns out not to be an accident, as was 
demonstrated shortly after in the paper of Korevaar, Kusner and Solomon [1]. 
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They prove: 


THEOREM. Let S be a properly embedded surface of finite topological type with 
constant non-zero mean curvature. Then each end is asymptotic to a Delaunay surface. 


We have restricted our attention in this section to surfaces of non-zero constant 
mean curvature. Surfaces with H = 0, minimal surfaces, constitute a whole world 
of their own, about which one can learn in a number of recent books: Barbosa and 
Colares [1], Bourguignon, Lawson, and Margerin [1], Nitsche [1], Osserman [3], [4], 
and Yang [1] in particular. 


4. Higher dimensions 


Some of the results described in sections 2 and 3, such as Hopf’s and Kapouleas’ 
Theorems, use the two-dimensionality of the surfaces studied in an essential way. 
In fact, Hopf’s theorem turns out to be false in general, as was proved by Wu-Yi 
Hsiang [1], who constructed immersed spheres of constant mean curvature in R”, 
n> 4, different from the standard sphere. However, other results extend to 
hypersurfaces in higher dimensions, in many cases in a straightforward manner. In 
particular, Ros’ fundamental inequality, along with its proof, extends to compact 
hypersurfaces S in R” with H > 0 in the form 


1 
la dA > nV, (51) 


where S is assumed to be the boundary of a domain D of volume V, and again, 
equality holds only for the standard sphere. But also the first variation formula 
(46) and its consequences, (44) and (48), extend easily to give equality in (51) when 
#7 is constant, and hence proves Alexandrov’s theorem in R”. 

Ros’ main goal, however, was to prove the conjecture that constant scalar 
curvature implies that an embedded compact hypersurface must be a sphere. He 
did that [1] using results of Reilly [1], together with the fact that, as follows from 
(29), constant scalar curvature is equivalent to constant second-order mean curva- 
ture. That paper led Ros [2] and Korevaar [1] independently to a proof of the 
general theorem that the constancy of any of the H;, is possible only for a sphere. 
Korevaar showed how Alexandrov’s original argument for H, could be carried 
over, whereas Ros adapted the reasoning in his earlier paper. Finally, Montiel and 
Ros [1] gave the new and elementary proof of Ros’ inequality (51) that we 
described above. 

To complete the proof for the higher order mean curvatures H,, defined as we 
noted earlier by the equation 


n-1 n-1 n—1 . 
Wa+%)=5 , JH 
j=1 j=0 J 
Ros makes use of the Minkowski equations 
[(H, + H,,,N:x)dA =0. (52) 


Note that equation (44) is just the special case j = 0 of (52). But the general 
equations follow from the special case by computing the mean curvature H(r) of a 
parallel hypersurface S,. to S, which may be expressed explicitly in terms of the 
quantities H, on the original surface, and substituting in (44); setting the coeffi- 
cient of each power of r equal to zero gives Minkowski’s equations (see Montiel-Ros 
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[1]. Incidentally, these “Minkowski equations” were apparently first stated and 
proved by Kubota for convex hypersurfaces and then obtained in the general case 
by Hsiung [1]. What Minkowski did was the case n = 3, j = 1.) 

To conclude the proof, Ros shows that if a particular H; is positive on all of S, 
then the same holds for H,,..., H,_,, and the following inequalities hold: 


H)-'<Hj_,, H,<H', i=1,...,j. (53) 


L 


Suppose now that H, is constant on S. Since there exists some point where all 
the principal curvatures are positive (using the same argument of contact with an 
enveloping sphere as in R*), the constant value of H, is also positive, and 
inequalities (53) hold. By (52), 


[H-1dA = ~ J H)N «xd = ~HyJ No xd = HynV, 


where we have used the n-dimensional version of (48) in the last step. Invoking the 
inequalities (53) then yields 


1 1 A 1 1 
— __ —_ G-1)/i —_ = | —__ — 
nV iy, J i-1 44 > rake dA > A Sag dA > [a 4A, 


But this is just the reverse of Ros’ inequality (51). Hence equality holds in (51) and 
by Ros’ Theorem, S is a standard sphere. 


5. Related results 


We now review briefly a few further topics, again with emphasis on some of 
those where exciting progress has been made in the eighties. 

First, another innocuous-sounding question that flowered into a beautiful the- 
ory: what are the surfaces for which not only the mean curvature, but each of the 
individual principal curvatures is constant? That turns out to be such a strong 
condition, that even locally, any piece of such a surface must lie on a sphere, a 
plane, or a right circular cylinder. An interesting, and perhaps unexpected feature, 
is that it is impossible for the principal curvatures of a surface to be, even locally, 
distinct non-zero constants. Shortly after these results were obtained by Levi-Civita 
[1], they were extended to hypersurfaces of arbitrary dimension. Segre [1] proved 
that if all the principal curvatures are constant, then there could be at most two 
distinct values,-and at most one of them different from zero. As a result, the 
hypersurface must again lie either on a sphere, a hyperplane, or a generalized 
cylinder S* x R"~*, k =1,...,n — 1. The whole subject, which was apparently 
finished shortly after it began, took on an entirely new life with a paper of Cartan 
[1] who posed the same question for hypersurfaces in a sphere. Before describing 
the results, let us recall the definitions. 

Let C be a curve lying on a sphere S” in R"*'. If C is given parametrically as 
x(s) where s is the parameter of arclength and x(s,)) = p, then the curvature vector 
of C at p with respect to S$” is the projection of the Euclidean curvature vector 
x"(s)) onto the tangent plane of S$” at p. Note that a curve C has zero curvature 
vector with respect to S if and only if x"(s) is normal to S”. If that holds on an 
interval, then one can show that the corresponding arc of C lies on a great circle. 

Now let M be an (n — 1)-dimensional manifold lying on S$”, and p a point of 
M. By considering the curvature vectors at p with respect to S” of curves through 
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p lying on M, we arrive at a set of principal curvatures of M at p with respect to S” in 
the same manner that we defined in (18) the principal curvatures of a hypersurface 
in R” by using the standard curvatures of curves on the hypersurface at a given 
point. It is those principal curvatures that Cartan requires to be constant. 

In his original paper and a number of subsequent ones, Cartan shows that this 
class of surfaces, called isoparametric hypersurfaces, is far more interesting in S$” 
than in R”. The intersection of the cylinder 


xp tics: +x 

with the sphere S” 
2 2 4.2, ,2 
Xp ttt X74 HN ary 


is an isoparametric hypersurface. Cartan showed that every hypersurface of S” 
with two distinct (constant) principal curvatures is one of those. They include a 
famous surface, the Clifford torus in S°. Cartan also showed that there were 
examples with three distinct curvatures, and that they could all be represented as 
tubes over a special set of embeddings in certain S$”, including S*. He then 
constructed an example with four distinct curvatures in S°, and was naturally led 
to ask whether there exist examples with any prescribed number of distinct 
curvatures. That question remained unanswered for almost forty years, until the 
appearance in 1980 and 1981 of two remarkable papers by Minzner [1]. Among 
the results he obtains are 

1. the only possibilities for the number of distinct curvatures are 1, 2, 3, 4 
and 6; 

2. every isoparametric hypersurface of S$” lies on an algebraic surface; if there 
are g distinct curvatures, then it lies on the intersection of S$” with a variety 
defined by a homogeneous polynomial of degree g. 

An illustration of the second result is the theorem of Cartan mentioned earlier 
that all examples with two curvatures are given by the intersection of the sphere 
with a cylinder, defined by a quadratic polynomial. 

Following the work of Miinzner there has been a great burst of activity in the 
field. For an excellent discussion and references up to 1985, see Chapter 3 of the 
book by Cecil and Ryan [1]. A recent paper of interest with further references is 
that of Solomon [1]. 

Cartan alsq considered isoparametric hypersurfaces in hyperbolic space—that 
is, a space with constant negative sectional curvature. What Cartan showed [1] was 
that isoparametric Surfaces in hyperbolic space were limited in the same manner as 
in Euclidean space, in that they could have at most two distinct principal curva- 
tures. 

We should say a word about how principal curvatures are defined for hypersur- 
faces in a Riemannian manifold, where the metric is only defined intrinsically. 
First of all, by Nash’s embedding theorem, every such manifold M can be thought 
of as embedded in some Euclidean space R%, in which case any curve C in M has 
at each point its ordinary curvature vector x"(s) in R%. The geodesic curvature 
vector of C at a point is simply the projection of x"(s) onto the tangent space of M 
at the point. The critical fact is that the geodesic curvature vector can be expressed 
intrinsically in terms of the Riemannian metric on M. Consequently, one does not 
need to consider an embedding, but can define curvatures of curves, principal 
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curvatures of hypersurfaces, and all the corresponding quantities in any Rieman- 
nian manifold. 

As an example, once one has the principal curvatures of a hypersurface in 
hyperbolic space, one can define the mean curvature as their average, and all the 
higher-order mean curvatures as before. One can also define “spheres” in a 
Riemannian manifold, or “geodesic spheres,” as they are usually called, centered 
at a point p, with radius r, as the set of points reached by going a distance r along 
an arbitrary geodesic starting at p. For a compact hypersurface M embedded in 
hyperbolic space, Korevaar [1] and Montiel-Ros [1] prove that if any of the mean 
curvatures is constant, then M must be a geodesic sphere. The same is true if M 
lies in a hemisphere of S”. However, for hypersurfaces in the full sphere, the result 
is not true, since one has all the isoparametric examples, which clearly have all 
mean curvatures constant, since all principal curvatures are constant. One can 
narrow down the possibilities if one restricts, for example, the sectional curvature 
of M. (See Walter [1] and further references given there.) 

We conclude our discussion of this subject with two further remarks. First, for 
an isoparametric hypersurface M in a constant curvature space, the parallel 
hypersurfaces to M are also isoparametric. Second, the family of parallel isopara- 
metric hypersurfaces may be written as the level sets of a function F with the 
property that both the Laplacian of F and the magnitude of the gradient of F are 
functions of F itself. In fact this property characterizes isoparametric hypersur- 
faces and is the origin of their name. 

We now enlarge our viewpoint a bit and ask the following question: for which 
Riemannian manifolds do all geodesic spheres have constant mean curvature? It turns 
out that manifolds with this property arose in the context of “harmonic spaces.” 
These spaces, or manifolds, may be characterized in a number of ways—for 
example by the property that the mean value of a harmonic function over a 
geodesic sphere equals its value at the center, just as in Euclidean space (a 
theorem of Willmore. See also Ruse, Walker and Willmore [1] and Chapter 6 of 
Besse [1, in particular, 6.19].) 

Among manifolds that satisfy the given condition is the famous class of “‘two- 
point homogeneous manifolds.” They are defined by the property that for every 
two pairs of points p,,q, and p,,q,, such that the distances d(p,, q,), d(p5, gz) 
are equal, there is an isometry of the manifold taking p, to p, and q, to q,. In 
particular, if p, = p,, then for any two points q,, q, of a geodesic sphere centered 
at p,, there is an isometry of the manifold fixing the point p,, hence necessarily 
taking the sphere to itself, and mapping q, to g,. Thus, the mean curvature of the 
geodesic sphere is necessarily constant. In 1944, Lichnerowitz conjectured that the 
two-point homogeneous spaces were the only harmonic spaces. Very little progress 
was made for 45 years. Then in 1989, the conjecture was proved by Szab6 [1] for all 
simply connected compact harmonic spaces. The proof draws on much work that 
had been done on both harmonic spaces and two-point homogeneous spaces 
separately, as well as some results linking them. 

We mention here a few facts about two-point homogeneous spaces in which 
curvature plays a major role. We start with the celebrated “pinching theorem” of 
Toponogov, Berger, and Klingenberg: 


THEOREM. Let M be a compact simply-connected manifold whose sectional curva- 


ture K satisfies 
0<d6<K<l. 
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Let d be the diameter of M. Then 
i)d>7T7; 
ii) M is homeomorphic to a sphere if any of the following holds: 
a) 6 > G, 
b) 6 = 4 and dim M is odd, 
c) 6 =iandd>vz; 
iii) if dim M is even, 5 = +, and d =7, then M is a two-point homogeneous 
space. 


For proofs of this and other pinching theorems see the books of Chavel [1] and 
Cheeger-Ebin [1], or the excellent recent article by Karcher [1]. 

Regarding part (iii) of the theorem, it turns out that one can list explicitly all the 
compact two-point homogeneous spaces; they are the standard spheres and real 
projective spaces, the complex and quaternionic projective spaces, and the projec- 
tivized Cayley plane, all with their standard Riemannian metrics. 

This same list arose in Cartan’s investigations of a certain class of spaces 
defined by a condition on curvatures. To explain that condition, let us start again 
with a manifold M embedded in Euclidean space. Let C be a curve in M, and let 
uv be a vector field along C; that is, at each point x(t) of C, v(t) is a tangent vector 
to M at x(t). Then v is said to be parallel along C if v(t) is normal to M at x(t) 
for all ¢t. This notion is the parallel translation of Levi-Civita, and like the closely 
related notion of geodesic curvature, it has the remarkable property that it can be 
defined intrinsically, independent of whether or how M is embedded in Euclidean 
space. Furthermore, given C and a tangent vector v, at a point x(t.) of C, there is 
a unique parallel field v(t) along C with v(t.) = vo. Finally, if v(t), w(t) are both 
parallel along C, then v(t) - w(t) is constant. It follows that if IJ, is a two-dimen- 
sional tangent plane at a point x(t,) of C, then we can define parallel two-planes 
along C by taking a pair of orthonormal tangent vectors U),w, spanning Ilo, 
letting v(t), w(t) be the parallel fields along C extending v,,w, and defining rm) 
to be the plane spanned by u(t) w(t), which are necessarily orthonormal for all t. 
By the properties we have mentioned, this one-parameter family of planes along C 
exists, is unique, and is independent of the choice of the original pair Uy, Wo. 

We come now to the notion introduced by Cartan. 


DEFINITION. A Riemannian manifold is locally symmetric if the sectional curva- 
ture is constant.along every family of parallel planes II(¢). 


A locally symmetric space has rank one if none of its sectional curvatures is 
zero. 


THEOREM. The two-point homogeneous spaces are precisely the Euclidean spaces 
IR”, the real projective spaces RP”, and the complete simply-connected locally symmet- 
ric spaces of rank one. 


For more on this subject, see the books of Chavel [1], Helgason [1] and Besse 
[1]. In this last reference, a Compact Rank One Symmetric Space is referred to 
acronymically as a CROSS. Thus, the theorems of Szab6 and Berger referred to 
above may be stated: a compact simply-connected manifold which is either har- 
monic, or is even dimensional with diameter zw and sectional curvatures pinched 
between ; and 1, must be a CROSS. 

We conclude this section with a quick review of questions involving manifolds 
for which one or more of the intrinsically defined curvatures is constant. 
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First, complete Riemannian manifolds with constant sectional curvature—the 
“Clifford-Klein space forms’”—were studied in detail by Hopf, and their classifica- 
tion was completed by Wolf [1]. The case of three-dimensional manifolds with 
constant negative curvature has been intensively studied in recent years, with the 
major impetus coming from Thurston [1], [2]. 

The requirement of constant sectional curvature imposes severe restrictions on 
the possible forms of the manifold. One can ask what restrictions arise from 
requiring that the scalar curvature be constant. The fundamental paper here is 
one of Yamabe from 1960. The question was whether there are any restrictions 
needed to have a metric of constant scalar curvature. Yamabe presented a proof 
that for compact manifolds there always exists such a metric conformally equiva- 
lent to any given metric on the manifold. However, his purported proof turned out 
to be incomplete, and after partial results by Trudinger, Aubin [1] proved the 
theorem for all manifolds of dimension n = 6 that were not conformally flat. To 
settle the problem completely, a new global type of argument was needed, and that 
was provided in 1984 by Schoen [1]. A general survey of the problem may be found 
in Lee and Parker [1]. Subsequent papers by Schoen [2], [3], [4] contain a number 
of further results, including an analysis of all metrics of constant scalar curvature 
in a given conformal class. 

Thus, in contrast to the case of embedded hypersurfaces where, as we saw, 
constant scalar curvature was possible only for a sphere, for general compact 
manifolds there are no topological implications whatever. On the other hand, the 
sign of the scalar curvature does have significance. For a wealth of information on 
that topic we refer to the book of Lawson and Michelson [1]. 

The sectional curvature and the scalar curvature are the two most immediate 
curvature quantities that arise in the study of Riemannian manifolds. However, 
experience has shown that a third—intermediate—curvature is in many respects 
the “right one” to consider. To define it, we look more closely at the totality of 
sectional curvatures at a point p of a Riemannian manifold M. 

Let uv be a unit tangent vector to M at p. Let H, =vu~ be the hyperplane in 
the tangent space orthogonal to v. For w € H,,, set 


Q,(0)=0, Q,(w) =|w|*K(v,w) for w # 0 (54) 
where 
K(v,wy) = sectional curvature of the plane spanned by v and w. (55) 
Then Q,, is a quadratic form on H,, whose eigenvalues we denote 
A,(v) 2 A,(v) = +++ 2A,_(v) (56) 
where dim M = n. The corresponding eigenvectors 
Ae 2 
form an orthonormal basis of H,,, and 
Av) = Q,(e;) = K(v, 2) 
is a sectional curvature of a plane through v, with A, and A,_, representing the 
maximum and minimum of such curvatures. 


DEFINITION. The Ricci curvature R(v) of M at p in the direction v is 


R(v) = Ya,(v). (57) 


i=1 
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Thus, the Ricci curvature is the trace of the quadratic form Q,,, and since the 
trace is independent of basis, we can also write 


n—-1 
R(v) = ¥ K(v,¢)) (58) 
i=1 
where é,,...,é€,_, 18S any orthonormal basis of H,,. 
Geometrically, it would generally make more sense to consider the average 
rather than the sum of the sectional curvatures, and in some places that conven- 
tion is adopted. However, (58) is the more common usage. 


DEFINITION. An Einstein manifold is a Riemannian manifold with constant 
Ricci curvature. 


The literature on Einstein manifolds is immense—we refer to the excellent 
book of Besse [2]—and there are many recent results of interest involving the 
Ricci curvature. Let us mention just one: the celebrated theorem of Hamilton [1]. 


THEOREM. Let M be a compact manifold, dim M = 3. If there exists a metric on 
M with positive Ricci curvature, then there also exists one of constant positive sectional 
curvature. 


Since, as we mentioned, manifolds with constant sectional curvature are com- 
pletely classified, Hamilton’s Theorem implies that in three dimensions, positive 
Ricci curvature has powerful implications. 

In view of the rich theory that grew out of replacing submanifolds of constant 
mean curvature by the more restrictive class of manifolds with each principal 
curvature constant—the isoparametric manifolds—it seems natural to consider a 
kind of intrinsic analog. We ask about manifolds satisfying the following condition: 


(*) the set of eigenvalues \,(v),..., A, (Uv) in (56) is the same for all p and v. 


Clearly, by (57), such manifolds are Einstein manifolds. 

Although the analogy with isoparametric surfaces would be an obvious motiva- 
tion for introducing condition (+), that was not the original source of the notion. It 
arose from some results in ergodic theory giving a lower bound for entropy of 
geodesic flow in terms of curvature quantities (Osserman-Sarnak [1]) and from a 
later conjecture about a possible upper bound for the entropy (see Burns and 
Katok [1], Conjecture 5.4). A discussion of these matters with Christopher Croke 
led to condition (*). . 

For rank one locally symmetric spaces the values of the A,(v) in (56) are known 
explicitly: there are just two distinct values, with ratio 1/4, and they are indepen- 
dent of p and v. Thus all such spaces satisfy condition (*). In view of some of the 
results from ergodic theory, we are led to the following: 


ConsectTurE. The only nonflat manifolds satisfying condition (* ) are the rank one 
locally symmetric spaces. 


More details on the background to this conjecture are given in the paper of Chi 
[1], where it is shown that the conjecture does indeed hold in many cases: in 
particular for dimension 4, 2k + 1, or 2(2k + 1). A later paper of Chi [2] settles 
several further cases. However, the general conjecture appears quite difficult. Of 
course it is possible that a counterexample may turn up, in which case the class of 
manifolds satisfying (*) may turn out to be of separate interest. 
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For an interesting discussion of related questions and many more references, 
see the papers of Vanhecke [1], [2]. 


Conclusion 


Of the many topics that could have been included in this discussion but were 
not, let me name just a few: there are the ‘“‘almost flat manifolds” of Gromov, the 
classification of manifolds of non-positive curvature by Ballmann, Brin, Eberlein, 
Burns and Spatzier, and a whole series of results relating curvature bounds to 
topology. A few references for these subjects are included in the bibliography. I 
hope that at least the topics touched on here will convey some notion of the role of 
curvature in geometry, and a sense of its continuing fascination in current re- 
search. 


Notes added in proof. Both Yu D. Burago and Joel Spruck have pointed out 
that for the special case of convex hypersurfaces, Ros’ inequality (37) is a 
consequence of Minkowski’s (36): 


1 ; 1 
2 _ _ 
av f HdA <A [VT ua) < fHdA| 5 aA. 

The fact that equality holds in both Minkowski’s and Ros’ inequalities when H 
is constant turns out for star-shaped domains to be a special case of the equation 
3VH = A that holds for all such domains, where H is the volume average of the 
function obtained by extending the mean curvature H of the surface to be constant 
along radii. (See Theorem 9.1, p. 234 of Finn [1].) 

An interesting additional reference on surfaces of constant mean curvature is 
the recent paper of Bobenko [1]. 

Finally, there are strong parallels between embedded surfaces of constant mean 
curvature in R° and metrics of constant scalar curvature on conformally flat 
manifolds. Alexandrov’s Theorem that the sphere is the only embedded compact 
surface of constant mean curvature has a counterpart in a uniqueness theorem of 
Obata, and the initial results of Kapouleas [1] were an offshoot of some of 
Schoen’s work [2] related to the Yamabe problem. 
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Veblen Prize in Geometry, the Alan T. Waterman Award of the NSF, and 
the Fields Medal. His Ph.D. is from Berkeley, in 1972. 


1. Introduction 


John Conway discovered a technique using infinite, finitely presented groups 
that in a number of interesting cases resolves the question of whether a region in 
the plane can be tessellated by given tiles. The idea is that the tiles can be 
interpreted as describing relators in a group, in such a way that the plane region 
can be tiled, only if the group element which describes the boundary of the region 
is the trivial element 1. 

A convenient way to describe the construction is by means of the Cayley graph 
or graph of a group. If G is a group, then its graph ['(G) with respect to generators 
21, 8>--+, Z, 1S a directed graph whose vertices are the elements of the group. For 
each vertex v € ['(G), there will be m outgoing edges, labeled by the generators, 
and n incoming edges: the edge labeled g, connects v to ug;. 

It is convenient to make a slight modification of this picture when a generator g, 
has order 2. In that case, instead of drawing an arrow from v to vg, and another 
arrow from ug, back to v, we draw a single undirected edge labeled g;. Thus, in a 
drawing of the graph of a group, if there are any undirected edges, it is understood 
that the corresponding generator has order 2. 

The graph of a group is automatically homogeneous: for every element g € G, 
the transformation v — gu is an automorphism of the graph. Every automorphism 
of the labeled graph has this form. This property characterizes graphs of groups: a 
graph whose edges are labeled by a finite set F such that there is exactly one 
incoming and one outgoing edge with each label at each vertex is the graph of a 
group if and only if it admits an automorphism taking any vertex to any other. 

Whenever R is a relator for the group, that is, a word in the generators which 
represents 1, then if you start from v € [ and trace out R, you get back to v again. 
If G has presentation 


G = (81, 8)>---, 8, |R,; = 1, Ro =1,...,R, = 1), 


the graph I'(G) extends to a 2-complex ['7(G): sew k disks at each vertex of 
v € I(g), one for each relator R,, so that its boundary traces out the word R;. An 
exception is made here for relations of the form g7 = 1, since this relation is 
already incorporated by drawing g, as an undirected edge. The 2-complex ['7(G) is 
simply-connected: that is, every loop in [*(G) can be contracted to a point. In fact, 
if the loop is an edge path, the sequence of edges it follows describes a word in the 
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generators. The fact that the path returns to its starting point means that the word 
represents the identity. A proof that this word represents the identity by making 
substitutions using the relations R,; can be translated geometrically into a homo- 
topy of the path in T7(G). 

As a very simple example, the symmetric group S, is generated by the transposi- 
tions a = (12) and b = (23). They satisfy the relation (ab)* = 1. The graph is a 
hexagon, with undirected edges, alternately labeled a and b. 

A slightly more complicated example is S,. It is generated by three elements 
a = (12), b = (23), and c = (34). A presentation is 


S, = (a,b, cla* = b? = c? = 1,(ab)’ = (bc) = (ac)* = 1). 


To construct its graph, first make some copies of the ab hexagon for the S, 
subgroup generated by a and b, and similarly make some copies of bc hexagons. 
The subgroup generated by a and c is Z, X Z,, and its graph is a square with 
edges labeled alternately a and c. Make copies also of ac-squares. Take one copy 
of each polygon, and fit them together around a vertex, gluing an a edge to an a 
edge, etc. Around the perimeter of this figure, keep gluing on a copy of the 
polygon that fits. If you do this systematically, layer by layer, you will have 
constructed a polyhedron—it is a truncated octahedron. All the edges from the 
underlying octahedron are labeled b, while the squares produced by truncating the 
vertices are labeled acac. 

The reader may enjoy working out the graph of the alternating group A., using 
generators a = (12)(34), and b = (12345). Note that they satisfy the relations 
b> = 1 and ab? = (135) = 1. Try kicking around the construction, with white 
ababab hexagons and black bbbbb pentagons. 


SY 
VY 


Fic. 1.1. Soccerball. A soccerball is constructed from 12 pentagons, obtained by rotating and shrinking 
the faces of a regular dodecahedron, together with 20 hexagons centered at the vertices of the 
dodecahedron. 


Of course, graphs of groups don’t always work out so nicely or so easily, but 
often, for simple presentations, they can be worked out, and they tend to have a 
nice geometric flavor. 


2. Lozenges 


We will begin with a relatively easy tiling problem. Suppose we have a plane 
ruled into equilateral triangles, and a certain region R bounded by a polygon 7 


1990] CONWAY’S TILING GROUPS 759 


whose edges are edges of the equilateral triangle network. When can R be tiled by 
figures, let us call them lozenges, formed from two adjacent equilateral triangles? 


Fic. 2.1. A region tiled by lozenges. A portion of an equilateral triangle subdivision of the plane, tiled 
by lozenges. 


To analyze this problem, we first establish a labeling convention. We arrange 
the triangulation of the plane so that one set of edges is parallel to the x-axis, or at 
0°. Label these directed edges a, label b the directed edges pointing at 120°, and c 
the edges pointing at 240°. This labeling is homogeneous, so it is the graph of a 
group A. We can read off relators for A by tracing out the boundary curves of 
triangles: A satisfies abc = 1 and cba = 1. If desired, the first relation could be 
used to eliminate c; the second relation then says that ba = ab. The group A is 
Z + Z, as we could have seen anyway by its action on the plane. 

The shape of the polygon 7 is determined by the sequence of edges it traces 
out; this is a word in the generators a,b,c of A. Rather than thinking of it as a 
word, we prefer to think of it as an element a(a) in the free group F with 
generators a, b,c. The fact that 7 closes up is equivalent to the condition that the 
homomorphism F — A send a(7) to the identity. 

If a lozenge is placed in the triangular network, its boundary can be traced by 
one of three elements, depending on its orientation: that element is either 
L, = aba~'b~',> L, = beb™'c~', or L; = cac~'a~'. The precise word depends on 
the starting point on the boundary of the lozenge, but starting from a different 
vertex only changes the word by a circular permutation; the two choices give 
conjugate elements of F. The lozenge group L is defined by these relators, that is 


L = (a,b, clL, =L, =L,=1). 


Actually, the three relations say that the three generators commute with each 
other, so that L = Z°. 

We claim that if the region R can be tiled by lozenges, then the image (7) of 
a(ir) in L must be trivial. In fact, suppose that we have such a tiling. If R consists 
of a single tile, the claim is immediate. Otherwise, find a simple arc in R which 
cuts R into two tiled subregions R, and R,. By induction, we may assume that 
I(a,) and J(7,) are both trivial, where 7, is a polygonal curve tracing around 0R,. 
But I(r) = I(7r,)* I(7r,), so I(r) is also trivial. 
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Fic. 2.2. Three-dimensional interpretation of lozenge tiling. If a region R can by tiled by lozenges, then 
the lozenge pattern lifts to the 2-skeleton of a cubical tiling of R°, oriented diagonally to the plane of 
the lozenges. 


There is a very direct geometric interpretation: think of the graph ['(L) as the 
1-skeleton of a cubical tesselation of space, oriented so that cubes are on their 
corners: more precisely, so that the two endpoints of any path labeled abc are on 
the same vertical line. The 2-complex I'*(L) is the union of the faces of the cubes. 
A lozenge in the plane is the orthogonal projection of a square face of a cube. 
Given a path 7 in the plane, arrange it (for notational purposes only) so the base 
point * lies below the base point 1 of ['(L). Lift it edge by edge to a path in T'(L). 
When you make a complete circuit around 7, you may or may not come back to 
the starting point in [(L). The invariant J(az7) € L is the ending vertex. This 
invariant of necessity lies in the kernel of the map L — A, which is isomorphic to 
Z: it can be described simply as the net rise in height. 

If R can be tiled by lozenges, the tiling itself can be lifted, tile by tile, into 
[?(L), that is, into the 2-skeleton of the cubical tesselation. This gives another 
proof that the invariant J() must be 1 if R can be tiled. In fact, if you look at a 
tiling by lozenges, you can imagine it so that it springs out at you in a three-dimen- 
sional picture. 


Fic. 2.3. Nontileable region. The region in the plane enclosed by the polygonal curve cannot be tiled by 
lozenges, since when it is lifted to the cubic network, it fails to close. 


Algebraically, given the word representing 7r, the net rise in height is simply the 
sum of the exponents. The condition is that 7 heads at a bearing of 0°, 120°, or 
240° the same length of time it heads at a bearing of 60°, 180°, or 300°. 
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This condition can be seen in an alternative way using a coloring argument. The 
triangles in the plane have an alternating coloring, with abc triangles colored white 
and cba triangles colored black. Each lozenge covers one triangle of each 
color—therefore, if R can be tiled, the number of white triangles must equal the 
number of black triangles. The difference in fact can be shown to be the net rise in 
height of a, as measured in main diagonals of cubes. The coloring consideration 
really gives a more elementary derivation that /(7) must vanish for a tiling to be 
possible. However, this and related coloring arguments in general cannot give as 
much information as /(7). One way to think of it is that coloring arguments are 
the abelian part of the group theory. If the group is abelian as in the present case, 
or more generally if the subgroup consisting of invariants [(7) for closed paths is 
abelian, then that information is sufficient. 

The algebraic condition that J(7) = 1, is not sufficient to guarantee a tiling by 
lozenges. There are curves 7 which go around nearly a full circle, with the lift in 
I'(L) rising considerably, and then instead of closing, they circle around another 
loop which brings them down to the starting height. If R could be tiled by 
lozenges, it could be divided into two regions by a fairly short path along edges of 
lozenges: but the rise in height for one side would be forced to be still positive, 
which would be a contradiction. We will return later to give a necessary and 
sufficient condition for a tiling by lozenges, along with a formula for a tiling if such 
exists. 


Fic. 2.4. Potentially tileable region. The boundary curve of this region lifts to a closed curve, so it 
meets the group-theoretic tiling condition. An actual tiling will be shown in 4.1, High lozenge tiling. 


3. Tribone Tilings 


Here is another example, for which other methods seem inadequate. I first 
heard this problem in an electronic mail inquiry from Carl W.Lee (ms.uky.edu!lee) 
in Kentucky. 


Last semester, a number of us here became interested in a combinatorial 
problem that was making the rounds. I’m sure you already have heard of it, 
and we heard a rumor that John Conway had solved it. It concerned a 
triangular array of dots. The problem was to pack in as many segments as 
possible, where each segment covered three adjacent dots in one of the three 
directions, and no two segments were allowed to touch. Is there any size 
configuration that admits a packing such that each dot is covered? Do you 
know anything about the status of this problem? Thanks in advance. 
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Fic. 3.1. Triangle of hexagons. A triangular array of hexagons, eight on a side. Can this be tiled by 
tribones? 


I hadn’t heard of it, but I asked Conway about it. We sat down together, and he 
worked it out. 

This question can be alternately formulated in terms of a triangular array of 
hexagons. The problem is to show that one cannot tesselate the region using tiles 
made of three hexagons hooked linearly together. More generally, one can ask for 
the minimum number of holes left in an attempt to tile the region by these tiles. 

If the region has side length n, then the number of hexagons is n(n + 1)/2. A 
first, necessary condition is that n or n + 1 is divisible by 3, that is, n is congruent 
to 0 or 2 mod 3. Note that if it is ever possible to solve the problem when n is 
congruent to 2 mod 3, one can extend the solution by adding a row of tiles along 
one side, to derive a solution for n + 1. 

Label each side. in the hexagonal grid with an a, b, or c, according to the 
direction of the edge: a if it is parallel to the x axis, b if the angle from the x-axis 
to the edge (measured counterclockwise) is 60°, and c if this angle is 120°. Thus, 
the sides of every hexagon are labeled abcabc. 

This labeling gives the 1-skeleton of the grid the structure of a group graph, 
where the group is 


A = (a,b, cla? = b? = c? = (abc) = 1). 


The group is a group of isometries of the plane, generated by 180° revolutions 
about the centers of the edges; it also contains the 180° revolutions about the 
centers of the hexagons. The group A is sometimes called the (2, 2, 2, 2)-group. 

A path 7 in the 1-skeleton of the hexagonal grid now is determined by a word 
in the generators of A. We prefer to think of this in a slightly different way: 7 
determines an element a (7) in the free product F = Z,*Z,*Z,. We are 
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particularly interested in closed paths, that is, elements of the kernel of F > A. 
Unfortunately, this kernel is infinitely generated: it is a free group whose genera- 
tors are given by arbitrary paths p,, followed by a circuit around one of the three 
hexagons at the endpoint of p,, followed by the p;'. 


Fic. 3.2. Tribones in three orientations. There are three possible orientations for a tribone, in an array 
of hexagons. With our labeling convention, they are labeled in three different ways. 


The standard tile, let us call it a tribone, can be laid in the plane in three 
different orientations. Circuits around the tribones in these three orientations 
trace out the elements 


T, = (ab)’c(ab)°c 
T, = (bc)°a(be)*a 
T; = (ca)*b(ca)*b. 


If w is a simple closed circuit in the plane such that the region R bounded by 7 
can be tiled by these ,tribones, then the image J(7) of a(7) in the tribone group 


T = (a,b, cla* =b* =c? =T, =T, =T; = 1) 


must be trivial. 

The relation T, says that c conjugates (ab)° to its inverse. Observe that a and b 
also conjugate (ab)° to its inverse—in fact, this is already true in F. In other 
words, (ab)° generates a normal subgroup, and it commutes with every word of 
even length. Similarly, (bc)* and (ca)*> generate normal subgroups. Together, the 
three elements generate a normal abelian subgroup J of T. 

To form a picture of T, let us first look at the quotient group 7) = T/J = 
(a, b, cla” = b* = c? = (ab)’ = (bc)? = (ca)? = 1). The graph of T, can readily 
be constructed: take an infinite collection of three types of hexagons, with their 
edges labeled by the relations C,,C, and C3. These glue together to form a 
hexagonal pattern in the plane, where each vertex has one a edge, one b edge, and 
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Fic. 3.3. Second hexagonal group. The group 7, also has a graph isomorphic to the edges of a 
hexagonal tiling of the plane. 


one c edge incident to it. The group T, acts faithfully as a group of isometries of 
the plane, generated by reflections in the edges of this hexagonal tiling: it is a 
triangle group. It is curious that even though the groups A and 7, and the labeled 
graphs (A) and I(7,) are different, when the labels are stripped they become 
isomorphic. 

If the region R can be tiled by tribones, then a(7) must map to the trivial 
element of 7’, so it maps to the trivial element of 7). In our case, the region is a 
triangular array of hexagons, and its boundary can be taken as a(a7) = 
(ab)"(ca)"(be)". 

Obviously, if m is a multiple of 3, the image J(7r)/J in Ty, is trivial. In the other 
case, that n is 2 more than a multiple of 3, it is also trivial. This is easily seen by 
tracing out the curve in our array of hexagons, or by noticing that one can add 
additional tribones along one edge to form a triangular region with side length 
n + 1, which is a multiple of 3. Since we have pushed 7 only across tribones, /(7) 
is the same for the two cases. 

Since T, was not sufficient to detect the nontriviality of /(a), we need to finish 
our job, and build a picture of 7. First, look at the path in the graph of T) 
determined by the element 7,. Start at a vertex * where the circuit C, = ababab 
goes counterclockwise around a hexagon. Then T, goes counterclockwise around 
this hexagon, then along the c edge, clockwise around the C, hexagon through 
that vertex, and back along the c edge to close. In particular, the signed total of 
C,-hexagons enclosed (counted according to degree of winding with counterclock- 
wise circuits counted positively), is 0. 

It is not hard to describe now the full group 7, which is an extension of the form 
J = Z° > T > T). We can interpret an element of T to be a vertex v in the graph 
of 7), together with a path p from * to v, subject to the equivalence relation that 
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A 


Fic. 3.4. Alternate image of tribone. By construction, the tribone relations are satisfied in the groups T 
and hence JT) = T/J. This is the image of one of the tribone relators in the graph of the group Ty. Note 
how it encircles two AB-hexagons, once clockwise and once counterclockwise. 


if q is another path from * to v, then p ~ gq if the signed totals of C,, C,, and C, 
hexagons are all 0. (Of course, if we pick one path such as p from * to v, then 
other paths from * to v are determined by three arbitrary integers, which specify 
these signed totals.) With this definition, the relations T, are obviously satisfied, 
hence the group so constructed is at least a quotient group of 7. But we have 
already seen that the kernel J of the map T > Ty) is abelian, and generated by C;,. 
In the construction, this kernel is the free abelian group on the C;,, so it must in 
fact give T. 

Once we know 7, we can read I(7) by inspection. As we saw, it suffices to 
consider the case n = 3k; the invariant is CfC*C¥%, which is obviously not 1, so the 
tiling is impossible. 

One can ask whether this method gives a lower bound on the number of holes 
one is forced to leave, in a partial tiling of R by tribones. To study this question, 
we should examine the subgroup K of T generated by elements of the form I(y), 
where y is a path in the graph of A going from * to some point v, circumnavigat- 
ing a hexagon, and returning. In other words, K is the kernel of the map T —> A. 
Note that a(y) has the form gabcabcg~', where g is arbitrary. In the group 7), 
abcabc acts as a translation. The conjugates of abcabc in T, are translations in 
three different directions spaced at 120° angles, and the subgroup they generate is 
ismorphic to Z’. In K, there are actually an infinite number of different conjugates 
of abcabc: if g acts as a translation in JT), then the commutator gabcabcg™ 'cbacba 
is trivial in T,, but it might not be trivial in T: this path may enclose an arbitrary 
number m of hexagons of type C,, and an equal number of type C, and C,. 

The subgroup K is therefore a nilpotent group, generated by s = abcabc, 
t = bcabca, and u = C,C,C;, with presentation 


K =(s,t,ulls,u] =[t,u] =1,[s,¢t] =u°). 
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C 


Fic. 3.5. Alternate image of a triangle. The triangle word (ab)"(ca)"(bc)” of size n = 
3m or n = 3m + 2 maps to the trivial element in 7. In the diagram above, if n = 3m, trace the word 
starting at the center. If n = 3m + 2, start b from the center. 


It is easy to check that every element of K is realized as J(7), for some simple 
closed curve 7 in the plane. 

Even though the invariants associated with triangular regions take larger and 
larger values in J, this does not give any information limiting the number of holes: 
for instance, three holes g,abcabcg; ' can yield u*, for arbitrarily high k. In fact, it 
is possible to tesselate the triangular region of size n with tribones except for 1 
hole, if n = 103), by placing the hole exactly in the middle, and then arranging 
concentric triangular layers of tribones around this hole. From these examples, 
tribone tilings with 3 holes are easily constructed when n = 0(3) or 2(3). It does 
give some information, however: in the case that n = 2(3) or n = 0(3), the 
conjugacy class changes (“increases”) with n, which implies that the length of 
the minimum closed loop enclosing all the holes has to go to infinity with n. In the 
case n = 1(3), the conjugacy class of J(7) is constant—since the region can always 
be tiled with a single hexagon missing, /(7) is conjugate to abcabc. However, the 
actual word changes with n, which implies that the missing hole cannot be too 
close to the boundary. Perhaps a careful analysis would show that if there is a 
single hole, it must.be exactly in the center of the triangle. 


4. Dominoes and Lozenges Revisited 


Conway’s tiling groups are quite versatile, provided you can work out the group 
determined by the tiles. Even when (or perhaps especially when) the invariant 
I(ar) gives no information which could not have been easily obtained by other 
means, the geometric picture of the graph of the group can sometimes be exploited 
to give not just an algebraic criterion, but a precise geometric criterion for the 
existence of a tiling. 

When G is a tiling group (with presentation given by a set of tiles), we define a 
measure of area in ['*(G) to be the area defined by projection to the plane: the 
area of a 2-cell is the area of a corresponding tile. When the algebraic invariant 
I(zr) is 1, the curve 7 bounding R lifts to a closed 7 in I'(G). We can ask, what is 
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the minimum area of a surface S in [*(G) with boundary 7? This area is 
necessarily at least as great as the area of R. If it is equal, then the images of the 
2-cells of S must be disjoint, so that they form a tiling of R. There are several 
approaches which are sometimes successful for calculating this minimal area, but 
there is one particular situation when there is a really definitive solution: when 
['7(G) can be enlarged, by adding 3-cells, to make a contractable 3-manifold. In 
this situation, there is a “max flow min cut” principle which guarantees an efficient 
algorithm for finding a minimal surface. 

Rather than going on with the general theory, we will illustrate this with two 
examples. First we revisit the lozenge question. 

If R is a union of triangles in the plane, and if v and w are vertices in R, 
possibly on the boundary, define d(v, w) to be the minimum length of a positively 
directed edge-path in R (possibly going on the boundary) joining v to w. This 
“distance” function d is not symmetric, since we cannot simply reverse an edge 
path. Any closed positively directed edge path has length a multiple of 3, so the 
d(v,w) is defined modulo 3 independent of path. The three vertices of a triangle 
take the three distinct values modulo 3. If R is connected, it is always possible to 
find at least one positively directed path from v to w, so d(v, w) is well-defined. 

Consider the lifting of any tiling of R by lozenges to the cubical network, ['*(L). 
This is determined by a height function h(v) for the vertices v. We can choose the 
vertical scale so that A is integer-valued, and each edge of a lifted lozenge 
increases in height by 1; the edge of the triangular network covered by the lozenge 
lifts to a diagonal of a square, and decreases in height by 2. It follows that 
h(w) — h(v) = dtu, w). 

The boundary path 7 determines a unique height function h/ on its vertices, up 
to constants. This gives a necessary condition that R can be tiled: for any two 
vertices v and w on 7, h(w) — h(v) = div, w). 

If a satisfies this necessary condition, then there is a unique maximally high 
lozenge tiling: define 


h(x) = min {d(v,w)}. 


To produce the actual tiling, place a lozenge so as to cover an edge where the 
height changes by 2. Since the three vertices of a triangle take distinct values 
moduluo 3, and since h increases by at most 1 along any edge, each triangle has 
exactly one edge’ where / changes by 2: therefore, the collection of lozenges is a 
tiling. 


Fic. 4.1. High lozenge tiling. The “highest” lozenge tiling compatible with the boundary curve. 
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There is a simple algorithm for quickly computing h, and the tiling: rather than 
spell it out, we will describe the analogous algorithm for dominoes. 

A closed path 7 in a square grid can be described by an element a(zr) of the 
free group F(x, y), which maps to the trivial element of the A = Z*. If the region 
R bounded by 7 can be filled with dominoes, then the image J(7) of a(zr) in the 
domino group G = (x, y|xy? = y*x, yx* =x’y) must be trivial. 

What does the graph of G look like? We can construct a picture in R°, as 
follows. Fill the xy-plane with a black and white checkerboard pattern. Above the 
black square [0,1] < [0,1], construct a right-handed helix, joining (0, 0,0) by a line 
segment to (0, 1,1), to (1, 1, 2), (0, 1, 3), (0, 0, 4), and so on: the x and y coordinates 
here marching forever around the boundary of the square, while the z coordinate 
increases by 1 each move. Similarly, (0,0,0) is connected to (0,1, — 1), etc. 
Construct a similar helix above each black square. Label each edge x or y, 
according to its image in the plane. Note that this creates left-handed helices 
above the white squares. The boundary of any domino in the plane lifts to a closed 
path in this graph we have constructed. Since the graph has a simply-transitive 
group of isometries, it is the graph of a group. Since it satisfies the domino 
relations, it is at least a quotient group of the domino group G. It is not hard (and 
strictly speaking, it is not logically necessary) to verify that this graph is indeed the 
graph of G. 

The curve 7 lifts to a curve 7 in the graph of G. A convenient way to denote 
this, in the plane, is to record the height of the lift next to each vertex of 7 in the 


Fic. 4.2. The domino group. The graph of the domino group is a union of square helices over the 
squares of a checkerboard, alternating in handedness. A domino anywhere in the plane lifts to this 
graph, starting at any pojnt. This illustration shows two coils of four neighboring helices. 


Fic. 4.3. Domino tiling. A tiling by 9 dominoes, lifted to the graph of the domino group. 
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plane. The rule is simple: one can start with 0 at some arbitrary vertex. Along any 
edge of a which has a black square to its left, the height increases by 1. Along 
any edge with a white square to its left, the height decreases by 1. A necessary 
condition that R can be filled with dominoes is that the height after traversing 
once around the curve is 0. 


Fic. 4.4. Domino roof. This is the tiling which the algorithm yields, when applied to a 16 X 16 square 
grid. This is the tiling which has the highest lifting to the graph of the domino group of any tiling by 
dominoes. 


There is a criterion and construction for a domino tiling, analogous to the 
construction for lozenges. Here is how the formula can be worked out, on a sheet 
of grid paper. Begin, as above, by labeling the height of each vertex of 7a. The 
heights consist of the integers in some interval, [n, m]. We will construct a height 
function on all vertices of R, beginning with n + 1, and working up. Suppose, 
inductively, that we have finished with all vertices of height less than or equal to k. 
For each vertex v of height k, and for each edge e leading from v which has a 
black square on its left, consider the second endpoint w of e. If the height of w 
has been previously defined, and if it is not greater than k + 1 leave it as is. If the 
height is defined and greater than k + 1, then a domino tiling is impossible: give 
up. Otherwise, define the height of w to be k + 1. 

If this procedure reaches a successful conclusion, each edge of R has a 
difference of heights of its two endpoints of either 1 or 3. (Note that the height 
modulo 4 is determined by the point in the plane.) Erase all the edges whose 
endpoints have a difference of height of 3. What is left is a picture of a tiling by 
dominoes. 


5. Triangles 


Here is a related sequence of tiling problems which are resistant to direct 
attempts at general solution, but translate nicely into the realm of group theory. 

Consider, again, a triangular array of dots, with N dots on each side. Is it 
possible to subdivide this array into disjoint triangular arrays of dots with M on 
each side? We suggest the reader indulge in experimentation with a few cases, 
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Fic. 4.5. Domino bubble. This illustration shows both the highest and the lowest tiling by dominoes of 
a standard checkerboard. They are isomorphic, differing only by a 90° rotation of the checkerboard 
(interchanging colors). The upper tiling is shown in the upper plane as well as the upper surface of the 
bubble, the lower tiling in the lower plane and the lower surface of the bubble. The bubble they form 
encloses the lift of any tiling by dominoes. Possible tilings are “like” Lipschitz functions in the square 
with Lipschitz constant 1, as measured in the Manhattan metric. The limits of domino tilings, lifted to 
the graph of the group, as the grid size goes to zero, are exactly such Lipschitz functions. 


before reading further. For example, the cases M = 2 with N ranging from 2 to 12 
are interesting. 


As in the case of the tribones, this translates into a tiling problem: given a 
triangular array of hexagons with N hexagons per side, can one tile it by tiles T,, 
which are triangular arrays of hexagons M per side? We can express this with 
notation as in the case of tribones: label the edges of the underlying hexagonal 
tiling by a’s, b’s and c’s. Given a path 77 in the plane, it is described by an element 
a(i) of F = (a,b, cla” = b* = c* = 1). If the region R bounded by 7 can be 
tiled by the copies of 7,,, then the image /(7) of a(7) is trivial in the group 


Gy = (a,b,cla* = b* =c? = 1,ty = 1), 
were t,, represents the boundary curve of the tile T,,, 
ty, = (ab) (ca) (be). 


A parallelogram of hexagons with M hexagons on one side and M + 1 on the 
other can be tiled by two copies of 7,,. This implies that (ab)” commutes with 
(bc)“*! and with (ca)“*!, and so forth. 

These relations imply that (ab)” commutes with (bc , and they also 
imply that (ab)“*! commutes with (bc)“”*. Combining these two facts, it 
follows that (ab) commutes with (bc)“™“*). Geometrically, one can tile an 
M X M(M + 1) parallelogram and an M+ 1 X M(M + 1) parallelogram. Their 
difference isa 1 X M(M + 1) parallelogram: this can be tiled in a certain algebraic 
sense as the difference of the two. 

It will simplify the picture at this point if we pass to the subgroups F° and Gy, 
generated by words of even length. Since all relations have even length, the 
wordlength modulo 2 describes a homomorphism of F and G,, to Z,, and these 
subgroups have index 2. The group F* is the free group on 2 generators, but a 
more symmetric description is 


yM(M + 1) 


F° = (x,y, z|xyz = 1), 
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where x = ab, y = bc, and z = ca. A presentation for the group Gj, is obtained 
by adjoining relations coming from t,, to F*: it requires two relations, one 
obtained by transcribing t,, directly, and the other transcribing the conjugate of t,, 
by an element of odd length. Using t,, = 1 and bt,,b = 1, we obtain 


M,,M,M 


Ge, = (x,y, zlxyz = 1, x yM@2M4 = 1, Dy MDZ ED = I, 


G¢, has an interesting alternate generating set: X = x”, X' =x “*, together 
with Y, Y'’, Z and Z' defined similarly, clearly generate. We have already seen that 
X, Y, and Z commute with X’, Y’, and Z’. 

The elements s = X™”t!,¢4 = Y™*!, and u = Z™”*! commute with everything in 
G‘,, so they generate a central subgroup J which is Z° or a quotient. Let 
Gy, = Gé,/J. We will analyze the structure of Gy,, and from that construct Gj. 

In Gy,, X, Y, and Z satisfy relations 

XYZ = 1, XMt+tLW yMtin FMt1 = 4. 

These relations describe the orientation-preserving (M+ 1,M + 1, M + 1) trian- 
gle group, which acts as a discrete group of isometries on the Euclidean plane if 
M = 2 and on the hyperbolic plane if M > 2. We have not checked that these 
generate all the relations on X, Y, and Z, but we immediately deduce that the 
subgroup H of G;, generated by X, Y, and Z is a quotient of this triangle group. 
But there is a homomorphism f of the original group G,, to the full triangle group 
(including reflections), defined by sending a, b, and c to reflections in the sides of 
a w/(M + 1), 7/(M + 1), 7/(M + 1) triangle. The relation t,, = 1 is satisfied, 
since in this group (ab)” = ba so that (ab)”(ca)"(bc)” = (baXacXcb) = 1. Note 
that f sends X to ba, Y to ac and Z to cb, that is, to the standard generators of 
the (M+ 1,M+1,M + 1) triangle group, and it sends s, t, and u to 0. There- 
fore, H is isomorphic to the orientable (M + 1, M+ 1, M + 1) triangle group. 

A similar analysis shows that the subgroup H’ generated by X’, Y’ and Z’ is the 
orientable (M, M, M) triangle group. This group acts on the sphere, the Euclidean 
plane, or the hyperbolic plane when M = 2, M = 3, or M => 4. The analogous 
homomorphism f’ maps G,, to the full (M, M, M) triangle group, mapping a, b, 
and c to the standard generators. 

The two subgroups H and H' intersect (as seen from the effects of f and f’) 
they generate Gj,, and they commute with each other. Therefore, G), is the 
product H xX H' of the two triangle groups. 

Now we need to determine the kernel J of the quotient Gf, > G),, and the 
structure of the central extension. As in the tribone case, we can do this geometri- 
cally, in terms of areas enclosed by curves. The graph I of the full (M + 1, 
M +1,M + 1) triangle group is formed from copies of three kinds of 2(M + 1)- 
gons, with perimeters labeled (ab)”, (ca)” and (bc)”, with one of each kind 
meeting at each vertex. Arrange the orientation so that 1 is an “even” vertex that 
is, the a, b, and c edges emanating from 1 are in counterclockwise order. Then the 
relation t,, based at vu encloses positively one copy of each type of polygon, while 
the conjugate bt,,b encloses negatively one copy of each type of polygon. 

Similarly, the graph I’ of the full (MM, M, M) triangle group is made from three 
kinds of 2M-gons. Starting at the 1, which we suppose is an even vertex, the 
relation t,, encloses positively one copy of each type of polygon, while bt,,b 
encloses negatively one copy of each. However, in the case M = 2, the entire 
graph is finite: it is the 1-skeleton of a cube, and the number of polygons enclosed 
by a curve is well-defined only modulo 2. 
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First let’s deal with the case M > 2. We can define an extension K of Gy, as an 
equivalence relation on elements of F%, as follows. An element g of F* deter- 
mines paths p(g) in I and p’(g) in I’. We define g to be equivalent to h if p(g) 
ends at the same point as p(h), p’(g) ends at the same point as p’(h), and if the 
closed loop p(g)p~'(h) encloses the same numbers of ab-polygons, bc-polygons, 
and ca-polygons as p'(g)p’ '(h). 

In particular, an element of the kernel of the map of K to H X H’ maps to 
closed loops in both pictures, and is determined by the triple of differences of the 
number of polygons enclosed. The elements s, t and u map to (1,0, 0), (0, 1, 0), 
and (0,0, 1). It follows that K = G¢,, and J = Z° (provided M > 2). 

The boundary of the size N triangle 7,, can be described by the element 
ty = (ab)(ca)"(bc)’. The path p(t,,) in I closes only when N is 0 or —1 mod 
M + 1, while the path p'(t,,) closes only when N is 0 or —1 mod M. Since M and 
M + 1 are relatively prime, there are four solutions modulo M(M + 1): 0, M, 
M? — 1, — 1. For values of N satisfying one of these congruence conditions, the 
invariant in G;, is 0, so the invariant is in J; it is a positive multiple of (1, 1,1) in 
all but the trivial case N = M. 


THEOREM (Conway). When N > M > 2, the triangular array T, of hexagons 
cannot be tiled by Ty,’s. 


This analysis has an interesting variation case M = 2. Given two elements g 
and h of F*°, we can define them to be equivalent if p(g) and p(h) have the same 
endpoints, p’(g) and p’(h ) have the same endpoints, and if the numbers of 
polygons of the three types enclosed by the path p(g)p(h)~' is a multiple k of 
(1,1,1) which has the same parity as the number of polygons enclosed by 
p'(g)p(h)~'. This defines a central extension of H X H' by Z? modulo the 
subgroup generated by s*t*u* = 1. To justify that this group is in fact GS, we must 
prove that s*t*u* = (ab)'!*(ca)'*(bc)"” = 1 in this group, or even better, that it is 
possible to tile T,,. Such a tiling can be found fairly easily—see FiGure 5.1, the 
12-stack by 2-stacks. 


Fic. 5.1. The 12-stack by 2-stacks. The triangle T,, can be tiled by T,’s. 
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The computation of the mod 2 invariant for tilings by T,’s can be rather 
annoying when done directly. However, there is a neat trick, which enables one to 
see this invariant geometrically: most regions which have a multiple of 3 hexagons 
can be tiled easily by T,’s along with tribones. The boundary abababcabababc of a 
tribone maps to closed paths in both I’ and I’. In I, it encloses a net of 0 of each 
type of hexagon, as we saw before. In I”, this curve winds counterclockwise 1.5 
revolutions about an ab-face of the cube, goes down a c-edge to the opposite face, 
winds 1.5 revolutions counterclockwise (with respect to the orientation of the 
square), and goes up again to close. It is therefore equivalent, in terms of which 
kinds of squares it encloses, to abcabc, which is an odd multiple of (1, 1, 1). 

Therefore, if a region can be tiled with a collection of T,’s together with an odd 
number of tribones, it cannot be tiled with 7,’s. For 0 < N < 12, only for the 
values 2, 3, 5, 6, 8, 9, 11 is the number of tiles a multiple of 3. One quickly finds 
that in the cases T;, T;, T, and 7, there is a tiling by one tribone and the rest T,’s, 
while T,, JT), and T,, can be tiled. 

Given any tiling or partial tiling of T,, with k > 1, it can be extended to a tiling 
or partial tiling of 7,,,, by adding a 12 X k parallelogram, together with a Tp. 
The 12 x k parallelogram can be tiled by subdividing into 2 X 6 and 3 X 6 
parallelograms. 


THEOREM (Conway). A triangular array T,, of hexagons can be tiled by T,’s if and 
only if k is congruent to 0, 2, 9, or 11 modulo 12. 


REFERENCE 
[CoLa] J. H. Conway and J. C. Lagarias, Tiling with Polyominoes and Combinatorial Group Theory. 
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The popular Brief Calculus—now part of a remarkable series! 


Brief Calculus with Applications Finite Mathematics 


Third Edition 1991 Cloth 500 pages 


1991 Cloth 812 
0 pages Finite Mathematics with Calculus 


Brief Calculus with Applications 1991 Cloth 918 pages 
Alternate Third Edition 
1991 Cloth 648 pages — Roland E. Larson, 
The Pennsylvania State University, 
Roland E. Larson and Robert P. Hostetler, The Behrend College 
both of The Pennsylvania State University, Bruce H. Edwards, University of Florida 
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Bruce H. Edwards, University of Florida 
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Understandable Statistics Fun and Games Basic Mathematics 
Fourth Edition A Text on Game Theory Third Edition 
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GRAPHS AND THEIR USES 


Oystein Ore 
Revised and updated by Robin J. Wilson 


In 1963 Oystein Ore wrote this classic vol- 
ume, which was published in the New Math- 
ematical Library Series. This elegant book 
has provided students and teachers with an 
excellent introduction to the field of graph 
theory for close to thirty years. Robin Wil- 
son's revision adds strength to the book by 
updating the terminology and notation bring- 
ing them in line with contemporary usage. 
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Toetjes 


TomAs FEpDER,* Computer Science Department, Stanford University, Stanford, CA 94305 


ToMAS FEDER received his B.S. in Mathematical and Computational Sci- 
ence from Stanford University, California. He is currently working towards 
his Ph.D. in Computer Science at Stanford University. He has also worked 
at the Xerox Palo Alto Research Center, the IBM Almaden Research 
Center in San Jose, and AT&T Bell Laboratories at Murray Hill. His 
research interests are in theoretical computer science, particularly algo- 
rithms and computational complexity. 


Abstract. A number is secretly chosen from the interval [0, 1], and players try in turn to guess this 
number. When the secret number is revealed, the player with the closest guess wins. We describe an 
optimal strategy for a version of this game. 


1. Introduction. The Toetjes Problem was posed at the Programming and 
Problem Solving Seminar [1] held at Stanford University in the winter of 1987. It 
was suggested by Sape Mullender, who described it as follows: 


In Amsterdam, where I grew up, dessert is usually referred to as “toetje” 
(Dutch for “‘afters”). The problem of allocating a leftover toetje to one of the 
children in my family became the Toetjes Problem. The algorithm was the 
following: First my mother would choose a secret number between one and a 
hundred. Then the children, in turn, from youngest to oldest, would try to 
guess the number. After the last guess my mother would indicate whose 
guess was closest to her secret number and the winner would get the toetje. 


Now that I have a degree in mathematics, the problem still puzzles me: 
Given that the secret number is chosen randomly from the interval [0, 1], 
what is the optimal strategy for choosing a number for the ith child in a 
family of n children? The ith child knows what the first 1 — 1 children chose, 
and knows that all the children choose optimally (i.e., choose to maximize 
their own chance without consideration for the chances of any other child in 
particular). 


We will study Mullender’s “continuous” version of the problem, in which the 
secret number is chosen randomly from the interval [0, 1]. Notice that the descrip- 
tion we have given so far is, in fact, incomplete. A first ambiguity arises because 
the optimal move for a player may not be unique. If this is the case, the criterion 
that the player uses to choose among equally desirable moves can influence the 
strategy followed by earlier players. We will resolve this ambiguity by assuming 
that there is a given tie-breaking rule that all players must follow when choosing 
among several equally desirable moves. 

The choice of the tie-breaking rule affects the play quite drastically. To 
illustrate this, let x,,..., x, be the (distinct) numbers chosen by the players in the 


*This research was primarily supported by a Bell Laboratories Scholarship, and in part by the 
National Science Foundation under grant CCR-86-10181, and by Office of Naval Research contract 
N00014-87-K-0502. 
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interval [0, 1]. Suppose player n plays between two earlier players i and j such that 
no other player has already played between them. Then his payoff (ie., his 
probability of winning) is the same no matter where in the interval (x,, x,) he plays: 
it equals (x, — x,)/2, half the interval size. However, his choice will affect the 
payoff (and hence the strategy) for players i and j. 

A second difficulty occurs because, in this continuous version of the problem, 
each player has an infinite number of choices, so there may not be an optimal 
move. Instead, it can happen that for each possible choice there is another choice 
that gives a slightly better payoff. For example, in the two-player game, suppose 
the first player chooses 1/2; then the second player will get a better payoff the 
closer he plays to the first player, so there is no “‘optimal’? move, but instead a 
family of moves arbitrarily close to optimal. We will later see that there are 
situations where a player can obtain a better payoff the closer he plays to a certain 
number €, while playing exactly at € would give him a very low payoff. We handle 
this difficulty by defining an optimal /imiting play which is such that all players can 
obtain payoffs arbitrarily close to optimal by playing sufficiently close to this 
limiting play. 

In this paper we characterize the optimal strategy under two simple tie-breaking 
rules: the “closest to the first player’ rule and the “rightmost” rule. In particular, 
we prove that under the first of these tie-breaking rules, the payoff for the first and 
last players is exactly half the payoff for each of the remaining players. Under the 
second tie-breaking rule, the last two players get a payoff equal to half the payoff 
of each of the other players. 

The remainder of the paper is organized as follows. In section 2 we give a 
rigorous definition of the game, which resolves the two ambiguities mentioned 
above. In section 3 we develop a plausible optimal playing strategy by making some 
assumptions about the tie-breaking rule. Then in section 4 we prove that the 
strategy of section 3 is in fact optimal under a simple tie-breaking rule, the “closest 
to the first player” rule. Section 5 extends the results of the previous section to the 
“rightmost” tie-breaking rule. Section 6 explores the “random” tie-breaking rule. 


2. Definitions. We now give a precise definition of the “optimal” playing 
strategy. Given the & distinct numbers x,,..., x, chosen by the first k players, we 
consider limiting plays for the remaining n-—k_ players of the form 
(Xp445+++5X,_)~), Where < is an ordering of the numbers x,,...,x,, that extends 
the usual < ordering into a total ordering. Intuitively, a limiting play of this form 
indicates that the last n — k players play arbitrarily close to x,,,,..., x, and in 
such a way that the resulting ordering of the n players in the [0, 1] interval is given 
by the total ordering ~< . While the numbers played are all distinct, some of the 
n — k limiting values can in fact be equal to each other or to the numbers chosen 
by the first k players. In the case where two or more values are equal, the 
left-to-right order of the m moves in the [0,1] interval needs to be specified: only 
then can the payoff for each player corresponding to this limiting play be 
evaluated. This is the reason for including in the limiting play (x,,,,...,*,, < ) an 
ordering ~< that extends the partial ordering < on the numbers x,,...,x, toa 
total ordering. 

The winning probability (or payoff) p(i, x,,...,%x,, < ) for player i under such a 
limiting play can now be easily computed [see Fic. 1]: (a) If the move x, is between 
x, and x, in the ~< ordering, then the payoff is (x; — x,)/2 + (x, — x,)/2 = 
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(a) @ TEEPE EEE 


x, Xx, XK 
(b) 
) Xx, XK 
(c) | @ HELTER 
*) x 1 


Fic. 1. Payoff for player /. 


(x, — x,)/2. (b) If x; is the smallest move and is followed by x,, then the payoff is 
(x, — 0) + (x, —x))/2 = (x; + x,)/2. (c) If x; is the largest move and is preceded 
by x;, then the payoff is (x, —x,)/2 + 1 —x,) =1— (x; +x,)/2. We use this 
payoff function to define the optimal limiting play opt(x,,...,x,) = 
(Xp419+++,X,, ~~ ) for the last n — k players, given the k distinct numbers x,,..., x, 
chosen by the first k players. The definition is by induction on the number n — k 
of players left to play. The base case k = n is simply opt(x,,..., x,,) = (« ), where 
< isthe < ordering on the distinct values x,,..., x,. Now we assume inductively 
that the function opt(x,,...,x,) has been defined for a given k, and we define 
opt(x,,...,X,_1). 

So fix the numbers x,,...,x,_, chosen by the first k — 1 players. We say that 
(X,,..-,X,, < ) is a limiting play available to player k if for all « > 0 there are 
numbers y,,...,y, with |y, —x,| << e« such that opt(x,,...,%*, 1, ¥,) = 
(Vegir--+>¥y> < ). In other words, the limiting play is available to player k if 
player k can play close to x, so that the remaining players will then play close to 
Xpapreey Xp: 

The optimal limiting payoff @ for player & is now defined to be the least upper 
bound of the payoff p(k, x,,...,x,, < ) over all limiting plays (x,,...,%,,~< ) 
available to player k. We can show that there exists a limiting play available to 
player k which achieves this maximum payoff a by considering a sequence of 
limiting plays available to player k that achieve payoffs arbitrarily close to a, and 
then using the Bolzano-Weierstrass theorem to obtain a limit point (x,,...,x,, < ) 


for this sequence. We let S be the set of limiting plays (x,,..., x, < ) available to 
player k which give player & this maximum payoff a; thus S is the set of optimal 
limiting plays for player k. It is not difficult to show that S is in fact a closed set’. 

We can now apply any chosen tie-breaking rule to pick an optimal limiting play 
(x,,..-,X,, < ) from S. We could, for example, always pick a limiting play from S$ 
that has a largest x, value (such a play exists because S is a closed set); this is in 
fact a particular case of the family of “rightmost” tie-breaking rules that we will 
consider in section 5. The chosen (x,,...,x,, <=) is then the value of 
opt(x,,..., X,_,), completing the inductive definition. 

Finally, if all players play optimally and follow the given tie-breaking rule 
to choose among equally desirable moves, their numbers will be given in the 
limit by opt() = (x4,...,x,, <), with corresponding winning probabilities 
PU, X4,.--,Xp, <). 


n? 


'That is, if for all ¢ > 0 there are y, with |y, —x,|<e such that (y,,..., Vn ~) €S, then 
(x, peeay Xn ~S ) e€ §. 
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3. A candidate strategy. Finding a constructive characterization of the optimal 
strategy opt(x,,...,x,) is, under many tie-breaking rules, a difficult task. It turns 
out that for the “closest to the first player” tie-breaking rule considered in the next 
section the optimal strategy is, surprisingly, fairly simple. However, instead of just 
giving the strategy and proving that it is optimal (we will do this in the next 
section), we would like to describe here the properties of the tie-breaking rule 
which can lead us to guess such a strategy. These properties might in turn be 
useful for finding optimal strategies under other possible tie-breaking rules. 

After examining the problem for small values of n, one observes that many 
tie-breaking rules seem to obey some form of the following optimality principle: A 
player who does not play optimally for himself can only increase the chances of winning 
for the players that follow him. 

We use restricted versions of this principle to obtain a plausible playing 
strategy. When applied to the last player, this principle says that the chances for 
the last player are worst when the players before him play optimally (since his 
chances only increase when those players play suboptimally). We thus make the 
assumption that all players play so as to make the payoff of the last player as small 
as possible. 

The optimal payoff for the last player given the numbers chosen by the players 
before him is easy to compute. Suppose the moves of the first n — 1 players have 
partitioned [0, 1] into 1 intervals of respective lengths a,,...,a,,, from left to right. 
Then the optimal limiting move for the last player gives him a payoff of 
max(a,,@,/2,...,da,-,/2,4,), since the payoff when playing anywhere between 
two other players is half the distance between the two players, while the payoff 
when playing in the leftmost (rightmost) interval can be made arbitrarily close to 
the interval size by playing sufficiently close to the leftmost (resp., rightmost) 
player. Now suppose that the first & — 1 players have already played, for some 
k > 2, partitioning [0,1] into k intervals J,,...,J, of respective sizes c,,...,C,, 
from left to right. Given any real B > 0, the payoff of the last player can be forced 
to be B/2 or less by the n — k players before him that are left to pray, provided 
that at least | ,| — 1 players move into the interval J, for each 1 <i <k. Here p, 
is the ratio 


c,/B ifl<i<k, 
0; = Lp a (1) 
c/B+1/2 ifi=lori=k. 


This is so because in the case 1 <i <k the interval J; must be partitioned into 
subintervals of size B or less, while in the cases i = 1 and i = k the subintervals at 
both ends of [0, 1] must be of size B/2 or less, to ensure that the payoff for the last 
player is at most B/2. This can be achieved if and only if 


» (Tp; | — 1) <r, (2) 


l<i<k 


where r=n—k&+ 1 is the number of players left to play. If, following the 
optimality principle, the players before the last one play so as to make the payoff 
B /2 of the last player as small as possible, then B must be the least value of B 
satisfying (2). An equivalent definition for this key parameter is that B is the 
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largest B satisfying 


» Loe) ers (3) 
l<i<k 

in fact B is the only value of B which satisfies (2) and (3) simultaneously. This is 
because the summands in (2) and (3) are equal unless p, is an integer, in which 
case they differ by 1. Since only r players are left to play, it is natural in view of (3) 
to allocate at most | p,| players to the interval J,, including player n. Then if r,; out 

of the last r players move into the interval Mi we will have 
[e,.l|-—1<r,<\|p,] forall <i<k, with) (=P. (4) 


l<ick! 


We will try to determine more precisely where these r,; players should play inside 
the interval J,, for 1 <i<k. The first r, — 1 players to play in J; partition this 
interval into r; subintervals. There must be a subinterval J; among these of size at 
least B, since r, < |c,/B] by (4). The last player p to play in J; should then be able 
to obtain a payoff of at least B/2 by playing inside J,;. Now the optimality 
principle applied to player p says that the r, — 1 players that play in the interval J; 
before player p will play so as to make the size of the subinterval J, as small as 
possible. However, the other subintervals of J; should not be made larger than B 
because otherwise player n could move into J, and obtain a payoff greater than 
B/2. Thus the first r, — 1 players in J; should partition this interval into r; — 1 
subintervals of size B, and one subinterval J; of size at least B for player p. If we 
assume that player p plays in J; so as to create a subinterval of size B as well, we 
obtain a simple playing strategy: The r, players that play in J,,1 <i <k, partition 
this interval into r, subintervals of size ‘B and a leftover subinterval K, of size at 
most B. 

This is illustrated in Fic. 2. A subinterval of size B /2 (instead of B) is used at 
both ends of [0, 1] to keep the potential payoff for the last player bounded by B/2. 
Also notice that we have placed the leftover subinterval K, in each J; as close as 


possible to the number x, played by player 1; this will be justified by the 
tie-breaking rule of the next section. 


0 X4 X,%9 xy X3 1 


Fic. 2. Optimal play (x,,...,%,, <) given (x,,...,x,_,) for m = 17 and k = 5, 
under the “‘closest to the first player” tie-breaking rule. Here x, =x, with x, ~ Xx». 


4. Optimal strategy under the “closest to the first player” tie-breaking rule. In 
this section we characterize the optimal playing strategy under the ‘‘closest to the 
first player” tie-breaking rule. This tie-breaking rule assumes that the optimal 
limiting play (x,,...,%,, < ) chosen by a player k > 2 is such that no other 
optimal limiting play that has the first k players in the same relative order in the 
interval [0, 1] as the chosen play can have player k closer to the first player. Notice 
that we make no assumptions about how the tie-breaking rule chooses among 
equivalent moves for the first player, or about how it chooses the relative position 
of a player k > 2 with respect to earlier players if several optimal alternatives are 
available: It turns out that the optimal strategy is essentially the same regardless of 
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how these choices are made. Hence our solution applies not just to a single 
tie-breaking rule, but rather to a family of possible tie-breaking rules. 

We recall some definitions from the previous section. Suppose for now that 
k > 2. The numbers x,,...,x,~, chosen by the first k — 1 players partition [0, 1] 
into k intervals J,,...,J,, from left to right, of respective lengths c,,...,c,. Let B 
be the unique value of B satisfying both (2) and (3), where the p, are given as 
functions of B by (1), and r =n — k + 1; then B is also the smallest B satisfying 
(2) and the largest B satisfying (3). 

We define |p,| marked positions J,1),..., J,(.p;|) inside each J;. Let JO) and 
J(lp;| + 1) be the endpoints of J; which are farthest from and closest to x,, 


L 


respectively. Then the marked positions inside J, are given, for j = 1,2,...,|p,], by 
(see Fic. 2) 
| jB if1 <i<k, 
FAG) _ J;(0)| — A, ; (S) 
(j-1/2)B ifi=lori=k. 


We associate with each marked position J,(j) the payoff,that a player at J,(j) 
would get if each marked position was played by one player. Therefore, the payoff 
of every marked position equals B, except for the last marked position J; ilp:) of 
each interval J,; this last position has a payoff of (B + /(i))/2, where /(i) is the 
size of the leftover interval K; = (J,(.p;,1), J,lp;] + 1) and is given by 


. c, mod B if1 <i<k, 
(i) = (6) 


(c, + B/2)mod B- ifi=1ori=k. 
We can now state our main result: 


THEOREM 1. The optimal limiting play under the “closest to the first player” 


tie-breaking rule is given for k > 2 by opt(x,,...,x,_,) = (x,,...,X,, < ), where 
Xp5+++,X, are n —k +1 marked positions of highest associated payoff, in order of 
decreasing payoff. 


Proof. Let us first determine which marked positions will be played according to 
this theorem. Notice that the total number of marked positions is at least r, the 
number of players left to play, by equation (3). Among these marked positions, 
those of least payoff are the positions J;([p;|) with p; an integer, which have 
I(i) = 0 and a payoff of B/2. The other marked positions will thus be used first; 
however, there are only | p;| — 1 of these other marked positions in each J,, and 
hence less than r in total by (2), so the theorem implies that one or more of the 
marked positions of low payoff B /2 will have to be used as well. From these 
remarks, it follows that the number of players that will end up in each J, 
(according to the theorem) satisfies (4). 


We will prove the theorem by induction on the number n — k + 1 of players 
left to play. For the base case k = n, the marked positions are endpoints J,(1p;] + 1) 
of highest payoff, and these are also the optimal moves for the last player under 
the “closest to the first player” tie-breaking rule. For the inductive step, suppose 
without loss of generality that player & <n plays in an interval J; with the 


endpoint J,(0) to the left and the endpoint J,(_p;| + 1) to the right. Then J, is split 
into a left interval L = (J,(0), x,) and a right interval R = (x,,J/,0p;] + 1). The 
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new set of k + 1 intervals will then have a new parameter B' satisfying (2) and (3), 
and we can assume inductively that the last n — k players will Play as indicated by, 
the theorem for this new parameter B'. Notice that B' > B, since B is still a 
solution to (3) for this new set of intervals, while B’ is by definition the largest 
solution to (3). 


Lemma 1 (Low payoff for unmarked positions). If player k plays between Jj — 1) 
and Jj) (or at J(j)), his payoff will be no greater than the payoff associated with 
marked position J, ( J), for 1 <j <\p,], and smaller than B/2 forj =\|p;| +1. 


Proof. We claim that if player k plays at x,, between J,(j — 1) and J,(j) (or at 
J(j)), then for j > 1 some player will play in L at J,(j — 1) or higher, and for 
J <\p;] some player will play in R at J,(j + 1) or lower. This claim implies the 
lemma: For 1 <j <|p,], the payoff associated with marked position J,(j) is at 
least as high, since this payoff corresponds to a situation where x, = J,(j) and no 
player plays between Jj — 1) and Jj + 1). For j =|p;] + 1, either j > 1, in 
which case the payoff is less than B /2 because [(i) < B, or j = 1, in which case 
the payoff is less than B/2 because | p,| = 0 so that J, is a small interval. 


L 


To prove the claim, we consider two cases. If B' = B, then some player after 
player k will play at J,(j — 1), for j > 1. Also, some player will play at distance B 
from x, in R, and hence at J,(j + 1) or lower, for j <[ p; |. The remaining case is 
B' > B. Since the new parameter B’ is greater than B, the number of players that 
can play in each interval is now at most j — 1 for L, at most [p,;| — j for R, and at 
most |p,] — 1 for each of the other intervals J, with i’ # 1. By equation (2), these 
numbers can only add up to r — 1, the number of players left to play after player 
k, if exactly j — 1 players play in L, exactly [p;] —j in R, and exactly [p,;| — 1 in 
the remaining J,. Then, for j > 1, the (j — Dth player in L (from left to right) is 
above J,(j — 1). Also, for j <|p,], if the leftmost player in R were at J,(j + 1) or 
above, then there could be no more than [p,] —j — 1 players in R, since B’ > B, 
so the leftmost player in R must in fact be below J,(j + 1). This proves the claim 
and completes the proof of the lemma. O 


Lemma 2 (Preservation of marked positions). Let J,(j) be a marked position of 
highest associated payoff. Then among the limiting moves (x,,...,X,, < ) available 


to player k there is one in which x, = Jj), and x,,...,X, aren —k + 1 marked 
positions of highest associated payoff, in order of decreasing payoff. 


Proof. If player k plays at a marked position J,(j) (this is feasible unless 
I(i) = 0), then B also satisfies (2) for the new set of k + 1 intervals, so B’ = B. 
The new set of k + 1 intervals therefore has a new marked position L(j) at J,()), 
with L(j) < J,(j). If none of the remaining players plays at L(/), then they will all 
play at the old marked positions in order of decreasing payoff, as desired. On the 
other hand, if one of them plays at L(j), then the condition of the lemma is not 
satisfied and in fact player A obtains a low payoff of B/2 or less. 

However, it is still possible for player k to obtain the limiting play described in 
the lemma by playing just below J,(j), at J,(j) — e, where e < B — I(i). Now if 
B' = B, then the marked positions remain the same after player k has played, 
except for the marked positions J,(j’) with j’ > j, which change to R(j’ —j) = 
Jj’) — e. Letting « go to zero gives a limiting play as stated in the lemma. The 
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remaining case Is B’> B. Then, as in the proof of Lemma 1, we must have j — 1 
players in L,[p;| —j players in R, and | p,;|— 1 players in the remaining J,. As 
before, the leftmost player in R must be below Jj + 1) (provided that [p,;] — j > 0 
so that R has at least one player), so B' < B+ «; if we let ¢ go to zero then B’ 
approaches B and the positions of the players approach the marked positions of 
highest payoff. If[p,] — j = 0, then J,(/) is a marked position with payoff B/2 and 
can only be a position of highest payoff if{p,,|—1 Pequas zero for all intervals J,’. 
But [p,|— 1 is the number of players in each J,, so player k would be the last 
player, contrary to the assumption that k <n. 5 


We use these two lemmas to prove the theorem. If a is the payoff associated 
with a marked position J,(j) of highest payoff, then player k can obtain a payoff of 
a in the limit by playing at J,(j) or just below J,(j) as in Lemma 2. On the other 
hand, by Lemma 1, the only plays that could possibly give the same payoff a to 
player k involve playing between J,(j — 1) and J,(j), where J,(j) is a marked 
position of payoff a, but then position J,(j) is closer to x, and hence should be 
preferred by player k under the “closest to the first player” tie-breaking rule. Thus 
the optimal limiting play is the one given in Lemma 2, completing the induction 
and proving the theorem. O 


To complete the analysis, we must examine the optimal strategy for the first 
player. Consider the n — 1 positions p, = (Qi — 1)/2(m — 1) for 1 <i<n-—1. 
The above theorem implies that if player 1 plays at one of these nositions, then the 
remaining n — 1 players will play in the limit at these n — | positions (one of them 
coinciding with player 1), giving player 1 a payoff of 1/2(m — 1) [see Fic. 3]. If on 
the other hand player 1 does not play at one of these positions, the remaining 
n — 1 players will play near these n — 1 positions but closer to player 1, giving him 
a payoff smaller than 1/2(n — 1). This gives: 


L—9——__9——_co-—-—_o——_0--_o-_0-——_—o-——_o— 


0 X3 x9 X Xp X4 X5 uw Xy— 4 1 


Fic. 3. Optimal play (x,,..., X,, ~) for n = 10 under the “closest 
to the first player” tie-breaking rule. Here x,, =x, with x, < x,, 


CoroLiary 1. In the optimal limiting play opt() = (x,,...,%X,, < ) under the 
“closest to the first player” tie-breaking rule, the limiting moves x,,...,X,—, Of the 
first n — 1 players are the numbers 1/2(n — 1),3/2(n — 1),...,(@2n — 3)/2(n — 1), 
in some order, and the limiting move x, of the last player equals x,. This gives a 
limiting payoff of 1/2(n — 1) for the first and last players, and 1/(n — 1) for the 
remaining players. 


Thus the youngest and the oldest children will be less likely to get the leftover 
toetje. 

The preceding analysis can in fact be carried through even if the choice among 
equally desirable intervals is made at random or nondeterministically (i.e., chosen 
arbitrarily by each player when it is his turn to play), and even if only the last 
players—those that are left to play once each interval has at most one marked 
position—use the “closest to the first player” tie-breaking rule to select a position 
inside the chosen interval. 
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5. The “rightmost” tie-breaking rule. We now consider the “rightmost”  tie- 
breaking rule. This tie-breaking rule assumes that the optimal limiting play 
(X,,.-+,X,, < ) chosen by a player k is such that no other optimal limiting play 
that has the first k players in the same relative order in the interval [0,1] as the 
chosen play can have player & further to the right. Again, we make no assumptions 
about how the tie-breaking rule chooses the relative position of a player with 
respect to earlier players if several optimal alternatives are available, so our 
solution actually applies to a family of tie-breaking rules. 

It turns out that the solution for the “rightmost” tie-breaking rule is very similar 
to the solution for the “closest to the first player” rule, provided that we make a 
few changes to the marked positions. We again consider the intervals J,,..., J,, 
and define marked positions as follows. Let J,(0) and J,([p;] + 1) be the left and 
the right endpoints of J, respectively. If i < k, the |p,;| marked positions for J, and 
their associated payoffs are defined as before. For J,, on the other hand, we define 
|p,| marked positions by letting J,(j) = J,(0) + jB if 1 <j <\p,| and J,A.p,) = 
Jlp;) + D- B/2. Therefore, the associated payoff of every marked position in 
J, equals B, except for the last two marked positions J,(lp,|— 1) and J,(.p,)); 
these two positions have a payoff of (B + /(k))/2, where /(k) is the size of the 
leftover interval K; = (VJ, p,] — 1), 4,Lp,))) and is given by /(k) = (c, + B/2) 
mod B. In particular, if p, is an integer with p, > 2, then we have a double marked 
position J,(\p,] — D = J..p,) with J,.dp,| -— D < Alp, and each of these two 
positions has an associated payoff of B/2. For the particular case p, = 2, we will 
also need to consider a special position J,() just to the right of J,(0), that is, 
JQ) = J,(0) with J,(0) < J,Q; this special position is not considered a marked 
position. 


Using this new set of marked positions, the results of the previous section for 
the “closest to the first player” tie-breaking rule can be extended to the “‘right- 
most” tie-breaking rule as well, provided that the somewhat subtle situations that 
arise when a double marked position is present are properly treated. 


THEOREM 2. The optimal limiting play under the “rightmost” tie-breaking rule is 
given for k > 2 by opt(x,,...,X,_1) = (X,,..-,X,, < ), where one of the following 
conditions holds: 


1. There is no double marked position, and x,,...,X, are marked positions of 
highest payoff, in order of decreasing payoff. 

2. There is a double marked position, two x, among x,,...,X, are at the double 
marked position, and x,,..., X, are marked positions of highest payoff, in order 
of decreasing payoff. 

3. There is a double marked position, only one x; among X,,...,X,—1 18 at the 
double marked position, and X,,,... 5 Xj) Xj44)+++»X, ave marked positions of 


highest payoff, in order of decreasing payoff. 

4. There is a special position, x, is at the special position, no x, is at the double 
marked position, and X,,...,X,—, are marked positions of highest payoff, in 
order of decreasing payoff. 


If we now consider, for each possible move of the first player, the optimal play 
for the remaining players described by the four cases in the theorem, we obtain: 


CorOLLARY 2. In the optimal limiting play opt() = (x,,...,%X,, < ) under the 
“rightmost” tie-breaking rule, the limiting moves x,,...,X,—, Of the first n — 1 
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players are the numbers 1/2(n — 1),3/2(n — 1),...,(2n — 3)/2(n — 1), in some 
order, and the limiting move x, of the last player equals x,,_,. This gives a limiting 
payoff of 1/2(n — 1) for the last two players, and 1/(n — 1) for the remaining 
players. 


6. Other tie-breaking rules. For other tie-breaking rules, the situation is more 
complicated. To simplify things and avoid dealing with the special intervals J, and 
J,., let us assume that for all the intervals J,, both endpoints of J; have been taken 
by earlier players. This will be the case if the numbers 0 and 1 are taken at the 
beginning of the game (or, alternatively, if the game is played on a circle). 

Under the simplifying assumption that the endpoints have been taken at the 
beginning of the game, there is no difference between the “closest to the first 
player” and the “rightmost” tie-breaking rules. We now examine the situation 
under the more equitable tie-breaking rule in which the choice among equivalent 
moves is made uniformly at random. 

If we follow the heuristic argument of section 3, we would expect that the 
number of players r; in each J, would still satisfy (4). Under the “random” 


tie-breaking rule, however, the subintervals of size B would be taken by the first 
r,— 1 players in J; at random from either the left end or the right end of J,, 
leaving a subinterval J, of size greater than B somewhere in between. Then the 
last player in J; would play at random inside J; but in such a way as to partition J; 
into two subintervals of size at most B. 

This is indeed the optimal strategy when the total number of players in each 
interval is small. The inductive step, however, fails when the number of players is 
large. Thus, in the seven-player game with both endpoints taken, if all players 
follow the strategy we described, the expected payoff for the first player will be 
(1/7) — (1/2)’) < 1/7, but if the first player chooses the midpoint of the [0, 1] 
interval and the remaining six players follow the strategy we described, playing 
three to each side of the first player, the expected payoff for the first player will be 
(1/6) — (1/2)°) > 1/7 (see [1] for the details of these calculations). 


Nevertheless, since this strategy is optimal when the number of players is small, 
it may provide a good base case from which to extend the solution for the 
“random” tie-breaking rule to larger values of x. 


7. Conclusion. We have shown that a simple zero-sum game in which all 
players play independently so as to maximize their own payoff can be viewed, 
under certain conditions, as a game in which the players cooperate to minimize the 
payoff of the last player. Furthermore, this approach is helpful in developing an 
optimal strategy for the game. An interesting question is whether this principle can 
be applied to a broader family of zero-sum games. A candidate problem for this 
approach may be the two-dimensional version of toetjes. 
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Introduction. The max and the taxicab metrics in R? are accessible yet interest- 
ing examples of non-Euclidean metrics. Each is a specific case of the more general 
Minkowski metric where the defining unit “circle” is a centrally symmetric closed 
convex curve. It has been shown in [3] for the max metric p in R” and in [6] for the 
taxicab metric d; in R* that the respective isometries of the whole space form a 
subgroup of the Euclidean isometries of the whole space. There are, nevertheless, 
many isometries from one set to another which cannot be extended to an isometry 
of the space onto itself and are evidently quite different from Euclidean isome- 
tries. 

Let p be the metric in R” given by p(p, q) = max, .;.,,|x; — y,| where p = 
(x,,---,x,) and q = (y,,..., y,). For E, F CR", f: E — F is a p-isometry and E 
and F are p-isometric or p-congruent if f is surjective and p(f(p), f(q)) = p(p, q) 
for all points p and q in E. The usual Euclidean distance between p and q will be 
denoted by e(p, q). f is a distance preserving bijection from one set to another set. 

For the most part, this paper deals with the case of the max norm or metric in 
R’, but many results pertain to the max norm in R”. In addition, with methods 
which are natural extensions of those used in this paper, the reader will see that 
some proofs can be modified for a Minkowski metric where the defining unit 
“circle” is a polygon. 

There is a class of simple 27-point sets in R” (called y-stars) on which a 
p-isometry must be Euclidean and can be extended uniquely to an unbounded set. 
Any E Cc R” of positive Lebesgue measure contains many such sets. Then F ¢ R” 
whose closure has positive Lebesgue measure can be partitioned up to a set of 
measure zero into disjoint subsets, on each of which a p-isometry is Euclidean. 
Depending on the geometry and not the cardinality of the set involved, some 
p-isometries cannot be extended at all, others can be extended but not uniquely, 
still others can be extended uniquely to R”. Finally, another type of four-point set 
is introduced in R*; together with y-stars they are minimal sets in R? on which 
p-isometries must be Euclidean, answering a conjecture of [3] in the negative. 


2. Examples in R? and preliminaries. 


ExamPLe 1. Let E =[-—1/2,1/2] x {0} and f((x,0)) = (x, x7). This is a 
p-isometry on E since, for x > —1/2 and z<1/2,|x +2z| <1. Then 
p((x, x7), (z, z7)) = max(|x — z|, |x? — z7)) = |x — z| = p((x, 0), (z, 0)). So a straight 
line segment is p-congruent to a segment of a curve. 
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EXAMPLE 2. Let E be the x-axis and 0 < |m| < 1. g: (x,0) > (x, m|x)) and f: 
(x, 0) — (x, mx) are both p-isometries. Both E and f(£) are vector subspaces, but 
it is easily seen that f cannot be extended by even one point: a point p €¢ E has 
infinitely many points on the x-axis at a distance a from p, whereas any point qg 
not on the line f(£) has at most two points on f(£) at a distance a from gq. Thus 
no point gq could be the image of p under an extension of f. (See Fic. 1) 


fCE) 


Fic. 1. A p-isometry which cannot be extended. 


Analogously, let F be the y-axis and h: (0, y) > (my, y) for 0 <|m| < l.hisa 
p-isometry mapping the y-axis onto the line y = (1/m)x. 


Remark. Since the parallelogram law does not hold, (R*,p) is not an inner 
product space. The Witt Extension Theorem [5] states: If a metric vector space V 
is equipped with a non-degenerate quadratic form, then any isometry of a subspace 
U, onto a subspace U, can be extended to an isometry of V ~ V. The term 
isometry here means a bijective linear map which preserves the quadratic form. 
The above example is an easily available illustration that such an extension 
theorem is not always possible in a vector space where the metric is given by a 
norm and not by an inner product. 


Example 3. p,; = (1,0, p, = (-1,1), p3 = (-1,-1), py = GA, — 1). pCp;, p;) 
= 2 for i,j = 1,2,3,4. Any permutation f of the four points is a p-isometry; in 
particular, take f(p,) = py, fp.) = po, f(p3) = p3 and f(p,4) = py. 


EXAMPLE 4. The above p-isometry can be considered as a localization of the 
p-isometry g on E = {(x, y)||x| > ly} where g((x, y)) =(x,y) if x <0 and 
e((x, y)) = (x, —y) if x > 0. There is also the p-isometry A on the set F = 
{(x, y)||y| > |x} where A((x, y)) = (x, y) if y < 0 and A(x, y) = (—x, y) if y > 0. 
The upper cone is reflected about the y-axis, the lower cone remains fixed. 


A geometric feature of this metric gives rise to some of these unusual congru- 
ences. In R* with the usual metric, every point can be uniquely located by 
specifying its distances from three non-collinear points. In the max metric this is 
not the case, nor is it so for any Minkowski metric where the defining unit “circle” 
contains a straight line segment. In Fic. 2(i), every point on the heavy line segment 
has distance a from p, b from q and c from r. Using this property of the max p or 
taxicab metric d,, many examples can be constructed of three (or more) point sets, 
([4],[6], and Fic. 2@i)) which are p- or d;-congruent as sets, but are not e-con- 
gruent. We shall see that it is true in general that for any three-point set in the 
plane there is a p-isometry which is not Euclidean. Since p-isometries of R* —> R? 
are Euclidean, there is no configuration of three points such that any p-isometry 
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Fic. 2. (i) Points are not uniquely located by specifying distances from three noncollinear points. 
(ii) All sets with p,q fixed and t on the heavy line segment are p-congruent. 


on those three points can be extended to a p-isometry of the space onto itself. We 
shall see later that, although p-isometries on some sets must be Euclidean, there is 
always a non-Euclidean extension of a p-isometry which is a relativization of an 
element from the group of symmetries of a square on a bounded set. 

Some terminology will clarify the proofs of these assertions: given a point 
p © R’, construct lines L and L’ of slopes +1 through p, obtaining the four open 
quadrants Q(p), Q,(p), Q,(p), Q,(p) (right, left, up, down) of p. Note that if 
p © Q,(q) then gq € Q,(p) (where the bar denotes closure with the usual topol- 
ogy) and the p-distance of p to q is the absolute value of the difference in their x- 
or horizontal coordinates. Also if p € Q,(q), then q € Q,( p), and their p-dis- 
tance is the same as their vertical e-distance. Whenever p € Q,(q),Q,(p)< Q,(q) 
fori =r,l,u,d. 

Generalizing Example 4, a half-pivot about a point p is defined as one of four 
mappings with respect to the quadrants of p. For a right (left) half-pivot about p, 
the closed right (left) quadrant of p is reflected about a horizontal line through p 
while the closed left (right) quadrant stays fixed. Likewise, for a lower (upper) half 
pivot about p, the closed lower (upper) quadrant of p is reflected about a vertical 
line through p, while the closed upper (lower) quadrant stays fixed. Relativizations 
of these mappings are p-isometries for any sets contained in the mentioned 
quadrants: e.g., if E CQ,(p) U Q,(p), then an upper or lower pivot about p is a 
p-isometry on E. See Fic. 3. 


PRoposiITION 1. For any three points in R* there is a p-isometry which is not 
Euclidean. 


(ii) 


Fic. 3. (i) An upper half-pivot about p. (ii) e(q, r) is equal to the sum of the e-lengths of the legs of the 
triangle pgr’, and e(q,r’) is the hypotenuse, while p(q, r) = p(q,r’). 


798 REGINA B. COHEN [November 


Case (a). If the three points p,, p, and p, are collinear, map them into another 
line using the restriction of the p-isometry f, f—',h, or h~! of Example 2 to the 
three points. 

Case (b). Suppose two of the points, say p, and p,, are on a line L of slope +1 
or —1. Construct a line L’ through p, perpendicular to L. L OL’ = p, which 
could be coincident with p, or p,. In any case, one of the two points is in one 
closed quadrant of p and the other two are in the opposite closed quadrant. 
Half-pivot about p so that p, is now on L, and either all three points are on L or 
p, is on L while p, is on L’. In either case, two points which had for e-distance 
the hypotenuse of a right triangle now have e-distance equal to the sum of the 
lengths of the legs. 

Case (c). If no pair is on the same line of slope +1 or —1, then construct lines 
of slope +1 through all three points. One of the lines L, lies between the other 
two, L, and L,. Label the points so that p, ]@L,, p, & L,, and p, € L;. 
Construct Li, and L’, through p, and pz, respectively, with slope —1. q,; = L, OL’ 
and q, = L, OL}. If p, is interior to the rectangle formed by L,, L,, L‘, and L4, 
then a half-pivot about q, or a half-pivot about q, will change Euclidean 
distances, each bringing p, or p,, respectively, onto L,. If p, is not interior to the 
rectangle, choose whichever of qg, or g,; on L, is further from p,. A right or left 
half-pivot about that point will bring p, or p, onto L,, leaving p, fixed on L,. 
p-distances are maintained (Fic. 4). 


P3 


No 


P2 


(i) (ii) (iii) 


Fic. 4. (i) case (b). (ii) case (d). (iii) case (d). Half-pivot points are circled. 


3. p-isometries on y-stars and extensions. It is not difficult to find four-point 
sets on which p-isometries are not Euclidean (Example 3), but there are four-point 
sets on which p-isometries must be Euclidean. We shall see that configuration 
matters. Define a y-star at p = (x, y) to be the four-point set (x + y, y) = p,, 
(x—y,y)=p3,(x%,y+y)=p, and (x,y — y)=p,, where y > 0. There is a 
simpler proof of Lemma 1 given in the proof of Prop. 4, but the method here is 
used later in the proof of Prop. 3. 


Lemma 1. Let S be a y-star at (x, y) and f a p-isometry on S. Then f is Euclidean, 
and f(S) is a y-star at p', the intersection of the line segment between f( p,) and f(p3) 


1990] SET ISOMETRIES AND THEIR EXTENSIONS IN THE MAX METRIC 799 


and that between f(p,) and f(p,). f is a translation followed by an element from the 
group of the square. 


Proof. 


P(P1,P3) = pC P2, D4) = 2y 
and 


P(P1,P2) = P( P2, P3) = P(P3, Pa) = P( Pas P1) = (1) 


All squares mentioned have sides parallel to the axes. f(p,) = (x', y’) can be 
considered a translation of p,. f(p3;) must be on a square S, of side 4y centered 
at f(p,); suppose it is on the top side. Both f(p,) and f(p,) must be on a square 
S, side 2y centered also at f(p,); since they are 2y apart, however, they must be 
on opposite faces, right and left, of this square. f(p,) and f(p,) are at distance + 
from f(p3), and must also be on opposite faces of a square S, of side 2 centered 
at f(p,). The only configuration which will satisfy all the distance requirements is 
the y-star centered at (x’,y’+ y): f(p,) =(x', y), flp,) = (' + y,y' + y), 
fps) = (x', y’ + 2y), f(p,) = G' — y, y’ + y). fCp,) and f(p,) can be inter- 
changed. By symmetry, the argument follows a similar pattern if f(p3) were chosen 
to be on the bottom or a side of the square S,. One can see directly that the only 
mappings of a y-star into a y-star which preserve the given distances (1) are 
translations followed by one of the eight elements of the group of the square 
centered at (x’, y’ + y). We shall refer to this group as G. 

y-Stars are “small” sets on which p-isometries must be Euclidean. They are 
important for two other reasons: first, any p-isometry on a y-star can be uniquely 
extended to a certain unbounded set; second, any set of positive Lebesgue measure 
contains many y-stars. 


Lemma 2. Let S be the y-star at (x,y) in R* and EDS any subset of 
QO = {((x, yI|x —y| < y} U {(x, y)||x + y| < y}. Then any p-isometry f on S has a 
unique extension to E. In particular, by Lemma 1, any p-isometry is Euclidean and 
belongs to G. 


Proof. First consider the geometry of the situation in Fic. 5, both for under- 
standing and to clarify the notation. We split Q into five subsets: 


QO, = {(x, y)|lx —yl <y} N (Cx, y)|x +y > ¥} 

QO, = {(x, y)|lx +¥l<y} A {C(x y)ly —x > ¥} 
Q,={(x, y)|lx —yl <y} A {Cx yx +y < —y} (2) 
Q,={(x,y)|lx tyl<y} 9 {Cx y)ly —x < —y} 

Os = {(x, y)|lxl + ly] < y}. 


Every point p in, for example, Q, is at the intersection of the vertical side of a 
Square centered at qg, and the horizontal side of a square centered at q, as 
illustrated in Fic. 5(ii). Only p’ in Q, has that same set of p-distances from q, and 
q>; p and p’ have different p-distances from q, as well as from q,. Points on the 
line segment between q, and q, are uniquely located by their p-distances from q, 
and q,. Any point p in all of Q is at the intersection of a horizontal and a vertical 
side of squares centered at two adjacent points of q,, q,, q3 and q,. p can be 


800 REGINA B. COHEN [ November 


(i) (ii) 


Fic. 5. (i) The set Q. (ii) p is uniquely located in terms of its distances from points of a y-star. 


located uniquely by specifying its p-distances from these four points, which are 
fixed throughout and determine Q. 

For calculation, these p-distances can be given in terms of the coordinates of 
the point p = (x,y). Associate with any point in Q = Q((a, b), y) a four-tuple 
[P15 P25 P3, P4] Of its p-distances from the defining points of Q: q,, q5, d3 4. 

If the p-isometry f is simply a translation, then the assertion is immediate in 
view of the discussion above. For ease of computation and without loss of 
generality, let (a, b) = (0,0); that is, consider f modulo a translation of E. p € E 
and the four-tuple [p,, 95, 93, 94] in terms of the coordinates of p are 


Ly,x,yt+x,yt+ylifp EQ, 
ly—x,-x,y,y+y]ifp EQ, 

[y —x.y—y, -y, -x] if p € Q; (3) 
l[-y,y-y,yt+x,xlifp]eQ, 
ly-x,y—-y,yt+x,y +yJif p © Qs. 


The eight possible p-isometries acting on the y-star S are rotations 90°, 180°, 
270°, and 360° counterclockwise; reflections across the lines x = 0, y = 0, y =x, 
and y = —x. p € Q, has the set of p-distances [y, x, y + x, y + y] from the points 
of S; p; =q,, 1 = 1,2,3,4. If f is to be extended to an isometry on p consistent 
with f on S, then f(p) will have the same p-distances from points of f(S). If f 
is a 90° rotation counterclockwise then f(p,) = q,, flp,) = 43, flp3) = q@, and 
f(p4) = q;. We have p, = p(p, a4) = p(y, p4) = p( f(r), fp) = p(f(p), q1). This 
last is now the first entry in the four-tuple of p-distances of f(p) from q,, 
i = 1,2,3,4, yielding [y + y, y, x, y + x], which is the unique set of p-distances for 
a point in Q, with coordinates (—y, x). This represents a 90° rotation counter- 
clockwise of p = (x, y). 

Another case for p@=Q, and f on S is a reflection across y =x. The 
p-distances of p from points of S will be [y —x,y —y,y+x,y + yl. fl(p)) =@, 
fp») = a1 f(p3) = 44, f(p4) = G3, So that f(p) has the four-tuple of p-distances 
ly-y,y—x,y +ty,y +x]. f(p) is uniquely located in Q, with coordinates (y, x). 
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The verification of Lemma 2 for the other cases is straightforward, following the 
same ideas as the examples chosen and using the symmetry properties of the 
square. 

If p €Q, say in Q,, where Q, = {(x,y)|x +y>y and x—y> y}, its dis- 
tances from points of § are given by [x — y, x, x, x + y]. Any point on the vertical 
line x =k,|y| <k — y, has the same set of distances from S. Points on that line 
segment cannot be distinguished from one another in terms of their distances from 
points of §. A certain amount of vertical (or horizontal) movement cannot be 
detected from points within the convex hull of the y-star. 

A more concise proof of the analogy of Lemma 2 in R” for n > 2 is possible if 
attention is restricted to the convex hull of the n-dimensional y-star. See Lemma 2 
of [3]. This is sufficient to answer the questions of Schattschneider [6] as to the 
nature of the group of isometries of R” with respect to the max metric. Any two 
points p,q € R” belong to the convex hull of some y-star S$. By Lemma 2 cited 
above, any p-isometry of R” — R” must map the convex hull of S in the same way 
as it maps the points of S; namely a translation followed by a transformation of the 
group of the n-cube, which maps points of an n-dimensional y-star to other such 
points, so that Euclidean distances are preserved. To relate the group of isometries 
of (R”, d,) to that of (R”, p) we cite here a theorem from [2]: 


THEOREM 1. If U, and U, are the unit spheres of the n-dimensional Minkowski 
spaces M, and M,, then any linear transformation which maps U, onto U, is an 
isometry of M, onto M,. 


These results can be generalized for a Minkowski metric whose unit “circle”’ is 
given by a closed centrally symmetric polygon. For example, with a hexagon of side 
y the analog of a y-star would be the set of six corner points H. A point of H has 
distance y to an adjacent point, distance 2y to all other points of H. Every point 
inside the hexagon can be located uniquely in terms of its distances from points of 
H. Any p-isometry (p defined by a hexagon metric now) must carry points of H to 
points of H via some Euclidean transformation, a translation followed by a motion 
which is a symmetry of the hexagon. 


4. p-isometries and extensions on more general sets. Returning to p-isometries 
with the max metric, if FE is an open set in the usual topology, there is a y-star 
S CE centered at every point p © E. Any p-isometry acting on FE must be locally 
Euclidean. If F is open and connected, then f is the same Euclidean isometry on 
all of &. Any two points p and q in E can be connected by a simple arc, which can 
be covered by a chain of finitely many overlapping closed convex hulls of y-stars 
contained in EF. Starting at p and proceeding from set to set in the chain, the 
p-isometry at q must have the same action on q as on p. 

Looking back at Example 3 and forward to Example 5, one can see that it is 
possible for a p-isometry to act like a different Euclidean isometry on different 
parts of a set. It might seem that a y-star configuration is too specific to be useful 
in studying the action of a p-isometry on more general sets in R* or R”. It turns 
out, however, that at almost every point x of a measurable set FE there are 
infinitely many y-stars of arbitrarily small size centered at x. Thus, when consider- 
ing the action of a p-isometry f on E, we can look locally at its action on a y-star. 
m denotes linear Lebesgue measure, and A = m X m. 
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Lemma 3. [3] E is a measurable set in R*, ACE) > 0, and 6 > 0. At almost every 
point p © E there is a y-star at p contained in E with 0 < y < 6. 


The proof uses the Lebesgue density theorem and the Fubini Theorem. The 
remaining points (x, y) are those which are linear m-density points on both the 
horizontal E” and vertical E, sections at (x, y) in E. This result shows that there 
is a kind of local geometric symmetry on an arbitrarily small scale at almost every 
point in a measurable set. 

The type of set we may consider can be widened beyond the class of measurable 
sets. By standard methods it is easy to show that a p-isometry f on a set E may be 
uniquely extended to a p-isometry f on the closure of E so that f/, = f. Then if 
ACE) > 0, the following theorem describes a p-isometry on E, except on a set of 
measure zero. 


THEOREM 2. [3] E C R’, ACE) > 0. There exist pairwise disjoint closed squares, 
sides of slope +1,{S,}°_,, such that for any p-isometry f of E and any i > 1 the 
restriction of f to E QS, is Euclidean belonging to G. ALE \. U*_,S,) = 0. 

The proof, which holds in R”, employs the Vitali covering theorem. The closed 
squares are closed convex hulls of y-stars in E. On “most” of a set, a p-isometry 
must be locally Euclidean. 

There are many examples of p-isometries where the set E has countably many 


disjoint parts and the p-isometry is a different Euclidean isometry on each. 


Examp_e 5. In R* consider the family of isosceles triangles 7,,i = 0,1,2,... 
with vertices (31,0), (3i + 1, 1), Gi + 2,0). Include the interior points. Let f be a 
transformation whose action is given by the sequence (6), 6,,55,...),6, = 0 or 1 
where 1 in the ith place represents leaving T, fixed and 0 represents reflection of 
T. across the x-axis. Any such f is a p-isometry on U;_,)J; = T. Also, given any 
sequence of real numbers (f),¢,,¢,,...) where ft) =0 and |t,,,-—t|)<1, a 
p-isometry g can be defined so that the quantity rf, represents a vertical translation 
of the triangle 7;. f and g may be composed, resulting in a combination of local 
reflections and translations. 

Here £ is an unbounded set, but the example can be revised so that the bases 
and heights of the triangles are terms of a convergent series, and the resuiting set 
U*_ 7; will be bounded. This sort of example can be constructed for a connected 
set: for i=0,1,2,... let T,’ be the triangle with vertices (2/,0),(2i + 1, 1), 
(2i + 2,0) and its interior points; as above, define the action of the p-isometry f 
by the sequence (6), 6,,65,...). Again, the construction can easily be revised so 
that JT’ = Uf_ T;’ is bounded as well as connected. Furthermore, sets of measure 
zero such as line segments can be attached. When placed appropriately, the 
p-isometry may be locally non-Euclidean on these while remaining locally Eu- 
clidean on the “substantial” part. 


Remark. Two sets E and F are said to be equivalent by enumerabie decomposi- 
tion, in the sense of Banach and Tarski, if each can be partitioned into disjoint 
subsets E,, E,,... and F,, F,,... so that E, is e-congruent to F, for all i > 1; [1]. 
The p-congruent sets T, f(T), g(T) and fe g(T) are equivalent by enumerable 
decomposition, as are T’ and f(7"), but the p-congruent sets of Example 1 are not. 

On what types of sets can a p-isometry be uniquely extended to a p-isometry 
R’ > R*? No bounded set will suffice. 
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PROPOSITION 2. E C R? is bounded and f is the identity on E. Then there is a 
point p € E and a p-isometry f, f |~ = fand fi is not Euclidean on E U { p}. 


Proof. E and E are bounded, so there is a square with sides of slope +1 
enclosing E. Let q = (Xo, Yo) be its highest point. With k > 0, choose p = 
(x', Yo + k) with x9 < x’ <x9 +k. Define f = f on E and f(p) = f(x’, yo +k)= 
(A(x) + x' +k), yy +k). f on {p} is a small horizontal displacement. f is a 
p-isometry and not an e-isometry on E U {p}. There exist non-Euclidean exten- 
sions to much larger sets, for example to Q,,(q) and Q,(q') where q’ is the vertex 
opposite to g. Define f as the identity on F and as a reflection about the line 
through q and q' for points in Q,(q) and Q,,(q’). 

On the other hand, a sparse set E which is non-dense and of measure zero can 
have the property that a p-isometry defined on FE is uniquely extendable to a 
p-isometry of R” — R”. One example in R? is the set E = {(0,n),(m,0)|m,n © Z} 
which contains a y-star for arbitrarily large y. This condition is sufficient but not 
necessary, e€.g., a p-isometry on an infinite vertical strip of positive width must be 
Euclidean. 

If the boundary of the convex set defining the Minkowski metric contains a 
Straight line segment, it will always be the case that for a bounded set EF a 
non-Euclidean extension of the identity can be constructed as above. Ali points on 
some line segment outside F and parallel to the line segment in the unit “circle” 
have the same distance from a point in E. 


5. Other four-point sets on which p-isometries must be Euclidean. It is an 
interesting question whether y-stars are the only four-point sets in R* on which 
p-isometries must be Euclidean. The answer is no: a variation on a y-star produces 
such a set, but the variation must be done symmetrically. One type of four point 
set on which p-isometries must be Euclidean will be called a skew y-star at p (see 
Fic. 6). The four points are centered at p = (x,y) with coordinates p, = 
(x+y,y—e) pr, =(xt+e,y+y) py=(x-—e,y—y) pz =(*-y,y +6), |e] < 
y. (Call ¢ the eccentricity of the y-star.) p-distances to adjacent points are all 
y + |e|, and 2y to opposite points. Notice that the convex hull of a skew y-star is a 
tilted square. 


® 
t 
d 
') 
@----'-~---® 
e 
‘ 
e 
e 
Ld 
$ 


(i) (ii) 


Fic. 6. (i) a y-star and a skew y-star. (ii) f(p,) = (k — y — e, y + ©) and f(p,) = (y + €, y — ©). 
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PROPOSITION 3. A p-isometry f on a skew y-star must be Euclidean. 


Proof. Without loss of generality, « > 0. f(p,) is a translation of p, to (0,0). 
f(p3) must be on a square S, of side 4y centered at f(p,). Suppose it is on the top 
side. f(p,) and f(p,) are on a square S, of side 2y + 2¢ centered at f(p,) also; 
they are located as well on another square S, of the same size centered at f(p;). 
Where on the top side of S, must f(p;) be? One can eliminate directly the points 
at the corners or the center of the top of S,. Let f(p3) be an e-distance k > 0 to 
the right of center (see Fic. 6). 

Now p(f(p,), f(p4)) = 2y = max(2y + 2e — k,2e). This implies k = 2 and 
the points {f(p,)}?_, are in the configuration of skew y-stars. The other cases are 
similar. 


Modulo translations, there are sixteen p- (and e-) congruences for any skew 
y-Star. Each is an element from the group of the square or such an element 
composed with a reflection across one of the lines passing through opposite points 
of the skew y-star. The p-isometry mapping the skew y-star at p with eccentricity 
e > 0 to the skew y-star at p with eccentricity —e is equivalent to a (Euclidean) 
rotation through 2 arctan ¢/y. Therefore, for any angle of rotation, there is a skew 
y-Star for which that rotation is a p-isometry. 

The other type of set on which a p-isometry must be Euclidean is the set of 
vertices of a rectangle with sides of slope +1. In a sense, the y-star is the limiting 
case of all these sets. For brevity, denote the collection of all such sets, including 
y-Stars, as S. 

The case of four points whose convex hull is a rectangle F, sides of slope +1, is 
treated in a way which points out an important feature of the max metric in R’. qo 
is said to be between q, and q, iff p(q,, q) = p(q,, do) + p(4o, q>). The set of 
points satisfying this equation is denoted by B(q,, q,). Geometrically, it is the 
rectangle and its interior with sides of slope +1 such that the line segment from q, 
to q, is the diagonal. If q{,q5 € B(q,, q,) then B(q!, gi) C B(q,, gz) with equality 
only if qj and q{ are coincident with q, and q, or are the other vertices of 
B(q,, Qo). 


PROPOSITION 4. A p-isometry f on the vertices of a rectangle F whose sides have 
slope +1 must be Euclidean. 


Proof. If {p,, D>, P3, P4} = P are vertices of the rectangle F, counterclockwise, 
then F = B(p,, p3) = B(p;, p,). We have 


p( Pi, Pr) = PC D3, Pa) = 4; 0(P>,P3) =p(Pp Ps) =b 
P(P>,Ps) = p(P1,Pp3) =a +b where a,b > 0. (4) 


f is a p-isometry on P. f(p,) and f(p,) belong to BC f(p,), f(p3)), so the latter 
must contain B(f(p,), f(p,)). f(p,) and f(p;) belong to B(f(p,), f(p4)) as well, 
so the two sets are equal and f(p,), i = 1,2,3,4 are vertices of a rectangle, with 
sides of slope +1, of the same dimensions (4) as F, and f will be Euclidean. 


6. The separating set of Schoenberg. To establish that the class S contains the 
only four-point sets and thus the minimal sets on which p-isometries must be 
Euclidean, we use a construction from [7] attributed by the author to I. J. 
Schoenberg. This is perhaps a specialized result, but its proof illustrates the 
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usefulness of the Schoenberg construction for the study of p-isometries on finite 
sets in R*. One can construct separating sets in R* for some other Minkowski 
metrics as well. 

Definition of a separating set: EF is a finite set with an even number of points 
2m. Consider first the lines of equation y = x + s such that m of the points lie 
above the line and m below. y =x +s, and y =x +, are the upper and lower 
bounds of such lines. Likewise, for lines y = —x +t, find the lower bound 
y = —x +t, and the upper bound y = —x + t,. The closed rectangle formed by 
these four lines and its interior is called the separating set of E. See Fic. 7. Notice 
several useful facts: For p, in the separating set 


P(DPa> Pc) = P(Pa> Po) + P(Po> Pc), for pp = BC Dy, Pc)» Di = Q; (5) 


p-distances between points in Q, and Q. and between points in Q, and Q,, are 
their horizontal and vertical e-distances respectively. Many different sets will have 
the same separating set. There are no points of EF in the two strips whose 
intersection is the rectangle; otherwise the lines would be positioned differently. 
For the same reason when F consists of four points, it will not be the case that 
adjacent open quadrants, e.g., Q, and Q,, both contain points of E. 


@ 
Ou P3 


Fic. 7. The separating set of P = {p1, D>, D3, D4; Ds Do}- 


ProposiTION 5. For any four-point set E = {p,, p>, P3, D4} which is not of the 
class S, there is a p-isometry on E which is not Euclidean. 


Proof. We construct a non-Euclidean p-isometry for any EF €S. Treat the 
simpler cases first. (a) If all four points are on a line of slope +1 or —1, use the 
inverse of the p-isometry of Example 2. (b) If three points are on a line L of slope 
+1 or —1, construct line L’ through p, perpendicular to L. g = LN L’. Half-pivot 
about q, bringing p, onto L and changing its e-distances to at least one of p,, Do, 
and p;. In what follows, L, and L’ are lines of slope +1 containing p, > L; 1 L' 
for i = 1,2,3,4. (c) Two points p, and p, are on a line L of slope +1 or —1. 


(i) The other two points are to one side of L. 

If there is a line L’ | L so that three of the points lie to one side of L’ 
and L' L = 4q is Strictly between p, and p,, then half-pivot about gq, 
bringing one of p, or p, onto L’ and leaving the other on L, changing 
e-distance. 
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If there is no such line as L' above, see Fic. 8, p, lies on or above 
L,,L,1L,p,=L5,OL and p, lies below L!, Li LL, p, =L'i nL. L, 
and L, are parallel to L, with p, € L, and p, € L,. One of those lines lies 
between L and the other; let it be L;. g = Li, L, has three points of F in 
one quadrant and the fourth in the opposite. A half-pivot about g puts p, 
and p, on the same line L’, changing e-distance. For other configurations, 
one can always choose one of L'!, 1 L3,Li0L,,L5,9L, or LZ V1 L, asa 
half-pivot point. 


P2 P3 


P4 


Fic. 8. Case c(i) of Proposition 5. 


If both pairs p,, p, and p3;, p, are on parallel lines, of slope +1 or —1, 
and if the projection of the segment from p, to p, onto L, would be 
properly contained in the segment from p, to p,, then use p, and p, as the 
beginning pair and there will be a line L’ as above. 

(ii) Two points p, and p, are on a line L of slope +1 or —1; p; is on one side 
of L, p, is on the other. L = L, = L,. 

If there is a line L’ 1 L so that p, is on one side of L’ and p, is on the 
other and if g = L’ \ L is strictly between p, and p,, there are two points 
of EF in one quadrant of q and two in the opposite. Haif-pivot about q, 
bringing p, onto L’. 

If L’, and L', lie outside and both above or both below the segment from 
p, to p, then construct lines L, and L, through p, and p, parallel to L. 
Suppose L’, lies between L’, and L’. gq = LO L',. One closed quadrant of 
q contains three points of EF, and p, is in the opposite closed quadrant. 
Half-pivot about qg and bring p, onto L. If L,=L),,L,=L, or Ly = 
Li, = L4, let gq =L OL’, also. 


(d) Only one point of E is on any one line of slope +1 or —1. Construct the 
separating set of /; there are four cases where a half-pivot point can be chosen 
easily, as in Fic. 9. Neither these points nor those of cases (b) and (c) are unique. 

Consider now a four-point set FE, not of types already studied, where there is 
only one point of F on each line of the separating set S$, and none at any vertex. 
See Fic. 10. Without loss of generality, the center of S, is at (0,0). The four lines 
bounding S, are y=x +b and y= —x +a, taking a > b > 0. The coordinates 
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Fic. 9. Left/right half-pivot points are circled. Other possible configurations are obvious variations of 
these. 


“ (0, 0) 


re ee one On 


Fic. 10. Gi) a > b > O. Gi) a = b. 


of the vertices of S') are 


a+b a-—b —(a+b) -(a-b) 
ne (SS) a(S] 
a-—b a+b —(a—b) —-(a+b) 
=| 2’ 2 n= (So 


{z,}/_, are positive numbers; then the set E must have the configuration of the 
points {p,}_, or {q)}¢_, with e(p,,v,) = e(q,, u,;) = V2z,, i = 1,2,3,4 where the 
coordinates are given as in Fic. 10. 


a+b a—b a+b a—b 
p= ( , tay S421] a-{ 5 +, 5 a4 
a—b a+b a—b a+b 
p= ( 5 -2n +2] a= | 7 +n = +2] 
(6) 
—(a +b) —(a —b) —(a+b) —(a—b) 
ps= (SSS ss] as 7 — £35 7 ss] 
—(a—b) —(a+t+b) —(a-—b) —(a+b) 
r= [Stee Sa) ge (SS a SS a 


Keep in mind, for example, that Q,(p,) and Q,(p,) are both contained in 
QO,(p;); then p(p,, p;) and p(p,, p;) are the differences in the x-coordinates of the 
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points, and the x-coordinates of p, and p, are both larger than that of ps3. 
p(p;, pj) = pq; 4;) for i, j = 1,2,3,4. Namely, 


P(P\,P3) =at+b+2z, +2; P(P1,P4) =44+2, +2, 
P( P,P) =at+b+2z,+2, P( P25 P3) =A4+2Z,7 23 (7) 
P( Py, Po) =b+2,+2, P( P35 D4) =b+234+ 24. 


Therefore, f(p,;) = q,;, i = 1,2,3,4 is a p-isometry on E. f is not Euclidean, for 
e(p,, D3) # e(q,, gz) unless a = b. e(p,, p,) # e(q,, q>) unless z, = z,. This can 
be checked by the ordinary distance formula in (R’,e). One can also see this 
geometrically by considering the legs of the two right triangles for which each of 
those e-lengths is the hypotenuse. In general, f will preserve e-distances iff the 
separating set is a square and z, =z, =z, = 2Z,. This is another characterization 
of a skew y-star. If E has the configuration {p,}/_,, apply p-isometry f; if E has 
the configuration {q,}*_,, apply p-isometry f-'. The case b > a > 0 is analogous. f 
is a variation of the p-isometry employed in Lemma 1 of [7]. 


Remark. The action of f could be thought of as simultaneous half-pivots about 
the points v,, V5, V3 and v,. For any point within the rectangle the p-distances to 
the points {p,}*_, are the same as those to {q,}/_,. In other words, the p-isometry 
on the set E can be extended to the rectangle S$, by defining f there as the 
identity map. f can also be extended to the rays of slope +1 from each vertex in 
which p, lies, i = 1,2,3,4. Each ray is mapped to the ray of opposite slope which 
contains v, and q,, 1 = 1,2,3, 4. 


7. Some further questions. The non-Euclidean p-isometries of Examples 1 and 
4 and the extended f of the above Remark have a maximal set beyond which they 
cannot be extended. Does each such p-isometry have a maximal set and can these 
maximal sets be characterized? Interesting combinatorial problems are encoun- 
tered in looking for minimal sets on which p-isometries must be Euclidean in 
(R”, p), n > 2, or for other Minkowski metrics where the unit “circle” is a polygon 
in R’. Lastly, it would be instructive to explore specific examples of set isometries 
for Minkowski metrics where the unit “circle” contains arcs of ordinary circles as 
well as straight line segments. 
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1. Introduction. In this paper we study the behavior of invariant subspaces of 
matrices, under small additive perturbations of these matrices. In particular, we 
will be interested in studying the rate of convergence (in terms of the norm of the 
perturbation) of the invariant subspace of the perturbed matrix. It turns out that 
this rate of convergence can be different from that of eigenvectors only. In some 
cases the rate of convergence of invariant subspaces is actually much better than 
that of eigenvectors if one has the choice of an invariant subspace for the 
perturbed matrix. Also, new phenomena appear; for example, the rate of conver- 
gence can be different if all nearby invariant subspaces of the perturbed matrix are 
considered. In this paper we develop the concepts of stability that reflect the rate 
of convergence of invariant subspaces. We point out some new phenomena that 
occur in this situation and prove partial results concerning characterization of 
invariant subspaces with a given rate of convergence. Many questions here are 
open and we indicate some of them. 

It is well known that the eigenvalues of an nm Xn (complex or real) matrix 
depend continuously on the matrix. The behavior of eigenvectors has been exten- 
sively studied from the point of view of applications in numerical analysis [W, GvL]. 
In fact, the behavior of eigenvectors can be basically described as follows: it falls in 
either one of the following three classes: 

1) hopeless: consider, for example, the eigenvector [2] of the matrix E ol. 


Perturbing the matrix to [° 4 with ¢ a small number we see that the eigenvector 
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of the perturbed matrix is far away from )3]. Of course, there is nothing special 


about | (|; any other eigenvector of | || is just as hopeless. 
2) continuous but bad: the standard example here is the n X n matrix 


0 1 0 en 0 
1=|: 0 
. 4 
bee wee lg 
Perturbing this matrix with an e > 0 in the left lower corner 
0 1 0 nO 
Ie) =]: he OY, 
0 od 
we have the following eigenvectors of J,(e): ye) = (6, e7,...,e7 'Y, j= 
1,..., where e, is given by e, = |e|'/"exp(27ij/n). Clearly, for the vectors y,; we 
have ||y, — e,|| = O(je|'“”) where e, = (1,0,...,0)” is the eigenvector of J,,. (Here 


and everywhere in the paper we use the Euclidean norm for the column vectors.) 
We call it bad because the rate of convergence of the eigenvectors of J,(e) to e,, is 
O(|e|'“”"), which is slow for large n. 

3) continuous and good: the case of an m Xn matrix with n different and 
well-separated eigenvalues. In this case, as we shall also see later, the perturbation 
of the eigenvectors is of the same order of magnitude as the perturbation of the 
matrix. 

It is natural to classify along these lines the behavior of general invariant 
subspaces. (Clearly, an eigenvector can be identified with the one-dimensional 
subspace it generates). Given an n X n complex matrix A, a subspace M c C” is 
called A-invariant if AM CM. Here Cc _ stands for (not necessarily proper) 
inclusion. Every invariant subspace is spanned by a set of eigenvectors and 
generalized eigenvectors of A (recall that y # 0 is a generalized eigenvector of A 
belonging to the eigenvalue of A if (A — AI)/y = 0 for some j > 1). 

In the next section we shall recall some of the results on stability of invariant 
subspaces, before going to the main topic of this paper, which is treated in sections 
three and four. 


2. Stable invariant subspaces. The stability properties of invariant subspaces 
under perturbations of the matrix have been studied extensively in the last decade 
[BGK, CD, KvdMR, GLR2, GR]. A source where all this material is gathered (and 
which includes much more) is [GLR2]. We start with the definition of stable 
invariant subspaces. We shall denote the set of invariant subspaces of a matrix A 
by Inv(A). 

Let A be an n Xn complex matrix and let M € Inv(A). M is called stable if 
for all « > 0 there is a 6 > 0 such that ||B — A|| <6 implies the existence of 
N € Inv(B) with ||P), — Py|| < «. Here P,, (resp., Py) is the orthogonal projection 
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on M (resp., N), and ||X|| denotes the operator norm of a matrix X, ie., 
|| X |] = max) )-il| Xx, with |||] being the usual Euclidean norm, |[(y,,..-, Xa MI 
= £"_,\x,|’. The main result on stability is the following theorem. 


THEOREM 2.1. [BGK, CD]. Let A be an n X n complex matrix and M &€ Inv(A). 
Then M is stable if and only if for each eigenvalue of A with dim Ker(A — AI) > 1 
either M \ Ker(A — AI)” is (O) or it is Ker(A — AI)”. For eigenvalues of A with 
dim Ker(A — AJ) = 1 the space MQ Ker(A — AI)” is an arbitrary A-invariant 
subspace contained in Ker(A — AI)”. 


We denote by Ker X the kernel of the matrix X. Observe that Ker(A — AJ)” is 
just the root subspace corresponding to A, i.e., the subspace spanned by all 
eigenvectors and generalized eigenvectors belonging to the eigenvalue A. Equiva- 
lently, Ker(A — AJ)” is the largest A-invariant subspace N with the property that 
the restriction A/N has A as its only eigenvalue. 

This stability result just tells us when M is continuously dependent on A. In 
case M is stable both the “continuous and good” and the “continuous but bad” 
cases mentioned in the introduction can occur. To distinguish between the two we 
need another definition. An invariant subspace M € Inv(A) is called Lipschitz 
stable if there exist positive numbers K and 6 such that ||B — A|| < 6 implies the 
existence of N € Inv(B) with ||P,, — Py|| < K-: ||B — Al]. Obviously, this defines 
precisely what we mean by “continuous and good.” The following theorem 
describes such subspaces. 


THEOREM 2.2 [KvdMR]. Let A be an n Xn complex matrix, let M © Inv(A). 
Then M is Lipschitz stable if and only if for each eigenvalue of A either MO 
Ker(Al — A)” = (0).or M > Ker(Al — A)”. 


As the proof is illustrative for our approach to another problem (in the next 
section) we shall give an outline of the proof here. In addition to [GLR2], [GK] is 
an excellent source for the background theory used. 


Proof. First suppose that M satisfies the condition of the theorem. Then M is a 
spectral subspace; i.e., there is a contour I in C such that A — AJ is invertible for 
A &T and 


M=!I : AI —A)'da 


where we denote by Im X the range (i.e., the column space) of the matrix X. The 
matrix (1/27i){-(AI — A)~' dA is a (not necessarily orthogonal) projection. Now 
take 6 small enough such that AJ — B is invertible on I for ||B — A|| < 6. Then 
put 


1 ~] 
N= Im| = [Ar = B) aa. 


The matrix (1/27ri){-(AI — B)~' dA is again a projection. At this point we need 
the fact (the proof of which can be found, e.g., in [GLR]) that given subspaces 
M,N < C” with orthogonal projection P,, and P,, on M and N, respectively, we 
have 


Pu — Pall < Qa — Qnill, (2.1) 
where Q,, and Q,, are arbitrary (not necessarily orthogonal) projections on M and 
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N. Using this fact we obtain 


1 
P., — <||—— [ ((aI-— B)”' — (AI — A) ')da 
Pu ~ Pull <|5— f (CAT B)' = (AT A)™) | 
1 —1 —] 
— say f (AT - B) (B—A)(AI—A) da 
2714 
1 
< =—(length of I’) max ||(AZ — B)~'||max||(al — A)~'|| (|B — All 
277 Aer Woe 
So 


|Pu — Pyll < K||B — Al 
for all B sufficiently close to A where 
1 
K = —(length of P’){ max||(ar— 4)~'[[} 
7 AEP 


So M is Lipschitz stable. 

Conversely, suppose M is Lipschitz stable, but the condition in the theorem 
does not hold. It takes a little argument, which is left out here, to see that in that 
case also M 1 Ker(A — AJ)” is Lipschitz stable for each eigenvalue A. Hence, we 
can restrict our attention to the case A = J, is the n X n Jordan block with zero 
eigenvalue. Consider again the perturbation J,(<) of J, given by 


0 1 0 en | 

J(é) = 0 
Put y, = (1,¢,,...,€" ')’, where «, is an nth root of «(j = 1,...,n). For some 
k <n we have M = span{e,,...,e,} where e, is the jth unit vector. Using the 


easily proved fact that M is a Lipschitz stable A-invariant subspace if and only if 
M~ is a Lipschitz stable A*-invariant subspace, we can assume without loss of 
generality that kK < n/2. Take any k-dimensional invariant subspace N of J,(e). It 
is spanned by k of the vectors y,, and by renumbering the ¢,;’s we may assume 
N = spanty,,..., y,}. The subspace N is given by 


Y, 
N = Im} _- me = Im _" 
Yn mY Y,-« 
where 
Y;, 
Y_, = Ly--- Vel 


is the n X k matrix with the vector y, in its jth column. Thus Y, is the k xk 
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Vandermonde matrix with (¢,,..., €,) as its second row: 
1 1 1 
E, &5 _— E, 
Y, 7 e e * 

kl gk-1 4, gk l 


At this point we need to compute the last column of Y, '. It is easily done by using 
the formulas Y,' = (det Y,)~ (adj Y,) and det Y, = I], ~ (e; — ¢;). Further, com- 


puting the last column of Y,_,Y, ! one obtains that it equals (p,(é,,.. +5 Eye 
Py —1hE1>+++> €,))" where p(xis ey Xe) = UVP IVS2 +++ XR, where the sum is taken 
over all ordered k-tuples of non-negative integers a@,,...,a@,, such that their sum 


is j. Put u= (0,...,0,1, Pile... 8p)yee+s Dy _pl€ pees 8) Then u EN. 
Therefore, 
I? 


| Pu — wll? > |( Pu — Py )u ull? - 5||U — e, 
lull? ~ |ju 


> |p(e,,- - ,€,)|° 2 


j= 


| Re 


n—-k 
; D Iplep-- eI? (2.2) 
j=l 


Ta 


for ¢ small enough. Suppose p,(¢,,...,€,) = 0 for j=1,...,.2-—k. Ask <n/2 
we can form Y, the matrix given by the first k rows of a ,. Then, YY;! has a 


zero last column, so det YY, ! = 0. But det Y = e;...e, * det Y, . SO det YY,! # 0). 
Hence, at least one of Dileq -- +5 Egy ees ne Leng e,) is non-zero. As 
|pley,...5&)| = cle|//” in case pjé,,...,&,) # 0 for some constant c > 0 inde- 


pendent of ¢, this shows, in view of (2.2), that ||P,, — Py||? > C,le|“/”, where 
C, > 0 is independent of «. It follows that M cannot be Lipschitz stable. 

There is a remarkable property shared by stable and Lipschitz stable subspaces. 
Let A be an m Xn complex matrix and M <€ InwA). Let A,,...,A, be the 
different eigenvalues of A, and for each of these let I; be a small contour around 
A; with A; outside IT; for 1 #j. Denote by P; the spectral projection of A 
corresponding to I, Le., 

1 
-= ——|[(AI-A)"' da. 
‘Qi ¢ ) 
The subspace M is called strongly stable if for each sequence A, — A and any 


M,, € Inv(A,,) with dim P.M = dim P,,M,,, (where P,,, is the spectral projection of 
A,, corresponding to I’;) we have 


Pu — Py || 279, n> . (2.3) 
Replacing (2.3) by 
Pu — Pu, || < KA — All 


for some K > 0 independent of n, we obtain the definition of strong Lipschitz 
stability. Roughly speaking, strong stability means that any sequence of subspaces, 
which has a chance of converging to M (in the sense of dim P.M = dim P,,M,,), 
really does converge to M. 

As an n X n matrix has an invariant subspace of any dimension between 0 and 
n, Clearly every strongly stable (or strongly Lipschitz stable) subspace is stable (or 
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strongly stable). Remarkably, the converse is also true. Inspection of the proof of 
Theorem 2.1 given in [BGK, GLR2] and of the proof of Theorem 2.2 yields 


THEOREM 2.3. (a) M € Inv(A) is stable if and only if it is strongly stable. 
(b) M € Inv(A) is Lipschitz stable if and only if it is strongly Lipschitz stable. 


3. a-Stability. In the previous section it became clear that Lipschitz stability is 
the key notion for understanding “continuous and good”-behaved stable invariant 
subspaces. However, there remains a gap between stable subspaces and Lipschitz 
stable subspaces. To better understand the behavior of stable invariant subspaces 
we need a more refined classification of stable invariant subspaces. Given a > 0 
and M «€ Inv(A) a stable invariant subspace, M is called a-stable if there exist 
constants K > 0,6 > O such that ||B —_Al| < 6 implies the existence of N € Inv(B) 
with ||P,, — Pyll < K||A — B\l'”%. (Note that since ||P,, — Pyll| < 2 for any pair of 
subspaces M, N < C” we can actually drop the requirement on B and A being 
sufficiently close, with the expense of replacing K by a bigger constant.) Clearly, if 
M is a-stable it is also B-stable with B > a. 

As in the cases discussed before we also introduce strong a-stability. Again, let 
A,,...,Ap be the different eigenvalues of A, I; contours around A,, with A; 
outside I'(i # j). Then a stable M € Inv(A) is called strongly a-stable if for any 
sequence A, — A and any M, € Inv(4,,) with dim P.M = dim P,,M,, we have 


|Pu — Py || < K||A — A,|\'7*. (3.1) 


Roughly speaking, this means that any sequence M, having a chance to converge 
to M does so and with rate specified by (3.1). Clearly, strong a-stability implies 
a-stability. Strong a-stability was studied in [RR] where the following result was 
proved: 


THEOREM 3.1 [RR]. Let A be an n Xn matrix. Then M & Inv(A) is strongly 
a-stable if and only if the following holds. The subspace M \ Ker(A — AI)" is either 
(0) or Ker(A — AI)” for each eigenvalue X of A satisfying one of the following two 
conditions: 

(a) dim Ker(A — AJ) > 1 

(b) dim Ker(A — AJ) = 1 and a < dim Ker(A — AJ)", 
and for any eigenvalue X not satisfying (a) or (b) MO Ker(A — AI)” is an arbitrary 
A-invariant subspace of Ker(A — AI)". 


Several questions arise naturally in connection with the notions of a-stability 
and strong a-stability. First, are they equivalent? One might expect (in view of 
Theorem 2.3) that the answer is affirmative. However, we will produce an example 
showing that, in general, these notions are not equivalent. The next question then 
is: when does a-stability imply strong a-stability? We do not know the complete 
answer, but some observations concerning this question will be made later. 
Another problem is to find a description of all a-stable subspaces. 

In case of eigenvectors (i.e., one-dimensional invariant subspaces) a-stability 
and strong a-stability are the same. 


PROPOSITION 3.2. Let A be ann Xn matrix and let M © Inv(A) be stable, and 
dim M = 1. Then M is strongly a-stable if and only if M is a-stable. 
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Proof. Let M be spanned by an eigenvector y corresponding to an eigenvalue A 
of A. If dim Ker(AI — A) > 1 M cannot be stable, so necessarily Ker(AJ — A) = 
M. By a standard reduction we may assume A =0, and A=J, is the n Xn 
Jordan block with zero eigenvalue. Take A(e) the matrix obtained from A by 
replacing zero in the left lower entry by « > 0. 

Suppose M is a-stable. Then, in particular, there is an eigenvector y(e) of A(e) 
such that with M(e) = span{y(e)} we have ||Py — Pryeyl| < K\| A — ACe)||'7% = 
K : «'/*, Now 
1 1/2 
Par — Paresll > ‘WPuy(e) ~ vel > 5 > ye" 


n—-1 
> g2i/n 
j=1 


a 
llyCe)I| 


So it follows that a > n. From Theorem 3.1 one now sees that M in fact is strongly 
stable. 

The rest of this section is devoted to producing an example of an a-stable 
subspace which is not strongly a-stable. Consider 


1 0 0 


A= M = span{e,,e,}. 


oo Oo © 


0 1 O 
0 0 1P 
0 0 O 


By Theorem 3.1, M is not strongly a-stable for any a < 4. We shall show that it is 
a-stable for some a@ < 4. 

Consider a general perturbation A’ of A. By applying a similarity transforma- 
tion close to the identity we can always reduce to the case 


a 1 0 0 
,_ 1b 0 1 O 
A c 0 0 1] 
d 0 0 O 
Actually, one can achieve this reduction by means of a similarity that depends 
analytically on the entries of A’ (see [A]). The eigenvalues of A’ are the solutions 
of the equation 


Mad’ — bX -— cA -—d=0. (3.2) 


In the sequel we shall use the fact that one can solve (3.2) explicitly. In fact, using 
Cardano’s method we find two roots A, and A, of (3.2) such that A, + A, = (a@/2) 
— a where a* = 2y + (a*/4) + b and y is any solution to the cubic equation. 


2 


a 
2y+—+b 


2 
—_— = 4 
(ay —c) 4 


(y? +d) (3.3) 


(see, e.g., [M, pp. 671-676] or [BS, pp. 160-164]). Equation (3.3) can be rewritten 
in the form 


ac + 4d 


3 1 2 
+ by? + 
y> + xby ri 


yt 3((a? + 4b)d — c*) = 0. 
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Substituting w = y + (1/6)b leads to the equation w° + pw = q = 0 in w, where 


ac + 4d I GB 4) 
pe" 12 
+b? b(ac+ 4d), 
— —_ — —______ 4. 1((q? + 4b)d — c”). 3.5 
q 108 AA + *((a ) Cc ) ( ) 


Our first aim is to show that a, and hence also A, + Aj, is O(||A — A’||'7°~*) for 
arbitrarily small « > 0. For this we need the following lemma. 


LEMMA 3.3. Let p and q be complex numbers depending polynomially on complex 
parameters z = (z,,...,Z,,) such that | p(z)| < C,||z\|, |¢(z)| < Cy|lz||* for \\z|| small 
enough, where the positive constants C, and C, are independent of z. Given any 
5 > 0 all solutions w of the cubic equation w’ + pw +q=0 will satisfy |w| < 
C;||z||°/°~°, for ||z|| sufficiently small (and C3 > 0 independent of z). 


Proof. Recall that w is found as follows. Put 


Se Gee 


Then w = u + v, where the cubic roots in u and vu are chosen such that uu = —p/3 
(see e.g., [M] pp. 671-676 or [BS] pp. 160-164). We now consider two cases 
separately. 

(1) |p(z)| > C,||z||4/°~? for some 5 > 0. 

(2) |p(z)| < Cy ||z||*”°~® for some 6 > 0. 

(We denote by C; positive constants independent of z.) 

In case (1), we write 


Then 
q 1@ 5 @ 
B 9B 162 B 


= 
I 


1 1 
3 


V-B + VB-a=V-B + VB 


13, q 
3185 + a 


Clearly this series converges, as |q| < |8| for sufficiently small ||z/||. Indeed, 
Boi 1 1 p° 
—_ s_— ——_—— — + ——_ , 
q 2 4 27 q 


4-38-4 _ —36 
> Cs ° [IZ = C6 °|[z||™ > © as |[z|| > 0. 


A/S 


and 


g 


Thus, B/q — as ||z|| — 0, and consequently, |q| < || for ||z|| small enough. Now 
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for sufficiently small ||z|| we have 


|q| 


[w| < 7 Ep? , (3.6) 


On the other hand, 


SVG) 


So for ||z|| sufficiently small 


3 
p=o(|" 
q 


4-35-2 _ 2-35 
> CgllzZ|| = C,[|zZI|"~°. 


So 
2/3 4/3-—26 
|B°7°| > Cyllzi"7°-*, 
and thus (using (3.6)), |w| < Cy, ° ||z||?/°**°. This finishes case (1). 
In case (2) we estimate 


3 
2-3/28 2/3-1/28 
<Cpp|[ZI°°7? 80 |u| < Cyslfz|[°7° 17°. 


Likewise, also 


2/3-1/28 
JV] < Cyq [ZIP 17°. 


Thus, 
Iw] < Cysllz 77°77”. 
This proves case (2). 

Returning to our original problem, we see that p and q given by (3.4) and (3.5) 
indeed satisfy the hypothesis of the Lemma 3.3. Thus, |w| < C||A — A’||°/°~° for 
|| A — A’|| sufficiently small. Hence, the same holds for y, and thus, with a* = 2y 
+ (a?/4) + b, we have |a| < C,||A —A’||'/°-* for some arbitrarily small « > 0 
provided || A — A’|| is sufficiently small. Now, pick A, and A, two eigenvalues of A’ 
such that 


a 
A, + Aa|=|5 — a] < CIA ~ att 


Let x = (1, x1, ¥2, V3)’ and y = (1, y,, y>, y3)” be the corresponding eigenvectors; 
then 


1 , 0 0 0 
Xr) ty 1 5 0 10 
X2| {Aj Oy Ay 1 “Vo} 
X3 AS vN dy 1 


and a similar formula holds for (1, y,,y¥>,y3)’ replacing A, by A,. Assuming 
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A, # A,, consider (x — y)/(A, — A,) = u. We have: 
0 0 


0 

x—y 1 0 0 
os A, +A; aly — 414 
Mo+A,A, + 2% A, +A, 1 


So, if we put M’ = span{x, y} = span{x, u} we have M’ = Im| 4, where 
MA — ad, —b-A (A, + Az) +A,4 A, +A, -a 
Ay — aay — bAL — 6 — AG(AE + AYA, +A4-— GA, — GA, —b) AE + AAQ + AZ -a(A, +A) - 5 


o-[4 Sh 2-[6 Sh 


Then Q” = Q' is a projection onto M’, while Q is a projection onto M. Clearly for 
|| A —A’|| sufficiently small ||Q’ — Q|| = ||R|| < C||A — A'|'73~*, as the dominant 
term in R is in its upper right corner. Then (using (2.1)) we obtain 


' nl /3—e 
Pu — Pull < 12 — Q'I < CIA — Air’. 
We have proved that for any positive « there exists a constant C > 0 such that 


every 4 X 4 matrix A’ with distinct eigenvalues has an invariant subspace M’ such 
that 


Now let 


Pu — Prrll < C\|A — Al'?e-*. 


This does not mean yet that M is 3(1 — 3¢)~'-stable, because we have to account 
for matrices A’ with multiple eigenvalues. This can be done as follows: 

Since the set of matrices with distinct eigenvalues is dense, given any 4 X 4 
matrix A’ there is a sequence of 4 X 4 matrices {A‘,}" _, with distinct eigenvalues 
such that lim,, _,,.4’, =A’. By the already proved statement for every m there 
exists an A’ -invariant subspace M/ such that 


Pu — Pr, Il < Cl-A — Aj, -*. (3.7) 


Now we make use of the fact that the set of all subspaces in C” is a complete 
compact metric space in the metric given by ||P,, — P,||, where M,N are sub- 
spaces in C”. Moreover, a subspace N which is a limit (in this metric) of a 
sequence of subspaces {N_}” _, can be described as 


CO 
m=l1 


N= { yv € C| there is a sequence of vectors { y,,,} 


such that y,, © N,, forall mand lim y,, = x}. 
wn 00 


A proof of this fact can be found, for example, in [GLR1] or [GLR2]. Returning to 
the sequence of subspaces {M/}" _, introduced above, choose a converging subse- 
quence {M,, J,_, with a limit M’. Using the description of a limit given by (3.8) we 
easily verify that M’ is A’-invariant. Passing to the limit in (3.7) (with m replaced 
by m,) we obtain the desired inequality 


Pu — Pll < C\|A — Aer’. 
We have proved that M is 3(1 — 3<)~'-stable for every ¢ > 0. 


F 
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Clearly, for « small enough 3(1 — 3<)~' < 4. This shows that although M is not 
strongly a-stable for any a < 4, it is a-stable for every a with 3 <a < 4. 


4. The stability index. It follows from Theorem 3.1 that every stable A- 
invariant subspace is strongly a-stable (and hence also a-stable) with a@ equal to 
the height of A. The height of A is defined as the size of the largest Jordan block 
in the Jordan canonical form corresponding to eigenvalues A of A with dim Ker( A 
— AI) = 1. (if for all eigenvalues A of A one has dim Ker(A — AJ) > 1 we put 
the height of A equal to one.) 

We next define the stability index of a stable A-invariant subspace M to be the 
infimum of all @ > 0 such that M is a-stable, and this number we denote by 
s(M, A). To avoid trivialities we will apply this notion only to the non-trivial (Le., 
different from (0) and the whole space) stable invariant subspaces. 

In this section we study the stability index of a stable invariant subspace. Here 
too, many questions remain unsolved. We do give some bounds and estimates, 
which allow us to compute the stability index in special cases. 

A standard reduction (see e.g., Section 15.3 in [GLR] or Section 8.1 in [BGK]) 
establishes the following fact. 


PROPOSITION 4.1. Let A,,...,A, be the distinct eigenvalues of the n X n matrix A 
such that dim Ker(A — A,J) = 1. Put R; = Ker(A,;I — A)”. Then for any stable 
M € Inv(A) we have s(M, A) = max, .;-,5(M 1 R;, A/R;), where A/R, stands 
for the restriction of A to its invariant subspace R,,. 


The next proposition gives upper and lower bounds for s(M, A). 


PROPOSITION 4.2. s(M, A) is less than or equal to the height of A, and s(M, A) > 
1, provided (0) # M # C”. 


Proof. The fact that s(M, A) does not exceed the height of A is a consequence 
of the previous proposition and the remarks in the first paragraph of this section. 

To show s(M, A) > 1, assume the contrary. By Proposition 4.1 we can assume 
without loss of generality that A =J, is the nm Xn Jordan block with zero 
eigenvalue and M = spanf{e,,...,e,}, k <n. Let J,(e) be the matrix obtained 
from J, by putting « > 0 in the left lower corner. By assumption, there is a 
J,(é)-invariant subspace M(e) such that || Py — Pry.)|| < C||A — A(e)||'”% = Ce'/* 
for « small enough. Let y(e) = (,¢,,...,¢7 ')’ © M(e) be an eigenvector of 
J.(e) corresponding to its eigenvalue «, which is an nth root of «. Then, since 
y(e) € M(e), we have 


(Par — Puce) XC) |] = Pux(e) — x(e)|| < Ce'/4\Lx(e) I. 


Thus, in particular considering the (k + 1)st coordinate of P,,y(e) — y(e) = e* 
we have 


S 


jJ=0 


1/2 
jelk/" = Jet < Celv* a < Cel/e 


for e« small enough, where C’ > 0 is independent of «. Clearly this is impossible for 
a <1. 
Next we state another useful fact, which is easily verified. 
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Proposition 4.3. If M € Inv(A) is stable then M~ e& Inv(A*) is stable and 
s(M, A) = s(M~ , A*). 


By Proposition 4.1 the study of the stability index is reduced to the case where 
A isan n X n Jordan block with a single eigenvalue. Without loss of generality this 
single eigenvalue can be taken. to be zero, as one sees by considering A — Al 
instead of A. So we can assume A = J,. For k = 1,...,n — 1 let s(k,n) be the 
stability index of the unique k-dimensional invariant subspace of J,. (We ignore 
the trivial cases k = 0 and k =n.) From Proposition 4.3 it follows that s(k,n) = 
s(n — k,n), and hence we can assume k < (n/2) without loss of generality. In 
general, we are unable to find the exact value of s(k, n). Inspection of the proof of 
Theorem 2.2 shows that s(k, n) > n/k, and that actually one can say s(k, n) > n/jp 
where j, = min{j|p(¢,...,¢,) #0 for all choices of different ¢,,...,¢, out of 
E1,-.+,€,}. However, these bounds are not very satisfying. 

The next theorem is of more interest and is our main result. To state it we need 
the following definition. Given two positive integers k,n with k <n, we say that 
the pair (k,n) is admissible if no sum of k distinct nth roots of unity is equal to 
zero. In particular (1, n) is admissible for all n > 2. 


PRoposiTIONn 4.4. If n is prime, then (k,n) is admissible for every 1 < k <n. 


Proof. Use the fact that the cyclotomic polynomial y”~' + y”~* + +:: +1 is 
irreducible over the field of rational numbers (see, e.g., [McD]). Assume now that 
the primitive root £ = cos(27/n) + isin(27/n) satisfies an equation of type 
C™ + 6 tees +O™ =) (n-1Lem,>m,> -::: >m,>O0 are integers). 
Then the cyclotomic polynomial must be a divisor of vy”! + y"2 + ++: +y’*« (see 
e.g., the proof of Theorem 38.1 in [McD]), which is possible only if k =n. This 
proves the proposition. 


THEOREM 4.5. Let k <n/2. Assume there is a smallest integer p,0 <p <k — 1, 
such that (q,n — p) is admissible for all integers q with k —-p<q«<k. Then 
s(k,n) >n — p. 


Proof. For ¢ small and positive, let A(e) be the matrix obtained from J, by 
putting « in the (n — p,1)-entry in place of zero. 

The eigenvalues of A(e) are zero and the (n — p)th roots of «, call them 
Niy+++5Ny—-p The eigenvectors corresponding to 7,,...,1,—» are 


_p-1 T _ _ pe T 
y= (1m... 07 7 1,0,...,0) 5.66, Mp = (1 tn pr 02 1,0,...,0) 
and there is a Jordan chain corresponding to the eigenvalue 0: 

C1 — €€n 41> 2 — El€n p42. ++ +9 ly — Els 


where by e, we denote the unit coordinate vector with 1 in the qth position and 
zeros elsewhere. Consequently, every k-dimensional A(e)-invariant subspace has 
the form 


N= span{y;,.--; Vir 1 — €€n—pa isn — E€n—pyrs+ +9 Ck—g — E€n —pitk—gs 


for some g(k —p <q <n-—p;q <k) and some choice of the vectors Vier Vi, 
among Y,,...,Y,—p- Introduce also the subspace M = spanty,,..., Yj.» 
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€1,€n,-++, ex a}. Clearly, 

Py — Pull < Kye (4.1) 
for some constant K, independent of «. After some elementary transformations of 
the set of spanning vectors for M, we obtain that M is spanned by e,, €9,...,€,_, 
and by the vectors 

h, = Ck-—q+1 +1; eK—g+2 t ; oem ae 'Cn—p> t= 1,...,4 


where s =n — p — (k — q). (It follows from our assumptions on & and p that s is 
positive.) Denoting by M, the k-dimensional J,-invariant subspace, it is easily seen 
now that 


Pu, 7 Pull — l|_Pepant f, pees fas 7 Pepantey—g +1 yess ell 


Consider the n X q matrix W with columns f,,...,f,: W=[f,, fa,...,f,], and 
observe that the rows k — q + 1 through & in W form the Vandermonde matrix 


1 1 1 
N, Ni, CO Ni, 
Wo= |: : Df, (4.2) 
nie 1 ni 1 wae ni l 
iy in i 
which is invertible. A computation (see e.g., [GLR, Section 16.5]) shows that the 
matrix WW," has the (k + 1, q)-entry equal to ,, +7; 
Now use the assumption that (q,n — p) is an i nissible pair ‘since k-p<q 
< k). It follows that there is a constant K > 0 such that |A, + +A,| > K for 
any choice of q (n — p)th roots of unity A,,...,A,. Then also 
Ini, too +0;,) > K- el (4.3) 
for any choice of the gq numbers 7,,... ,;,. Now consider the last column in 
WW, '. Call this vector y. Clearly y € span{ fp... fj} and y is of the form 
(0,...,0,1,;, + ++ +7;,,...)" where the first k — 1 entries are zero. Then (4.2) 
gives 
1 
Mo M|| 2 yl] Pypantf;, pees fyX — Pepantey_ —gtlerres e,jX 


1 
= —|ly —e,|ll> —-|n, + °°: +7; 1. 
bal em Wy q 


For ¢ small enough we obtain by using (4.3) that 


Pu, — Pull = Ky|e|'/"P? 


for some constant K, > 0 independent of ¢. Taking into account (4.1), we get 
Pn — Pull = 2k 3)e|1/" P (4.4) 


for any k-dimensional N € Inv( A(<)), where K, > 0 is independent of both ¢ and 
N. Inequality (4.4) shows that M, € Inv(J,,) cannot be a-stable for a <n — p, 
which proves s(n, k) > n — p, and our theorem is proved. 
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As a corollary to Theorem 4.5 and Proposition 4.4 we have 


COROLLARY 4.6. Let r be the smallest number such that n —r is prime. Then 
s(k,n) >n-—rforall kA <k <n -— 1). In particular, s(k,n) =n for prime n. 


It follows, for example, that s(2,4) > 3, and combining with our results ob- 
tained in the example in the previous section we see that s(2, 4) = 3. 

In terms of the stability index we can say precisely when a-stability for a 
subspace M implies strong a-stability. To state the result we need the following 
notion. Given an n X n matrix A and M € Inv(A) we introduce the height of A 
with respect to M as follows. Let {A,,...,A,} be the set of all distinct eigenvalues of 
A with dim Ker(A ,J — A) = 1, and let o = {ijl <i <r,0 # MN Ker(A,I — A)" # 
Ker(A ,J — A)"}. Then the height of A with respect to M, denoted by h(M, A) is 
the height of A|,, where N is the sum of the subspaces Ker(A,J — A)", ica. 
Again, the height of A with respect to M is defined to be 1 if o is empty. With 
this definition we have the following result. 


THEOREM 4.7. The following two statements are equivalent for a stable A-invariant 
subspace M: 

(a) s(M, A) = h(M, A) 

(b) For any a: M is strongly a-stable if and only if M is a-stable. 


Proof. The case when o = ¢ is easy. In this case M is the sum of root subspaces 
and hence Lipschitz stable and strongly Lipschitz stable by Theorem 2.2, hence 
Theorem 4.7 follows from the definition of h(M, A). So we assume o # ¢ from 
now on. Clearly, both s(M, A) and h(M, A) as well as a-stability and strong 
a-stability for M@ depend only on MO N and on the restriction A|,,, where N is as 
above. So we can assume A=J,@::: @/J,, M=M,@®-:- @M., where M, 1s 
J-invariant with 0 # M, # Ker(A,;J — A)”. We also can arrange it so that n, > n, 
> ++: >n,, in which case h(M, A) = n,. 

Further, M is a-stable or strongly a-stable with respect to A if and only if each 
M, is a-stable or strongly a-stable with respect to J;. 

Now suppose (a) holds. Then if M is a-stable a must be larger than or equal to 
n,. Then certainly a >n, for 1 <i <r. By Theorem 3.1 it then follows that M is 
strongly a-stable. As strong a-stability implies a-stability always, this proves (b). 

Conversely, suppose (b) holds for any a. Then, if M is a-stable, it must be 
strongly a-stable and hence Theorem 3.1 implies that a =n, for each n,. Thus, 
a > h(M, A). This shows s(M, A) > h(M, A). But clearly M is h(M, A)-(strongly) 
stable. So s(M, A) = h(M, A). Note that we use here a sharpening of Proposition 
4.2. The theorem is proved. 

We have introduced the notion of stability index and have produced some 
bounds on the stability index. These bounds allowed us to find the precise value of 
the stability index in some cases. It is an open problem to find the stability index in 
the general case. 


5. Open problems. We summarize here some open problems (including those 
mentioned already in the text) that arise naturally in the context of a-stability. 


Problem 5. For a given a, describe (in terms of the intersections of M with the 
root subspaces Ker(A — AJ)”) all a-stable subspaces M of a matrix A. 
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Problem 5.2. Describe the relationship between a-stability and strong a-stabil- 
ity. In particular, a more concrete problem is: When does a-stability imply strong 
a-stability? Again, the answer should be given in terms of intersections of the 
invariant subspace in question with the root subspaces. 


Problem 5.3. Find the stability index of every stable A-invariant subspace M, in 
terms of the dimensions of Ker(A — AI)” and MN Ker(A — AI)”, for every 
eigenvalue A of A. In particular, is the stability index always a positive integer? 

The solution of Problem 5.3 hinges upon solution to the following: 


Problem 5.4. Find s(k,n) for all positive integers k andn,1<k<n-—1. 

Again, partial answers and bounds for s(k, n) are found in the text. It follows, in 
particular, that s(k,n) =n for all (k,n) with 1<k<n-—1;2<n<5; except 
that s(2,4) = 3. Also, s(k,6) = 6 if k= 1, 5 and 5 < s(k,6) < 6 for k = 2,3,4. 
Hence the first concrete open question is: 


Problem 5.5. Find the values of s(2, 6) = s(4,6) and s(3, 6). 

Acknowledgments. We are grateful to Prof. M. A. Kaashoek for his interest in the material of this 
paper and for his suggestion to write it up. 
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LETTERS TO THE EDITOR 


To the Editor: 


In a recent note [1], Friedberg gives counterexamples to the principal axis 
theorem (‘‘a real symmetric matrix is orthogonally equivalent to a diagonal matrix”) 
over some fields other than the reals, and he concludes by asking for which fields 
the theorem is true. The answer [2] is that it is true precisely when the field is real 
closed or an intersection of real closed fields. Furthermore, the principal axis 
theorem is actually true if we make only the seemingly weaker assumption that 
every symmetric matrix over the field is similar to a diagonal matrix. The fields in 
[2] are assumed to have characteristics different from 2, but it is easy to see that 
the theorem can never hold in characteristic 2: any 2 X 2 symmetric matrix there 
with all entries equal is nilpotent and thus cannot be diagonalized. 
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William C. Waterhouse 
Penn State 
University Park , PA 16802 


To the Editor: 


In his paper (this MONTHLY, vol. 95, pp. 950-952), L. J. Gerston introduced a 
new algorithm of computing the rank of a matrix. In fact, this method had been 
published in 1978, by Professor Xie Bangjie of Jilin University [1]. In the book he 
referred to it as depression of order. 
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Jiangsu , China 

To the Editor: 


It was good to see the note by T. W. Hungerford in the MontTHLy (January, 
1990). As he said, some presentations of Galois theory consider field extensions of 
the rationals given by radicals and claim that they will be Galois when appropriate 
roots of unity are adjoined. He gave an example to show that this does not have to 
be true with the “obvious” choices for appropriate roots of unity. I want to 
reinforce his warning by offering another example. Take Q(i) (with i* = —1), 
and let K = Q(i)\(u) with u’? = 2 + i. Clearly 2 +i is not a cube in Q(i), since 
(2 + i)(2 +i)=5 is not a cube in Q; thus K has degree 6 over Q. I claim that it 
will not become Galois no matter what roots of unity are adjoined. 

To see this, we first need to know a little about L, the smallest Galois extension 
of @ containing K. Clearly L must contain a cube root of 1, say ¢. The complex 
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conjugation on (7) extends to an automorphism of L, which will take u to a cube 
root v of 2 —1i. Hence L contains a cube root of 5, and the Galois extension 
M = Q(é,°V5) is inside L. It is easy to see that L is the composite K - M. A little 
more work shows further that L has degree 36 over Q, and hence KN M = Q. 
I omit the proof; elementary ones are possible, though it’s faster to use the fact 
that 2 + i and 2 — i are relatively prime elements in Z[i]. 


THEOREM. Let E be any extension of © with abelian Galois group. Then K- E 
does not contain L. 


Proof. If K- E contains L, then of course K: E = L : E; hence the field L- E 
is the composite of two Galois extensions and is Galois over ©. Let G be the 
Galois group Gal (L : E/Q). Let N, be the normal subgroup fixing E, and define 
N, and N,, similarly. Let H be the subgroup (not normal) that fixes K. Since 
KOM=Q, we have H-N,, = G. Hence H maps homomorphically onto the 
quotient G/N, = Gal(M/Q) = S,. However, since K:E =L- E, we also have 
HON, = {1}, and hence H must map one-to-one into the abelian group G/N, = 
Gal(E /Q). This contradiction proves the theorem. O 


The theorem yields what we want, since roots of unity generate abelian 
extensions of @. The example also illustrates exactly what is wrong in the 
erroneous attempted proofs: if we have an extension of © containing elements 
(like 2 +i and 2 —1) conjugate over Q, their roots may well give extensions 
differing by more than roots of unity. 


William C. Waterhouse 
Penn State 
University Park, PA 16802 


To the Editor: 


Dr. Larry Zalcman has pointed out some errors in my “Putnam Trivia” article 
(96 (1989), 711-713). The answer to question 11 should have been: Arthur Rubin 
and D. J. Newman. The answer to question 23 should have been: City College of 
New York, Stanford, and the University of Maryland. The answer to question 17 
should include a second place finish by Dartmouth in 1962, since at that time they 
were strictly a four year liberal arts college. I regret having made these errors. 


Joe Gallian 
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A New Derivation of Stirling’s Approximation to n! 


GEORGE MARSAGLIA 
Supercomputer Computations Research Institute and Department of Statistics, Florida State University 


Joun C. W. MARSAGLIA 
Department of Computer Science, Western Oregon State College, Monmouth, OR 


Stirling’s formula, n! ~ n"e~"y27n, perhaps the most widely used relation in 
asymptotic analysis, is usually mentioned in Calculus II courses. It is the first term 
in an asymptotic series whose derivation requires deeper methods than are 
ordinarily covered in Calculus II. It would be nice to have a development that only 
requires methods within the scope of the usual Calculus IJ course. That is the 
purpose of this note. 

The classical method for developing Stirling’s approximation uses Euler’s sum- 
mation formula: 

n m B, 

FL) + FQ) + HF0n) = PFO) + DU) - P10) +R, 

k=1° 


with the B’s Bernoulli numbers and f(x) = In(x). This leads to 


1 1 
aa 12n 36003 * 
The development is somewhat delicate. It requires establishing that the constant c 
is Inv27 (missing from Stirling’s original development). Then the asymptotic 
series for In(n!) with the Bernoulli numbers must be exponentiated to get the 
asymptotic series for n!. See, for example, Knopp [1, Chap. 14]. 

We will derive an asymptotic series for n! directly, without recourse to Bernoulli 
numbers and Euler’s summation formula for In(v!). No new series will appear, of 
course, since a function can have at most one asymptotic series, but the derivation 
appears more direct. It involves only elementary transformations of integrals and 
term-by-term integration of a series. 

Write 


In(n!) = In(1) + «++ +In(n) = In(n) -n+c+ 


oe) of Xx n 
ni= | x"e*dx =n"e" [etn] dx. 
I I 
Then make a change of variable, y = x/n, to give 
ni= nr*temn f (ye!-¥)" dy, 
0 
which may be written 
1 n 2 n 
nim ne" f (ue'~“) du +f (ve'~”) av}, (1) 
0 1 


The idea, after Watson [3] and Marsaglia [2], is to break the function ye!~” into 
two monotone parts, ue!~“, increasing for 0 <u < 1, and ve!~’, decreasing for 
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1 <v < ~, Consider the transformation defined by the relation 


This defines a one-to-one relation between v in [1,) and z in [0,). We cannot 
express v explicitly in terms of z, but we may assume a power series, 


v=1+a 24+ 4,27 +432° +ayz* t+, 


then find the coefficients by this device: Let E(z) represent e7 227 Differentiating 
ve'—” = E(z) with respect to z gets v’E(z)/v — v'E(z) = —zE(z) and hence 
u'(v — 1) = zv. Multiplying the resulting series and equating like powers of z 
establishes that aj = 1, a, = 1/3, a, = 1/36a, and, for n > 3, 


(n + l)aya, =a,_, — 2a,4,_, — 3a3a,_.— +++ — (n— )a,-1a). (3) 


There are two choices for a,: 1 or —1. Since v increases with z, a, = 1 is the 
right choice. This gives a, = 1/3, a, = 1/36. Then a,,a;,... are completely 


determined by the above recursion. The coefficients a,,a,,...,a@,; are 
1 1 -1 1 1 — 139 1 
' 3° 36 270° 4320’ 17010’ 5443200” 204120’ 
—571 — 281 163879 


2351462400’ 1515591000 ’ 2172751257600 


On the other hand, the choice a, = —1 leads to the same numerical values for 
subsequent a’s, except that it changes the sign at the odd indices. But this choice 
provides a series for the u defined by ue'~“ = e7 2° with u in [0, 1]. In summary: 
if a, = 1 and a,,a,,... are determined as above, then 


v=1lt+a,z+a,z* + 43,2°> + a,z4* + a52° + °°: 
. : — yp _ 1,2 . . . 
is the solution to ve!~* = e~ *, with v in [1, ©), while 

u=1—a,2 + ay2* — a32° +: A42z* — gz? + 


is the solution to ue'~“ =e~* with u in [0, 1]. 
We are now ready to make the transformation of integrals in (1). Let 


u=1-—a4,2 + 4,27 —a,z°>+°°° du/dz = —a, + 2a,z — 3a3z7 + °°, 
with boundary conditions 
=o when u=QO0; z=0 when u=1l. 


This transforms the left integral in (1), (after changing dz to —dz and switching 
limits), to 


1 0° 
J [ue!-“]° du = / e~"2"(a, — 2anz + 3a,z? — 4a,z3 +--+) dz. 
0 0 
To transform the second integral in (1), let 
v=l+a,zta,z?+a,z°+°°:, dv/dz =a, + 2a,z + 3a3z* + °°, 
with boundary conditions 


z=0 when v= 1; z=o when v=o, 
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This transforms the second integral to 
f [vel] dv = i e-P"(a, + 2a,z + 3a3,z* + 4a,z°> + °°: ) dz. 
1 0 


The odd powers of z cancel on adding these two integrals. Then term-by-term 
integration, using 


oo ) k+4 1 
2f e me z2k*az=1—| Tlk+— 
[oe we de — 
yields, from (1), 
2\2 1 2\? /3 2\2 (5 
ni=n"*te“")|—| T{ —] + 3a,{—] T] =|] + 5a, -—] Tl =] + °°: I. 
n 2 n 2 n 2 


Finally, factoring out the first term, with [(4) = yz, [(3) = 5TG), etc., gives the 
desired expression for n!: 


3a 3-5a 3-5°-7a 3-5-7: 9a 
ni=n'e "y2ian ° ° Py Dg i, 


1+—+ ee 
n n* n> n* 
with the a’s determined by a, = 1, a, = 1/3 and recursion (3). 
If we write this asymptotic expansion in the form 


_ Cy Co C3 C4 
ni=n"e "Pan {t+ + + + Se), 
n n 


nr ne 
then coefficients c,,C5,...,Cg are: 


1 1 —-139 —571 163879 5246819 


— 534703531 — 4483131259 432261921612371 
902961561600 ’ 86684309913600 ’ 514904800886784000 — 


The sign pattern is + +——. 

In the classroom, students would be reminded that the above is an asymptotic 
series. It does not converge, as do most of the series they have encountered, but it 
still provides valuable information about n!. Successive partial sums approach the 
true value, n!, then diverge. The approach is erratic. Improvement is not mono- 
tonic, nor do successive sums, or pairs of sums, bracket the target value. 

Multiple-precision computer routines are necessary for demonstrating the ap- 
proach to n! before divergence, however, unless n is small. For example, 69! is an 
integer of 99 digits, the limiting size for scientific notation in many hand calcula- 
tors. It begins 17112245... and ends ...4224000000000000000. Let Sp, S,,... be 
successive partial sums for Stirling’s series, so that S$, =n"e "V2mn, S, = 
n"e~"V/2an(1 + gn), etc. Then for n = 69, S, gives 0.1709159 x 10°, accurate to 
some 3 places, while S, provides about 6 places. Going much further, $,, provides 
48 places of accuracy, S;, provides 70, S,, provides 94 and Sj, provides 111 
places of accuracy: Si9, = 69!— 0.22 x 10~'°, while Sj); = 69!+ ¢ with 0 <«< 
107 7°, 

The erratic behavior, then divergence, of the series may be demonstrated for 
small values of m, which need not be an integer, of course, but only for integers do 
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we know n! exactly. Consider n = 4. Then 4!= 24, S) = 23.506... . Next, S$, = 
23.995 gives about 4 places of accuracy, and so on to the best sum, S$, = 
24.00000000002184.... After that, the sums slowly get less accurate, with 
S65, = 24.079, Sip = 28.026, S5, = —6.281, Sj, = 276.44, $5, = — 2098.1, and then 
chaos. 


Knopp [1] is a good source for properties and examples of asymptotic series, 
§65-66, pages 535-555. 
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Fibonacci Numbers and Aitken Sequences Revisited 


M. J. JAMIESON 
Department of Computing Science, University of Glasgow, Glasgow, Scotland 


Phillips [1] has shown that Aitken’s transformation applied to the sequence of 
ratios of consecutive Fibonacci numbers yields further ratios of Fibonacci numbers 
and that this remains true if the transformation is applied repeatedly to the new 
sequences so obtained. He has shown also that the secant and Newton Raphson 
methods for solving the quadratic equation 


x*-x-1=0 (1) 


satisfied by the sequence limit (the golden ratio 7), yield estimates which are ratios 
of consecutive Fibonacci numbers if the starting values (two for the secant method, 
one for Newton Raphson) are such ratios. The purpose of this brief note is to 
generalize these results to ratios of every pth Fibonacci number. 

The Fibonacci numbers are given by 


Uiy1 =U; + Uj_1; i> (2) 


with u, = 1 and u, = 1. (The initial subscripts are chosen here to make subse- 
quent notation simpler.) There are p different possible sequences of ratios of 
every pth Fibonacci number depending on the choice of denominator from 
U;,Uz,...,U, for the first one. These sequences, {w{}, are given by 


k __ . 
wi y= Un i+ty—k/Upi-k> i=l (3) 


where k in equation (3) and subsequently in this note lies in the range 0,1,..., 
(p — 1). In Phillips’s sequence p has the value unity. The limit of the sequence of 
ratios of consecutive Fibonacci numbers satisfies the quadratic equation (1). We 
wish now to find a quadratic equation to be satisfied by the limit of the sequence 
{w‘}. It is well known that 


u, = [ri -(-7) "| V5. (4) 
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From equation (2) we may prove by induction that 


Dp . 
Ui+p = Up); ~~ (- 1) Uj_p> l > D (5) 
where vu, depends only on p being the pth term of the series of Lucas numbers 
1,3,4,... which satisfy the same recurrence relation (2) as the Fibonacci numbers 


but have different starting values. Dividing equation (5) through by wu, and 
replacing i by ip — k we find 


ws) =v, — (-1)°/w™,, i > 1. (6) 


The limit of the sequence {w‘} as i tends to infinity therefore satisfies the 
quadratic equation 


x* —v,x + (-1)" =0. (7) 


From equations (3) and (4) we see that this limit is 7? which is independent of k. 
From the properties of the roots of a quadratic or from the properties of Lucas 
numbers we can determine uv, explicitly and replace equation (7) by 


x? —[7? + (-7) *]x+(-1)’ =0. (8) 


Let us consider the secant method approximation to a root of equation (8). We 
construct a new estimate to the root from two existing estimates. Let these be 
drawn from all the possible sequences {w{*?}; they could be, for example, w‘ and 
w‘ in which case the secant estimate is 


| wh —(- 1)”|/| ws + wh? — 7? — (-7) | = who ifk+I1<p 
=witl ifk+l>p (9) 


where / is in the range 0,1,...,(p — 1). Equation (9) is obtained from equations 
(3) and (4). Hence we always get another ratio of every pth Fibonacci number. We 
note that only with k = 1 =0 do we find that w{”, w{? and the secant estimate 
belong to the same sequence ({w{} in this case). 

We now consider the Newton Raphson approximation to a root of equation (8). 
We construct a new estimate to the root from w‘“’. The formula is the same as the 
left-hand side of equation (9) with w{” replaced by w{”, and again we obtain 
another ratio of every pth Fibonacci number; it belongs to the same sequence as 
w'*) only for k = 0. 

Finally we apply Aitken’s acceleration formula to w{?,, w and wi. From 
equation (9) we have 


wy) = (-1)" + | ws + wh) — 7P — (-r) *|we if 2k <p 
= (-1)? + [wh + w© — 7? — (7) P| WRK if 2k = p. (10) 
The Aitken transformation is 


k k k k k k k 
[ wit, — ww] /[w, — 20 + wi®,] 


itm "i-m 
= wZ) if2k <p 
=wFt> if2k>p (11) 


where equation (11) is obtained from equation (10) by first substituting i + m for i 
and i—m for j and then i for i and i for j. We obtain a ratio of every pth 
Fibonacci number which belongs to the same sequence as the estimates put into 
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Aitken’s transformation only for k = 0. With m set to unity we see that the 
transformation applied to members of the sequence {w} produces further mem- 
bers of that sequence, namely the subsequence {w$?}. By replacing i by 2, 4i, 8i,... 
and m by 2,4,8,... in equation (11) we see that subsequent applications of 
Aitken’s transformation to the new subsequences so obtained yields the subse- 
quences {w{}, (wY}, (w(},.... 

In conclusion we see that Phillips’s results can be generalized to ratios of every 
pth Fibonacci number. There are p different sequences of such ratios and the 
application of the secant, Newton Raphson and Aitken acceleration methods 
return results in the same sequence only for one of these, namely the sequence 
Uyp)/Uy,Uz,)/Uzp,--- - 

The conclusions are illustrated below for p = 2 in which case v, of equation (7) 
has the value 3. 


TABLE. The sequences {w!”} and {w?} for p = 2. 
l 1 2 3 4 5 6 7 8 9 10 
u 1 1 2 3 5 8 13 21 34 55 


w) 3/1 8/3 21/8 55/21... 
w) 2/1 5/2 13/5 34/13... — 


The secant method applied to w® and w® yields 21/8 or w, applied to w{” and 
ws) yields 13/5 or w$? and applied to w{? and w$? yields 8/3 or w% in 
agreement with equation (9). The Newton Raphson method applied to w{” yields 
8/3 or w) and applied to w{” yields 3/1 or w also as expected. Finally Aitken’s 
transformation applied to w$”, wS and w yields 55/21 or w and applied to 
w°?, wS? and w$ yields 21/8 or w in agreement with equation (11). 


I am pleased to thank the referee for a useful comment. 
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A Cayley Theorem for Boolean Algebras 


STEPHEN L. BLoom! 
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ZOLTAN Esik 
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Ernest G. MANES” 
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The well-known Cayley theorem for groups may be summarized as follows: any 
group is isomorphic to a group of transformations on some set. To make an 
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analogy below, we rephrase this theorem in two parts. Let X be a set and let 
P(X) be the collection of all permutations of X—i.e. each g in P(X) is a bijective 
function 


gix > X. 
We define two operations on PCX ): composition 


(gceh)(x) =g(A(x)); 


and inverse 


g '(x) =y@eg(y) =x. 


CAYLEY THEOREM, Part I. If P(X) is as above and if G is a subset of P(X) 
which contains the identity function of X and is closed under composition and inverse 
then G is a group under these operations. 


Such groups are called transformation groups on X. 


CAYLEY THEOREM, Part II. Jf G is any group, there is some set X such that G is 
isomorphic to a transformation group on X. 


Now let X be a set and let f be a binary function on X. We say that the 
function f is idempotent if 


f(x, x) =x 


for all x € X; f is diagonal (see [4)) if 


f( f(x,y), f(u,v)) = f(x, v) 
for all x, y,u, v € X. If g is a binary function on X, we say that f and g commute 
if 
f(g(x,y), g(u,v)) =8( f(x, 4), fly,v)) 


for all x, y,u, v © X. We define some operations on the set BinCX) of all binary 
functions on X: 


(frAg)(x,y) =f(glxy),y); 
(fV g)(x,y) = fx, g(x, y)); 
f(x,y) = fly, x); 
True(x,y) =X; 
False(x,y) *=y. 


The following two-part theorem constitutes a “Cayley theorem for Boolean alge- 
bras.” 


Part I. (Cf. [1, §3]). Let B be a subset of Bin X ) with the following properties: B 
contains True and is closed under the operations of A, V, and '. Then (B,/, 
V ,’, True, False) is a Boolean algebra providing each function in B is idempotent and 
diagonal and any two such functions commute. 
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We call such Boolean algebras of binary functions on X “guard algebras on X” 
for reasons related to the concerns of [3, Chapter 4] and [2, §5]. (In computer 
science terminology, a ‘‘guard” is a Boolean condition which must hold before the 
procedure it guards can begin. Here we think of a guard P as represented by the 
binary function 7 on the set of functions computed by programs defined by 
n(f,g) =“if P then f else g.” To understand our emphasis of the idempotency 
and diagonality properties, first represent the subset P of X by the partial 
function p:X — X defined by x — x with domain P. If g is another such function 
corresponding to the subset Q, it is easily checked that P and Q are complemen- 
tary if and only if n(f, g) =fep V g°q is idempotent and diagonal.) 


Part Il. If <B, A, V ,’, 1,0) is any Boolean algebra, there is a set X such that B 
is isomorphic to a guard algebra on X. 


The proof of Part I is routine. In order to prove part II, we introduce on any 
Boolean algebra B the binary functions if,, for b € B, defined by 


if,(x,y) =(bAx)V (b' Ay). 


We let the reader verify the following facts, which in ensemble constitute the proof 
of Part II. 


if,(x,x) =x; 

if,(if,(x, ¥), if,(u,v)) = f(x, v); 

if,(if.(x,y), f(u,v)) = fif,(+, 4), f(y, ¥)); 
if (x,y) = True(x,y) =x; 

ifo(x,y) = False(x,y) =y; 

if, (x,y) = if,(y,x) = (f,)'(% y); 

ifpr (xy) =H, Y),Y) = (fy A PCY); 
ify (x, ¥) = if, (x, f(x, y)) = Ch, Vv F(X, )5 
if,(1,0) = b. 


The last equation shows that the map b +> if, is injective; the other equations 
show that each function if, is idempotent and diagonal, that any two jf,, if. 
commute and that the map is a Boolean homomorphism. 


We thank George Bergman for providing a specific example of a 4-element Boolean algebra of 
binary functions which meets all the requirements for a guard algebra save that the nontrivial elements 
are not diagonal. This shows that the converse of Part I does not hold. If the definition of guard algebra 
were strengthened so that all its binary functions were required to be associative then both directions of 
Part I would hold and the corresponding version of Part II would also be true. 
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Rouche’s Theorem for Polynomials 


MICHAEL FILASETA 
Mathematics Department, University of South Carolina, Columbia, SC 29208 


Rouché’s Theorem (cf. Conway [1]) often permits one to determine something 
about the location of the roots or poles of a meromorphic function f(z) by 
comparing f(z) to another meromorphic function whose roots and poles are 
known. In the case of polynomials this is particularly useful to deduce the 
irreducibility of a given polynomial f(z). Thus, for example, in this manner one 
can obtain Perron’s Theorem [4]: if f(z) =z" + a4,_,z”"' + -++ +a, has integer 
coefficients with a, # 0 and ja,_,| > 1+ |a,_,| + ++: +]ao|, then f(z) is irre- 
ducible over the integers. 

The purpose of this paper is to present a proof of Rouché’s Theorem for 
polynomials. We shall take the simplest case (stated below) where the closed curve 
under consideration is the unit circle. It will be clear from the proof that more 
general curves could be used. 

Before proceeding, we note that the proof given here makes use of the 
Fundamental Theorem of Algebra. On the other hand, one can prove the Funda- 
mental Theorem of Algebra using Rouché’s Theorem. Thus, one may view this 
paper as establishing that Rouché’s Theorem for polynomials is equivalent to the 
Fundamental Theorem of Algebra (i.e., that each is easily derivable from the 
other). We add that proofs of the Fundamental Theorem of Algebra which do not 
make use of Rouché’s Theorem can be found in Dickson [2], Grove [3], or Rudin 
[5], and that it is also an easy consequence of Liouville’s Theorem. 


RoucHeé’s THEOREM. Let f(z) and g(z) be polynomials with complex coefficients 
and C = {z: |z| = 1}. If the strict inequality 


f(z) + ez) < [FC + Ie(2)I 
holds at each point on the circle C, then f(z) and g(z) must have the same total 
number of zeros (counting multiplicity ) strictly inside C. 


Proof. Let k be the number of roots of f(z) inside C, and let t be the number 
of roots of g(z) inside C. By the symmetry of the roles of f(z) and g(z) in the 
statement of the Lemma, it suffices to show that k <t. Assume k >t. We will 
obtain a contradiction by showing that there is a ¢ € [0, 277) such that 


| f(e'®) + g(e’*)| =| f(e)| +[8(e)|. 
By the triangle inequality, it suffices to show that there is a ¢ € [0, 277) such that 
arg( f(e'*)) = arg(g(e'*)), where for z © C we take arge(z) € (—7, 7]. For a € C, 
la| #1, define w(a;0) = arg(1 — a) and for 6 € R — {0} wa; 6) = arg(e’” — a) 
+ 2s7 where s = s(@) is an integer chosen so that w(a;@) is continuous on R. 


Geometrically, w(a; 0) — w(a;0) is a continuous representation of the angle with 
vertex at a and rays emanating through 1 and e’’. Thus, 


w(a;2m) — w(a;0) = a HGS inside the circle C (*) 


Let a be the leading coefficient of f(z) and b be the leading coefficient of g(z). 
Let a,,a5,...,a, be the roots of f(z) and B,, B,,..., B,, the roots of g(z) (the 
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roots appearing as many times as their multiplicity). The inequality in the hypothe- 
sis of the theorem implies that each |a,;| # 1 and each |8,| # 1. Define 


F(0) = arg(a) + 3 w(a;;0) + 2ur, 
j=l 


where u is an integer chosen so that F(O) € (—277,0]. Define 


G(6@) = are(b) + ¥ w(B,; 8) + 2u7, 


j=1 


where v is an integer chosen so that G(O) € [F(O), F(O) + 277). Thus, F(@) and 
G(6@) are continuous on R. Also, since 


are( f(e’*)) = are a (e’? — «)| = arg(a) + y arg(e’” 7 a;) (mod 277), 


j=l 
we get that 
F(0) = arg(f(e)) (mod 2m). (**) 
Similarly, 
G(0) = arg(g(e’)) (mod27). (***) 


Furthermore, by (*), 
F(27) =F(0) + 2ka7 and G(27) = G(0) + 2tr. 


Let H(@) = F(@) — G(@). Then H(0) = F(O) — GO) < 0 and H(27) = FQ7) - 
G(27) = 2(k — t)a7 + F(O) — G(O) > 0. By the Intermediate Value Theorem, 
there is a ¢ & [0,277) for which H(¢) = 0. By (**) and (***), we now get that 
are( f(e'®)) = arg(g(e’®)), completing the proof. 
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A Simple Derivation of the Maclaurin Series for Sine 
and Cosine 


DENG Bo 
Coal Exploration No. 174, Team of Gui Zhou, Zhi Jin, Gui Zhou Province, 
People’s Republic of China 


The standard derivations of the Maclaurin series for sine and cosine via 
differentiation are well-known. In this note I show how integration can be used to 
obtain the two series and prove that they converge to the sine and cosine functions. 

Clearly, it suffices to do this for x > 0. Because cos x < 1, ff cos ydy < ff 1 dy. 
That is, sin x < x. From this it follows that {7 sin ydy < fgydy. That is, 1 — cos x 
< x’/2!. Continuing in this fashion, we obtain the inequalities 


1 - a1 <cosx<l 
x? . 
x- 31 <sinx <x 
x* xt x? x? x? 
1-5 an 6 SOS * SIO a 
xe x? x! x? =x? 
*~ 37 sry SNA SX Bt 51. 
Now, still assuming x > 0, induction can be used to prove that S(2r) < cosx < 
S(Qr— 1) and T(r) < sinx < T(@r — 1) for r= 1,2,..., where 
k- x2! k- 2i+1 
S(k) = 1 d T(k)= 1 ’ 
( ) 1+ EC oni an ( ) oe OED! 


The upper and lower bounds differ by x”/m!, which, for fixed x, can easily be 
shown to approach 0 as m > ©, 


An Infinite Product Expansion for the Square Root Function 


ErIc WINGLER 
Department of Mathematical and Computer Sciences, Youngstown State University, 
Youngstown, OH 44555 


Occasionally when we try to solve a problem by one method we discover 
another method that works better than the one we had in mind. Consider the 
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problem of calculating the square root of a number using the Maclaurin series for 
vl+x, 

1 1, 1, ; 

5% — + 16% , (1) 
Since this series has a radius of convergence equal to 1 (see [2, §8.6]), we can only 
use it directly to find square roots of numbers in the interval [0,2]. However, it is 
readily seen that to calculate ¥10, say, all that is needed is to compute y 10/9 
using (1) and then to multiply by 3. But even if we are using this series to compute 
square roots of numbers in the interval [0, 2], the process can be quite lengthy if we 
desire a reasonable degree of accuracy. For example, it takes approximately 10,000 
terms to approximate ¥2 to within 3 x 107’. 

Is there any way to improve convergence? It is in answering this question that 
we discover something remarkable. Obviously the series (1) converges best for 
values of x close to 0. So in calculating /2 we try to express 2 as a product of a 
Square and a number closer to 1 than 2 is. For example, 


a 


This works so well that we are tempted to try this san 


= 978 = - (| F 
/ a7 fa 17 Y 388 


But where do we stop? We don’t! This method works so weil that there is no need 
to return to using power series. 

The trick is to find the appropriate thing to factor out. One way of choosing 
things to factor out is suggested by the =“ 


_2xte 
x4+2 ay da” 


x+2 
2x +2 


Thus if we let a, =x, a,,, = a2/(4a, + 4), and b, = (2a, + 2)/(a, + 2) for 
each integer n > O, then we have 


vVlit¢x = it, 


Since the numbers a,, converge quadratically to 0, this product converges rapidly. 
For example, if we use x = 1, we have |V2 — b,b,b,b3| < 4 X 10-17, 

As you might have guessed from (2), the way to improve convergence is to find 
two functions f and g such that 


vit+x =f(x)y1l+s(x), (3) 


where g is of smaller order than x as x approaches 0. We can find good choices 
for f and g by using (1). Of course, once f is chosen, then g is automatically 
determined by (3). Specifically, 


g(x) = — 1. (4) 


vVl+tix = 
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First, observe that if f(x) — V1 +x is O(x”) as x approaches 0, then 1 + x = 
f(x)? + O(x"), so that g(x) = O(x") by (4). So if the first n terms of the 
Maclaurin series for f match those of V1 + x, then g(x) = O(x”). 

The nicest choices for f seem to be rational functions whose numerator and 
denominator are of the same degree. We list here some specific cases. 


3 


fx) =, (x) = Be (5) 
5x* + 20x + 16 x? 

PO) = a5 qe? 8) = (5x? 4 20x 4 16)" (6) 
7x°> + 56x? + 112x + 64 x! 

f(*) = ~s 352 4 80, wea 80) ~ ay sox? 4 11d b OA) (7) 


These functions are easy to obtain. Simply find the coefficients for two polynomials 
P(x) and Q(x) that will make the first n terms of the Maclaurin series for 
P(x) — Q(x\V1 + x) disappear. In general if deg P = deg Q =k, then n = 2k + 
1. Then f(x) = P(x)/Q(x) and g(x) = mx**t!/P(x)*, where m = 2Q(1)* —- 
P(1)*. The functions f obtained by using this procedure are called the Padé 
approximants of y1 + x. The functions in (5), (6), and (7) are the Padé approxi- 
mants denoted by f, 1, f., and f3 in [3, Chap. XX] (also see [1)). 

For any functions f and g satisfying (3) and such that g(x) = O(x”) with 
n > 2, the method for calculating ¥1 +x is the same. Let a) = x,a,,, = g(a,), 
and b, = f(a,,) for each integer n > 0. Then 


k 
vVl+x = [T1?, Vl t+ ay41- 


n=0 


So for x > 0 


vi+a,,,-1)< yi +X, 


since 


k a 
IT, < vi +x and /1+4,,, -1< - 


It should be pointed out that all of the functions g in (5), (6), and (7) are 
contractive; that is, |g(x)| < a|x| for some a < 1 and all x > 0. For example, using 
the function in (5), we have for x > 0 


3 
|x” [x|" || 


= — —~ <-> F= . 
BOO By ap Sox 9 


Therefore, the sequence (a, ) will converge rapidly to 0. In fact, if (7) is used with 
x = 1, then we get 


V2 — byb,b,| <5 x 107. 
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Finally, we remark that this procedure can be used for other roots as well; for 
example, 


2x> +x? 


2x +3 
me + ———- . 
(2x + 3) 


x+3 
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Some Models of Plane Geometries* 
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Undergraduate courses on the foundations of geometry need examples different 
from the traditional Euclidean plane in order to give students experience in 
dealing with axiomatic systems. In most texts, such examples are given in the 
discussion of the axioms of incidence; they consist, almost invariably, of finite 
geometries or related systems (see, e.g., [3]). In our experience, such examples have 
very little value for most of the mathematically rather unsophisticated students 
that take the geometry-for-teachers courses. Moreover, as soon as one reaches the 
axioms of order (in any of their embodiments), these examples have to be 
abandoned and there are very few others to take their place. (The text [6] by 
Millman and Parker is a notable exception to the tendency of presenting only a 
very small collection of models. Gans [2] also describes a model rarely mentioned 
by other writers; it is the topic of a recent paper [5] in the Monruty. For other 
discussions of this and related models see [4, p. 88] and the references given there.) 
The aim of the present note is to provide several models the first author has used 
in such courses. These models seem to lead to better understanding of axiomatics 
by the students—probably because they are, in a psychological sense, more like 
real geometry than the usual examples. (This experience seems to be in agreement 
with the findings of Nisbett et al. [7].) It is hoped that the models may provide 
some amusement and some surprises for the teacher as well. The exploration of 
the models can be particularly satisfying using a Macintosh computer and such 
software as MacDraw. 

All the models are set in the usual Euclidean plane E?, the “ordinary plane” of 
the student’s experience; the points of each of our geometries coincide with the 
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points of E*. If a more formal setting is desired then, just as with the Euclidean 
geometry, the points P can be taken as ordered pairs P = (x, y) of real numbers. 
The geometries we discuss differ from each other by their sets of lines. 


Geometry G1. In this geometry a “line” is any one of the following objects 
(which visually have quite different shapes; see Figure 1). Let U be a fixed 
(Euclidean) parabola, the axis of which we shall assume to be vertical; in the 
analytic formulation, we can take U as the set given by U = {(x, y): y = x*}. Then 
the lines of G1 are: 

(i) all the translates of U: 

(ii) all the vertical Euclidean lines (given by {(x, y): x = c}). 


Fic. 1. Examples of lines in the geometry G1. 


Geometry G2. Here a “line” is any of the following objects (see FiGuRE 2): 
(i) a vertical Euclidean line: 
Gi) a horizontal Euclidean line; 
(iii) a translate of the first-quadrant branch L of a fixed equilateral hyperbola 
with vertical and horizontal asymptotes; analytically, this can be taken as 
L = {(x,y):x>0,y > 0, xy = 1}; 
(iv) a translate of the reflection L* of L in the y-axis, that is, 


* = {(x,y): x <0,y > 0,xy = - 1}. 


Fic. 2. Examples of lines in the geometry G2. 
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Geometry G3. Here a “line” is any of the following objects (see FiGure 3): 
(i) a horizontal Euclidean line; 
(ii) a translate of the branch L to the left of the Y-axis of a hyperbola with 
asymptotes at +60° to the x-axis; analytically, this can be taken as 


L = {(x,y): x <0,3x7 —y? = 4}; 


(iii) any object obtained by rotating one of those in (i) or (ii) through 120° or 
240°. 


Fic. 3. Examples of lines in the geometry G3. 


To study these geometries, two ways are open; it is best to use them both. On 
the one hand, practical experience concerning the meaning of the axioms, and 
whether they are satisfied or not, can be obtained by experimentation in drawing. 
“Rulers” for the “lines” are quite easily prepared; if a photocopying machine that 
accepts cardboard is available, the teacher can provide such “‘rulers” (ready to be 
cut out) to the students. Otherwise copies on paper should be pasted by students 
onto cardboard, and then cut. To assure that only translates of a fixed parabola or 
hyperbola are used, it is convenient to draw on ruled (or engineering) paper, and 
make a slit in the template, which enables one to check alignment. On the other 
hand, one can use the equations to proceed in an algebraic way to verify the 
various claims, and to settle disputes concerning the drawing (which inevitably 
arise due to the limited precision of the practical procedures). Clearly, each 
“parabolic” line L in G1 will have an equation of the type y = (x — a)* +b, 
where (a, b) is the apex of L. Similarly, the “hyperbolic” lines of G2 will have 
equations of the type (x — a)(y — b) = +1, with suitable inequalities for x and y; 
the appropriate relations for G3 are also easily found. 

Naturally, for a model of a geometry one needs not only points and lines, but 
incidences, order (or betweenness), and so on. In class these can be either imposed 
by the teacher or “discovered” by the students. The advantage of the models under 
discussion over the usual ones is that even the study of axioms of incidence and 
order provides meaningful experiences, as do questions on parallelism and related 
topics. Experimenting first, and verifying later (by computation or in some other 
way), will show that the three models satisfy all the usual axioms on incidence and 
order. Also, by defining the distance between two points as being equal to the 
distance in the Euclidean sense along the “segment” determined by the points, it 
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is easily seen that the congruence and continuity axioms are also satisfied, with the 
exception of those that deal with triangles. As far as parallels are concerned, G1 
behaves like the Euclidean plane (in that for each line and each point there is a 
unique parallel to the line through that point) while G2 and G3 do not. An 
interesting difference between G2 and G3 is that in the former there are lines 
which for some points have unique parallels; this does not happen in G3, which is 
therefore a “hyperbolic” model. From an axiomatic point of view, model G2 is 
interesting since in it, in contrast to the situation usually considered in axiomatic 
treatments, the failure of the Playfair axiom for one line and one point does not 
imply its failure for all such pairs; thus G2 shows that the theorem about the 
uniform behavior of line-point pairs does not have to hold if one of the congruence 
axioms is not assumed. 

With that much established, one can investigate such topics as the numbers of 
regions, segments, and intersection points of finite families of lines every two of 
which intersect, but no three have a common point, or of more general families. Or 
one can investigate whether such results as the theorems of Pappus or Desargues 
(or suitable special cases, which may be easier to formulate and to check) are valid; 
see FIGURE 4. 


Fic. 4. Illustration of the theorem of Pappus in the geometry Gil. The vertices of the hexagon 
ABCDEF alternate between two lines, hence the intersection points P,Q, R of pairs of opposite sides 
of the hexagon are collinear. 


An ambitious teacher may define the midpoint of a segment in G1 (as the usual 
midpoint for vertical segments, and as the point of the parabola at which the 
tangent is parallel to the chord determined by the endpoints of the segment for 
segments that are parabolic arcs) and prove (or have students prove) the concur- 
rence of the medians of a triangle (see FiGuRE 5), and other simple facts. An even 
more ambitious project is to verify that in G1 a distance function (different from 
the one mentioned above) can be defined between pairs of points P, = (x, y,) 
and P, = (x, y2) by the (somewhat intimidating) formula 


1/2 
d(P,, P,) = ((x, —x,)° + (y, —y,—x? +x3)') 


Then it is possible to establish that with this distance function the geometry Gl 
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Fic. 5. In geometry G1, the medians AD, BE, and CF of a triangle ABC meet at the centroid G. 


satisfies all the axioms of congruence and of continuity—in short, that G1 is a 
model of the Euclidean geometry! 

A simpler way to see that G1 is just an unusual version of the usual Euclidean 
plane is to consider the mapping ¢ of the plane of G1 to the Euclidean plane E? 
given by f(x, y) = (x, y — x”). It is clear that # is a continuous bijection, and that 
it maps the lines of G1 onto lines of the Euclidean plane, and vice versa. Thus G1, 
as we have defined it, is just a line-preserving homeomorphic image of the 
Euclidean plane—but this does not have to be told to the students until an 
appropriate later time. 

Since in G2 some point-line pairs behave as in the Euclidean plane and some as 
in the hyperbolic plane, this geometry is not a homeomorphic line-preserving 
image of any of the standard geometries. As mentioned earlier, the geometry G3 
behaves as expected from the hyperbolic plane, and it is natural to ask whether it 
can be metrized in such a way as to be homeomorphic to the usual hyperbolic 
plane (under a line-preserving mapping). There seem to be no direct methods 
available at present to decide such metrization problems, or of finding a metric if it 
exists. However, an indirect proof for the essential difference between G3 and the 
hyperbolic plane (and thus the nonexistence of a line-preserving homeomorphism 
between them) follows from the observation made by Walter Taylor that in G3 
some angles are not “angles of parallelism.” Indeed, there is no common parallel 
to the positive rays of y = ¥3x and y = — y3x, in contrast to the situation 
regarding any two non-opposite rays in the hyperbolic plane. 

As an additional project one could consider geometries analogous to those 
described above, but with other suitable curves substituted for the parabolas or 
hyperbolas. As examples one could use y = cosh x instead of the parabola in G1, 
or translates of y = e* and y =e ~* instead of the branches of the hyperbolas in 
G2. In fact, only certain mild monotonicity conditions have to be imposed on the 
curves used to obtain geometries satisfying the axioms of incidence, order and 
continuity, as well as most of the axioms of congruence. Moreover, there is no 
need for the curves in different sectors to be congruent; they can be chosen with a 
large degree of freedom. Another way of extending the family of models is to take 
n equi-inclined (Euclidean) straight lines through the origin (with n > 2), and a 
hyperbola H (or other suitable curve) having as asymptotes two rays (determined 
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by these lines) which enclose an angle of (n — 1)a/n. The lines of the geometry 
Gn are translates of the nm lines we started with, as well as translates of H and of 
the hyperbolas obtained from it by certain rotations. For odd a, it is most 
convenient to choose the n rotations of H through multiples of 277 /n. For even n, 
the rotations through ka/n with 0 <k <n-— 1 can be taken. (For n = 2 or 3 
these definitions agree with the previous ones.) Other sets of rotations can also be 
used, but tend to lead to messy case distinctions. 

In contrast to this, we shall prove that G1 is in a certain sense unique. To 
formulate our result precisely, let . denote the family of all straight lines in the 
Cartesian plane R’, and let 4 denote the family consisting of all vertical lines in 
R* and all curves that are translation-congruent to one given by the equation 
y = p(x), where p is a function defined for all x © R. We say that 6 is an 
isomorphism from ._/ to .4 if 6 is a bijection of R* onto itself which induces a 
bijection of .“ onto .Z. 


THEOREM. [If there exists a continuous isomorphism 0 from £ to 4 and p(x) is 
differentiable for all x, then p is a polynomial of degree two. 


We do not know whether the assumptions of differentiability of p and continu- 
ity of 6 are essential.* (We know that measurability or the property of Baire of 6 
would suffice.) The following is a well-known result on collineations (see, e.g., [1, 
p. 156] for a proof of the projective geometry formulation; the result is often called 
the “Fundamental Theorem of Projective Geometry”). It suggests that such 
assumptions about p or @ could be superfluous. 


Lemma. Every bijection @ of R* onto itself which induces a bijection of & onto 
itself is affine linear, that is, 


o(x,y) = (ax + by +u,cx + dy + v), 


where ad — bc # 0. 


ProoF (in outline). Let % = ao, where a is affine linear. We can choose a 
such that w fixes three non-collinear points po, p,, p». Interpreting the straight 
lines pp; (i = 1,2) in the natural way as copies of the field of real numbers, with 
Py = 0 and p; = 1, we can verify by means of familiar constructions that the 
restriction of w to pyp; is an automorphism of this field. Since the field of real 
numbers has no automorphisms different from the identity, it follows that %(p) = p 
for all p € pop,. Then it is easy to check that #(p) =p for all p € R*. Hence 
~ = a! which proves the Lemma. Oo 


Proof of the Theorem. Let 6: R* > R* be a continuous isomorphism from ./ to 
. Then for any affine linear automorphism a of -2, the composition k = 0° a is 
still a continuous isomorphism from ./ to 4. Since the family of vertical straight 
lines is a maximal family of parallel lines both in -_# and in -4, we can choose a 
such that x maps this family onto itself. Hence « is of the form k(x, y) = 
(f(x), g(x, y)), where f and g are continuous. Refining our choice of a@ we can 
assume that f(0) = 0. 


*V. Faber and J. Mycielski have shown that the continuity of p and @ suffices [to appear]. 
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Now consider the one-parameter group of automorphisms of .4 given by 
T(x, y) =(x +t, y). Then the maps 7/ = x~'o7,°x constitute a one-parameter 
group of automorphisms of .2. By the lemma, all 7/ are affine linear, and 
moreover each 7/ preserves the family of vertical lines but does not fix any of them 
for t # 0. Hence 

T/(x,y¥) = (x + A(t), a(t)x + b(t)y + c(t)), (*) 
where h, a, b and c are some functions and h(s + t) = h(s) + A(t) for all s,t, € R. 
Since ko T/ = T,0K, we have f(x + A(t)) = f(x) + t. From f(0) = 0, putting x = 0, 
we get fc A(t) = ¢ for all t. Hence h = f~! and h is continuous. By the theorem of 
Cauchy about the linearity of continuous additive functions it follows that h(t) = 
at for some a # 0. (At this point it would have been sufficient to know that h is 
measurable, or has the property of Baire.) Hence f(x) = x/a. Refining again our 
choice of a we can assume that a = 1. So we get k(x, y) = (x, g(x, y)). 

Now consider the one-parameter group of automorphisms of .“ and of 4 
given by o,(x, y) = (x, y +t). This is the only one-parameter group of automor- 
phisms of -# (and of -4) which preserve every vertical line and acts without fixed 
points for t 0. Hence the map oa, ~ o/ = k '°o,°x must be an automorphism 
of this group. Therefore o,/(x, y) = (x,y + k(t)), where k is continuous 
and additive. So again k(t) = at, where a # 0. Since koa,’ =0,°k, we have 
(x,y + k(t)) = g(x,y) +t. Hence g(x, y) = g(x,0) + y/a. Refining again our 
choice of a we can assume that a = 1 and then, by the definition of -Z, the graph 
of g(x,0) must be translation-congruent to the graph of p(x). It follows that 
g(x, y) = q(x) + y, where q(x) = cy + p(x + c,) and cy and c, are suitable con- 
stants. 

Hence we have x(x, y) = (x, y + q(x)) and, returning to the transformations 7/ 
defined above, we get 


TIi(x,y) =k 'e7,cK(x,y) 


Kk lor (x,y +q(x)) 
Kk '(x+t,y +q(x)) 
(x+t,y+q(x)-—q(x+t)). 


Comparing with (*) we get b(t) = 1 and 
q(x) —q(x+t) =a(t)x+ c(t). 
Then putting x = 0 we get q(0) — g(t) = c(t). Hence, for x = 1, 
a(t) =q(1) — 4(1 + t) — (0) + a(¢), 
and 
q(x) — q(x +t) = (q(1) — a1 +t) — g(0) + a(t)) x + 4(0) — a(t). 
Differentiating relative to t we get 
~G(x +t) = (g(t) —g(1+1t))x—- q(t), 
and, for ¢ = 0, 
q(x) = (4(1) — 4(0))x + q(0). 
Integrating in x we see that q(x) is a polynomial of degree at most 2, and the 
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same is true for p. Since a linear function p does not satisfy the assumptions of the 
Theorem, it follows that p is a polynomial of degree 2. oO 
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ELEMENTARY PROBLEMS 


E 3403. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


It is well known that the maximum size of a subset of {1,2,..., m} containing no 
pair i,j with i|j is ((m + 1)/2]. Prove that the maximum size of a subset of 
{1,2,...,} containing no pair i, j with i|2j is 4n/9 + O(log n) for large n. 


E 3404. Proposed by the editors. (A modification of a problem proposed by the late 
Reuven Gurevic.) 
Suppose f is a function from R to R such that Lf(a,) is convergent whenever 


La, is a convergent series of real terms. Prove that f is differentiable at the origin. 


E 3405. Proposed by Charles Vanden Eynden, Illinois State University, 
Normal, IL. 


Suppose m,n are integers with 1 <m <n. Let S, be the thick-walled box 
obtained by removing the cube 


{(x,y,z):1<x,y,z<n-—1} 
from the cube 
{(x,y,z):0<x,y,z <n}. 


For which pairs of integers m,n can S, be constructed with m x 1 xX 1 blocks? 
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E 3406. Proposed by Jeffrey Shallit, Dartmouth College, Hanover, NH. 


Consider three circles in the plane that intersect to form seven bounded 
regions. In each region there is a token that is white on one side and black on the 
other. At any stage the following two operations are permissible: (a) we can invert 
(flip over) all four tokens inside one of the three circles, or (b) we can invert those 
tokens showing black inside one of-.the three circles so that afterwards all tokens in 
that circle show white. From the starting configuration in which all tokens show 
white, can we reach the configuration in which all tokens show white except that 
the central region common to the three discs shows black? 


E 3407. Proposed by Clark Kimberling, University of Evansville, Evansville, IN. 


Suppose ABC is a given triangle. Prove the existence of a triangle that is in 
perspective with every antipedal triangle of ABC. (If P is a point not collinear 
with any two of the the points A, B, C, the lines through A, B, C perpendicular to 
PA, PB, PC respectively form a triangle called the antipedal triangle of P with 
respect to ABC. Two triangles A,B,C, and A,B,C, are said to be in perspective if 
the lines A,A,, B,B,, C,C, are concurrent.) 


E 3408. Proposed by Juan V. Savall and Jesus Ferrer, Oliva, Valencia, Spain 


For each positive integer k let f(k) denote the number of triangles with sides of 
integer length whose area is k times the perimeter. It is well-known (cf. E2420 
[1973, 691; 1974, 662]) that f(1) = 5. Obtain an upper bound for f(x) in terms 
of k. 

The analogous problem for right triangles appeared as Problem 1447 in Crux 
Mathematicorum, 15 (1989) 148. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Conncected Graphs with Every Edge in a Triangle 


E 3255 [1988, 259]. Proposed by Paul Erdés, Hungarian Academy of Sciences. 


What is the minimum number of edges in a connected n-vertex graph such that 
every edge belongs to a triangle? 


Solution by David Callan, University of Bridgeport, Bridgeport, CT. Say that G isa 
T-graph if it is connected and each edge belongs to a triangle. Let G = (V, E) be 
an n-vertex T-graph with the minimum number a, of edges, and let b, = 
|(3n — 2) /2| = [3(n — 1)/2]. Note that a, = b, = 3. If n is odd, then (n — 1)/2 
triangles sharing a common vertex form a 7-graph with b,, edges. If n is even, then 
a similar structure with (n — 2)/2 triangles together with an additional vertex 
adjacent to the endpoints of one edge of this forms a 7-graph with 5, edges. 
Hence a, < b,. 

To see the opposite inequality, consider an n-vertex T-graph G with a,, edges. 
Since a, < 3n/2, the sum of degrees in G is less than 3n, so G must have a vertex 
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v, of degree 2. Let v,,v, be the neighbors of v,. Since each edge belongs to a 
triangle, v,v, € E(G). If n > 3, form a smaller T-graph G’ = (V’, E’) as follows. 
Delete v, from G, and if v,,v,; have no common neighbor in G other than v,, 
then contract v,v,;. Every edge of G’ belongs to a triangle that contained it in G. 
The change reduces the number of vertices by 1 or 2 and reduces the number of 
edges by at least 3/2 times the reduction in the number of vertices. Repeating this 
process leads eventually to a 3-vertex graph, which must be a triangle. It follows 
that a, > 3 4+ 3(n — 3)/2 = 3(n — 1)/2. Since a, is an integer, we conclude 
a, 2b 


Editorial comment. F. Galvin points out that |((3n — 2)/2] is also the minimum 
number of edges in an n-vertex graph satisfying the stronger condition that every 
pair of distinct vertices has a common neighbor (see Ex. 2.4 of Béla Bollobas, 
Combinatorics, Cambridge University Press, 1986, p. 9). He and J. T. Ward 
(independently) also generalized the problem, replacing the word “triangle” in the 
problem statement by the word “m-gon’”. More generally, we can consider the 
question with “triangle” replaced by an arbitrary graph 7 with m edges and r 
vertices and ask whether the formula m(n — 1)/(r — 1) is the correct answer when 
n = 1modr — 1. This is not true in general if J has a vertex of degree less than 
m/(r — 1). In this case one does better by appending  — r vertices having the 
minimum degree to a single copy of T. However, if T is a regular graph, then we 
ask whether edge-disjoint copies of 7 sharing a single vertex always provide an 
optimal construction. S. F. Barger proved that this is true when 7 is the complete 
graph on r vertices, and more generally gave the complete solution for all 
congruence classes of n mod r — 1 in that case. 


Solved also by S. F. Barger, P. A. Catlin & A. M. Hobbs, C. Coleman, J. Davis, W. Fenton, 
K. Fogarty, F. Galvin, D. C. Kay, O. P. Lossers (The Netherlands), D. P. Mehendale & M. R. Modak 
(India), L. Nykaky (France), A. Palmer (student), A. Pedersen (Denmark), R. G. Powers & K. D. 
Wallace, R. B. Richter, A. Riese, E. Schmeichel, M. Shimure, J. H. Steelman, J. T. Ward, K. Yamaoka 
& A. Riskin, and the proposer. Several incorrect solutions were received. 


The Three Centroids of a Triangle 


E 3283 [1988, 762]. Proposed by Orrin Frink, Pennsylvania State University, State 
College, PA. 


Every simple closed polygon in the plane has three centroids, namely the 
centroid of its vertex set, the centroid of its boundary, and the centroid of its 
interior. In general all three are distinct. 

(a) In the case of a triangle, show that these three centroids coincide if and only 
if the triangle is equilateral. 

(b) Which of the three centroids are affine invariants? 


Solution by Allan Pedersen, S@borg, Denmark. (a) It is known and easy to prove 
that the interior centroid C, of any triangle coincides with its vertex centroid C,, 
and that this is the intersection of the medians. The boundary centroid C, of a 
triangle ABC is the barycenter of mass distributed along its sides with constant, 
say unit, density. It is a folk theorem that, given a mass distribution and a finite 
partition of this, the barycenter is the same as that obtained after having concen- 
trated the masses of the parts at the various barycenters. Hence C, is the 
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barycenter of a mass a at the midpoint A’ of BC, a mass b at the midpoint B’ of 
AC, and a mass c at the midpoint C’ of AB. C, is on AA’ if and only if the 
barycenter of the masses at B’ and C” is at the midpoint of B’C’, ie., precisely 
when b = c. Thus C, is on all the medians if and only if a =b =c. 

(b) Let P be a simple closed polygon. Denote by p*, ut, ut the uniform 
distributions of a unit mass on the vertices, the boundary, and the interior, 
respectively. The corresponding barycenters are the centroids C?,C?,C?. For 
x € {v, b, i}, C® is an affine invariant if and only if f(C?) = C/™ for any P and 
any affine transformation f. By linearity of f, f(C*) is the barycenter of the mass 
distribution f(u®) on f(P) transferred from P by means of f. Clearly f(u®) = pl. 
Also, since the Jacobian of f is constant, f(u!) = uf. Accordingly, CP and CP 
are affine invariants. 

However, f(y‘) is supported on the sides of f(P), uniformly distributed on each 
side, but in general with different densities, since ratios of sides are not affine 
invariants. Therefore, it is not to be expected that C} is an affine invariant. For an 
explicit example, choose P to be an equilateral triangle, and choose f so that f(P) 
is not equilateral. Then CP = CP and f(C?) = C!, but (a) implies C{® # C/™, 


Editorial comment. As shown by O. P. Lossers, the centroid of the boundary of 
a triangle is the center of the inscribed circle of the triangle with vertices at the 
midpoints of the sides. A proof is given in R. A. Johnson, Modern Geometry, 
(Houghton-Mifflin Co., 1929), Theorem 412. Johnson calls the inscribed circle of 
the midpoint triangle the “Spieker circle’. 


Solved also by B. Kirchheim (Czechoslovakia), O. P. Lossers (The Netherlands), J.-M. Monier 
(France), and A. Nijenhuis. One incorrect solution was received. 


Divisibility of Multinomial Coefficients 


E 3313 [1989, 253]. Proposed by Amir Daneshgar (student), Sharif University of 
Technology, Teheran, Iran. 


Prove that the number of terms in the expansion of (x, +x, + °-: +x,,)” 
whose coefficients are not divisible by a given prime number p is 


k 
a,+m—1 
m1) 


m-— 1 


where 
k 
n=  a;p', O0<a,<p, 
j=0 


is the p-adic expansion of the positive integer n. 


Solution by Y. H. Harris Kwong, SUNY College at Fredonia, Fredonia, NY. Let 
f(r) denote the number of terms in the expansion of (x, + x, + +--+ +x,,)’ whose 
coefficients are not divisible by p. For 0 <r < p, since none of the terms in the 
expansion of (x, +x, + -°-: +x,,)” has coefficient divisible by p, f(r) is precisely 
the total number of terms in the expansion. Thus f(r) equals the number of 
combinations of {x,, x5,..., x,,} taken r at a time, with repetitions allowed. This is 
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knowri to be (oa r ‘), Now from 


k | 
(x, $x) 407+ $x)" = TT (ey +49 + 01 +x,” 


. 1\ &@, 
+xf +++ +xP) (mod p), 


Il 
> 
—~ 

be 

om) 


it is evident that 


i=0 


ume 


k k 
rin) = THe) = (“2 
i=0 mn 


Editorial comment. As pointed out by several solvers, this result can be found in 
the paper of F. T. Howard, “The number of multinomial coefficients divisible by a 
fixed power of a prime,” Pacific J. Math. 50 (1974), 99-108 and in the solution by 
David Singmaster of Problem 1255, Mathematics Magazine 61 (1988) 53. 


Solved also by R. Betts (student), J. C. Binz (Switzerland), D. Callan, R. J. Chapman (United 
Kingdom), B. de Weger (The Netherlands), F. Dodd, F. T. Howard, J. Heuver (Canada), H. Ki (Korea), 
H. K. Krishnapriyan, E. Levine, O. P. Lossers (The Netherlands), J. H. Nieto (Venezuela), 
A. Nijenhuis, A. Pedersen (Denmark), J. H. Steelman, C. Wildhagen (The Netherlands), D. Zeilberger, 
A. Zulauf (New Zealand), the Stanford Ordinal, and the proposer. 


An Inequality for the Brocard Angle 


E 3314 [1988, 253]. Proposed by Antal Bege, Miercurea Ciuc, Romania. 


Let M be a point inside an acute-angled triangle ABC, and put A, = 4 MAC, 
B, = 4 MBA, C, = & MCB. Prove that 
cot A, + cot B, + cot C, > 2°/73!/4(cot A + cot B + cot C)'”’. 
Solution by Marcin E. Kuczma, University of Warsaw, Warsaw, Poland. We prove 
that 
cot A, + cot B, + cot C, > 3°/4(cot A + cot B + cot C)'”’, (1) 


which implies the desired inequality and holds with equality when the triangle is 


equilateral and M is its center. 
We have MA/MB = sin(B,)/sin(A — A,) and analogously for MB/MC and 


MC /MA, which yields 
sin( A — A,)sin( B — B,)sin(C — C,) 

sin( A, )sin( B,)sin(C,) 7 
(This is a form of Ceva’s Theorem.) Using a, b,c and x, y, z for the cotangents of 
A,B,C and A,, B,,C,, respectively, the well-known identity tan A + tan B + 
tan C = tan Atan B tan C becomes 

a~'+b-+4+¢7! = (abc). (3) 
Furthermore, we have sin(A — A,)/sin A, = sin A(cot A, — cot A) = 
(1 + a?)~!/*(x ~ a), and similarly for b,c, which transforms (2) into 


(x -a)(y —b)(z-c) =[ 4+ a2) 4+b2)(11 + c2)]'”. (4) 


(2) 
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This reduces the problem to minimizing the quantity 
OQ=(xty+z)/(atbt+c)'” 

subject to the equalities (3), (4) and the inequalities 0 <a <x,0<b<~y, and 

0<c<z that hold because the triangle is acute-angled. By the Arithmetic- 


Geometric Mean Inequality, we have (x —a)+(y— 6) +(z-c) > 3[(x —- a) 
(y — b\(z — c)]'”°, which with (4) implies 


xtytz>at+btc+3[(lta2)(1+b2)(14+02)]”. (5) 


Note that the Mean Inequality implies that for a fixed triangle, the sum x + y + z 
attains its minimum only when M is chosen so that x --a=y—b=z-—c, in 
which case the common value is [(1 + a7)(1 + 67). + c7)}'”®. 

Using s =a +6+c and q = abc, we can multiply (3) by g and square it to 
rewrite it as a*b? + b*c* + c*a* = 1 — 2qs. Further (3) implies a? + b* +c? = 
(a +b+c)* — 2ab — 2be — 2ca =s* — 2, and together these two equations 
imply 

(1+ a?)(1 + b?)(1 +c?) =14 (s?— 2) + (1 -2qs) +q?=(s-—q)’*. (6) 

Now (3) and Cauchy’s inequality give s/q =(a+b+c\a7'+b !+c7') 39, 
which implies s — q > 8s/9, with equality if and only if a = b = c. Together with 
(5), (6), this produces the following inequality: 

Q> s—'/2(s 4 3(s _ q)'’”) > s'/(s 4 3(85/9)'/°) = 5!/2 49. 31/3,-1/6 
Let o(s) = 5! 42-3157 17°, 

Using a? + b* + c* = s* — 2 and Cauchy’s inequality, we have 

s=atb+c<3'7(a@+b* 4+ czy” = (35? — 6)'”, 
which implies s > 3'/*. Routine computation by calculus shows that @ increases in 
[3'/7, 0). Thus, finally, OQ > o(3'/*) = 3°/*, which implies (1). All inequalities 
convert into equalities for the equilateral triangle and its center, which is the only 
case where they do so. 


Editorial comment. If w is the Brocard angle of ABC (see R. A. Johnson, 
Modern Geometry, Houghton-Mifflin (New York, 1929)), then cot w = cot A + 
cot B + cot C, and the improved inequality (1) is equivalent to cot A, + cot B, + 
cot C, > 3(V¥3 cot w)!/”. 


The improved inequality (1) was obtained also by Jean Anglesio (France), Eugene Levine, and 
Harvey Lipman. Levine remarked that in fact (1) holds for all triangles, not just for acute-angled 
triangles. Walther Janous (Austria) proved the original assertion of the problem by showing that 


cot A, + cot B, + cotC, > cot A + cot B + cot C + 2y3 
> 23/231/4(cot_A + cot B + cot C)'”’. 


One incorrect solution was received. 
The Evolute of an Ellipse 


E 3316 [1989, 254]. Proposed by Jim Delany, California Polytechnic State Univer- 
sity, San Luis Obispo, CA. 


Consider the fluctuations in distance between an observer located at (h, k) and 
an object orbiting along the ellipse x*/a* + y*/b* = 1, where a > b > 0. If the 
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observer is at or near the origin, then on each orbit the distance will have two 
relative maxima and two relative minima. If the observer is far from the origin, 
there will be only one of each. Determine the equation of the curve that marks the 
boundary between the two cases. 


Solution by K. F. Andersen, University of Alberta, Edmonton, Alberta. The 
desired equation is (ah)*/? + (bk)?/? = (a? — b*)’/*. Let f(t) = (a cost — h)* + 
(bsint —k)* represent the square of the distance from (h,k) to the point 
(a cos t,b sin t) on the ellipse. By Rolle’s Theorem, there is a zero of f” between 
each pair of consecutive zeros of f’. Thus the boundary curve is characterized by 
the requirement that there exists a value of ¢ such that f(t) = f’(t) = 0, that is, 


(b* — a*)sint cost + ahsint — bk cost = 0, 


(b* — a*)(cos* t — sin? t) + ahcost + bk sint = 0. 


Multiplying the first by sin ¢, the second by cos t, and adding yields cos? t = 
ah /(a* — b*). Multiplying the first by cos t, the second by sin ¢, and subtracting 
yields sin? t = —bk/(a? — b*). We combine these using the identity sin? ¢ + 
cos* ¢ = 1 to obtain the equation claimed. 


Editorial comment. Many readers noted that this equation describes a concave 
four-pointed star, known as an astroid. The astroid is well-known to be the evolute 
of the ellipse, defined to be the locus of the centers of its inscribed circles of 
curvature. The astroid is also the locus of points at which three normals to the 
curve meet (except for the cusps), separating the region of the plane where four 
normals meet from that where only two meet. Finally, the astroid is also known to 
be the envelope of the normals to the ellipse. G. D. Chakerian noted that this is a 
very old problem, essentially solved by Appolonius of Perga in his first book on 
Conics. 

Solved also by R. Bagby, H. Carus, G. DuChamp & J.-Y. Thibon (France), J. Duemmel, A. Guetter, 
H. Guggenheimer, D. Halprin (Australia), M. E. Kuczma (Poland), K.-W. Lau (Hong Kong), 
H. Lipman, O. P. Lossers (Netherlands), L. F. S. Martins, J. McHugh, R. B. Nelsen, A. Nijenhuis, 


T. S. Norfolk, R. Parris, A. J. Schwenk, J. V. Wales Jr., Western Maryland College Problems Group, 
and the proposer. 


Dini Rides Again 


E 3318 [1989, 254]. Proposed by Walter Rudin, University of Wisconsin, Madison. 


Suppose 0 < p < ~. Let A be the set of sequences of positive real numbers 
@,,45,... such that L*_,a, =1. Find sup, X%_,a,(a, +a,,,+°°:)? ' and 
sup, Lp_14,(a, ta,+ ++: +a,)?71. 

Solution by Albert Nijenhuis, Seattle, WA. Both suprema are 1 when p > 1 but 


1/p when 0 <p <1. 
Instead of determining the suprema of the infinite series 


yy) a,(a, foises +a,)?* and a,(a,, +@,.,+ °°: yr! 


n=1 n=1 
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we observe that these are the same as the suprema of the finite sums of the form 


N N 

Yi a,(a,+-::+a,)? ' and Yaa, +--+ tay)? ', 

n=1 n=1 
with unbounded N, a, > 0 and L*_,a, = 1. 

The first sum is a “right endpoint” Riemann sum for {jx?~' dx. If p > 1 then 

x +» x?~! is increasing in [0,1], and the sum is an upper estimate for the integral. 
The largest “right endpoint” sum consists of one term (N = 1), 1?~! - 1, yielding 
the value 1. Of course, this can be arbitrarily closely approximated by an infinite 
series with a, slightly less than 1, and the other positive a, making up the balance. 
If 0<p <1, then x ~x?7! is decreasing (non-increasing), and the “right end- 
point” Riemann sum is a lower estimate of {jx?~' dx = 1/p, so that 1/p is the 


supremum. 
The second sum, which can be written as 
N 
L a,{ _ (ay rote ta,_,)}?, 
n= 


is a “left endpoint” Riemann sum for the integral [j(1 — x)?~' dx. When p > 1, 
x + (1 —x)?~' is a decreasing function, and the sum is an overestimate. Again, 
the one term sum (1 — 0)?" '- 1 =1 is the supremum. If 0 < p < 1, the sum is 
again a lower estimate, and the supremum equals the integral, i.e., 1 /p. 


Editorial comment. David Callan noted that the problem is essentially the same 
as E 2996 [1983, 334; 1986, 303]. According to §39 of Knopp’s Infinite Series, 
Blackie, 1928, the assertion of the problem essentially goes back to Dini. 


Solved also by K. F. Andersen (Canada), L. Baribeau (Canada), D. Borwein (Canada), D. Callan, 
G. Duchamp (France), M. Falkowitz (Israel), J. Gaisser, E. Hertz, H. Ki (Korea), M. E. Kuczma 
(Poland), K.-W. Lau (Hong Kong), O. P. Lossers (The Netherlands), K. Schilling, M. Stieglitz and 
H. Tietz (West Germany), Western Maryland College Problem Group, and the proposer. 


Rationally Independent Subsets of the Reals 


E 3319 [1989, 356]. Proposed by Paul Erdés, Hungarian Academy of Science, and 
Janos Surdnyi, Eotuos Lordnd University, Budapest, Hungary. 


Suppose S is a set of 2n + 1 irrational real numbers. Prove that S has a subset 
T of cardinality n + 1 such that no nonempty subset of T has a rational sum. 


Solution I by Ilias G. Kastanas, California State University, Los Angeles. Suppose 
S = {a@, @),.-.,@>,}. Let 1, B,, B5,...,B8,, be a basis for the finite-dimensional 
vector space V over @ generated by 1, ay, a),..., @2,. (Here Q denotes the field of 
rational numbers.) Let p(a;) be the projection of a; on the subspace of V 
complementary to 1, ie., let a; = 7; + p(a;), where r; is rational and p(a;) is a 
linear combination of B,,...,8,, With rational coefficients not all zero (i = 
0,1,...,2n). Then any set of a; has a rational sum if and only if the corresponding 
set of p(a;) has sum zero. Hence it suffices to show that there exists a set of n + 1 
of the p(a@;) such that each of its non-empty subsets has a non-zero sum; but this is 
immediate, since either n + 1 of the p(a;) are positive or n + 1 of the p(a;) are 
negative. 
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In the assertion of the problem the number n+ 1 cannot be replaced by a 
larger one, as is shown by the example a; = (— 1)’e + i for i = 0,1,...,2n. 


Solution IT by an anonymous contributor. Let W be the finite-dimensional vector 
space over () generated by the elements of S. Since S is a finite set of nonzero 
elements in W, there exists a hyperplane through the origin of W containing no 
elements of S. Clearly n + 1 of the elements of S fall on the same side of this 
hyperplane. These n + 1 elements of S form a set T having the property stated in 
the problem; in fact no nontrivial nonnegative linear combination of the elements 
of T equals zero. 

Solved also by I. C. Bivens, R. J. Chapman (England), L. E. Clarke (England), J. Ferrer (Spain), 
S. Gaignoux (France), J. Henle, I. M. Isaacs, O. P. Lossers (The Netherlands), H. Noland, A. Riese, 


P. Shirts, P. A. Staley, J. H. Steelman, R. Stong, O. Such (Czechoslovakia), P. Tracy, E. Triesch (West 
Germany), T. P. Vaughan, and the proposers. 


To Be or Not to Be Injective 


E 3325 [1989, 445]. Proposed by Walter Rudin, University of Wisconsin, Madison. 


Let us say that a function f of the form 
f(z) =z+ Yoa,z" 
n=2 


has property P, if LU _.nl|a,| < t. Prove: 

a) If P, holds for some t < ~, then f is continuous on the closed unit disc, i.e. 
on {z € Clz| < 1}. 

b) If P; holds, then f is one-to-one on the closed unit disc. 

c) If t > 1, then some f satisfies P, without being one-to-one in the open unit 
disc. 


Solution to (a) by Raymond Mortini, Universitit Karlsruhe, Karlsruhe, West 
Germany. If f satisfies property t for some t > 0, then the power series for f’ and 
that for f converge uniformly (and absolutely) on D = {z: |z| < 1}. Hence f and f’ 
have continuous extensions to D. 


Solution to (b) by Adam Riese, Wright State University, Dayton, OH. If P, holds 
for f, choose z,w € D with z # w. Then |z + w| <2 and |z"~! + z"~2w| < 2. 
Thus |z” —w"|=|z—wllz”7b4+2z"-*w+--- tw? "1<|z-—wln. If f(z) #z, 
then |f(z) — f(w)| > lz — wl — [LP_,a,(z” — w”)| > |z — wil — LF _la,|n) > 0. 
Hence f is one-to-one. 


Solution to (c) by K. F. Andersen, University of Alberta, Edmonton, Alberta. 
Given t > 1, choose b > 0 such that (1/t)* + b? < 1. Set f(z) =z + (t/2)z’ = 
(¢/2Mz + 1/t)* — 1/(2t); note that f has property P.. For z; = —t~'! + bi and 
Z, = —t~! — bi, we have |z,| =|z,| < land f(z,) = —(1 + b7t?)/(20) = f(z,), so 
f is not one-to-one in {z: |z| < 1}. 


Editorial comment. Raymond Mortini remarked that parts (b) and (c) are 
Problem 24(a) on p. 73 of P. L. Duren’s Univalent Functions. A. W. Goodman 
remarked that under the hypothesis of (b) the function f maps the closed unit disc 
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onto a domain that is starlike with respect to the origin; see Proc. Amer. Math. 
Soc. 8 (1957) 598-601. 


Solved by the proposer and 23 others. Five incorrect solutions were received. 


Lines Normal to a Cubic at Two Points 


E 3334 [1989, 524]. Proposed by Stanley Rabinowitz, Westford MA. 


Consider the cubic curve y = x? + ax* + bx +c, where a, b,c are real numbers 
with a? — 3b > 373. Prove that there are exactly two lines that are perpendicular 
(normal) to the cubic at two points of intersection and that these two lines 
intersect at the point of inflection of the curve. 


Solution by Luiz Felipe Martins, Brown University, Providence, RI. We may 
assume a = c = Q; the general case is reduced to this by considering a coordinate 
translation that places the origin at the point of inflection of the cubic. The slope 
of the line tangent to the cubic at the inflection point is b — a*/3, so the quantity 
a* — 3b = —3(b — a’/3) is invariant under translation. 

Hence we assume the curve is y = f(x) =x°* + bx with b < — y3. Let P= 
(p, f(p)) and Q = (qa, f(q)) be points on the curve such that PQ is normal to the 
curve at P and Q. The tangents at P and Q must be parallel, so f'(p) = f'(q). 
Since f’ is an even quadratic function, this implies g = —p. Since f is odd, this 
implies that PQ contains the origin, so all such “double normals” intersect there. 

To determine how many such PQ there are, we may assume p > 0. With 
q = —p, the slope of PQ is f(p)/p, so PQ is normal to the curve at p and q if 
and only if f’(p) = —p/f(p), or 3p* + 4bp* + b? + 1 = 0. The solutions of this 
quadratic in p? are p? = (—2b + Vb? — 3)/3. With b < — 73, both solutions 
are real and positive, and the positive square roots of these are the desired values 
of p. These yield exactly two distinct solutions PQ. 


Solution I by Michael Vowe, Therwil, Switzerland. As above, we reduce the 
problem to f(x) = xi +bhx,b< — V3. Let y = mx +d be a line normal to the 
curve at (p, f(p)) and (q, f(q)). As above, f’(p) = f(g), which with normality 
implies 3p* + b = —1/m = 3q* + b. This yields g = —p, f(q) = —f(p), which 
implies m = f(p)/p = p* + b, d=0. Hence —1/m = f'(p) = 3p? +b = 3m — 
2b. The solutions to this quadratic in m are m = (b + Vb* — 3)/3, both negative 
for b < — y3. Since d = 0, the two lines intersect at (0,0), the point of inflection. 


Solved by 35 readers and the proposer. One incomplete solution was received. 


A Dense Set Determined by a Polynomial 


E 3336 [1989, 525]. Proposed by Nick MacKinnon, Winchester, England. 


Suppose P is a given polynomial with integer coefficients and degree greater 
than one. Let 


W = {w:w is an irrational real number but P(w) is rational}. 


Prove that W is dense on the real line. 
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Solution by Stewart A. Levin, Mobil Research and Development, Dallas, TX. We 
exhibit a set S of rational numbers that is dense and disjoint from P(Q). Since P is 
continuously differentiable and has a finite number of isolated extrema, the real 
line can be decomposed into a finite number of intervals on each of which P is 
monotonic. By considering each such interval separately, it is easy to see that the 
set of irrational w having P(w) in such a set is dense on the real line. With 
PWw) = Liao jw, let S be the set of rational numbers p/q such that q is a prime 
not dividing p or a,. If p/q = P(w) for p/q € S and w € Q, then w = r/(sq) for 
some integers r,s with gcd(q, r) = 1. For if z is a rational number with denomina- 
tor relatively prime to qg, then the denominator of the rational number P(z) is also 
relatively prime to g. However, P(r/(sq)) = L7_oa,r/(sq)" 4 /(sq)". Since q di- 
vides all the terms in the numerator except a,r”, the numerator is relatively prime 
to q. Hence the denominator of the reduced fraction equal to P(r/(sq)) contains a 
factor of g” with n > 1, contradicting P(w) = p/q. 


Editorial comment. The proposer and several other solvers used the Eisenstein 
Irreducibility Criterion to prove that there is a dense set of rational numbers a 
such that P(x) — a is irreducible over Q. This implies that there is a dense set of 
algebraic numbers w of exact degree n such that P(w) is rational. 


Solved also by 26 other readers and the proposer. 


ADVANCED PROBLEMS 


6640. Proposed by Douglas Bowman (student), UCLA. 


Let f(n) be the number of partitions of the positive integer n into parts taken 
from the set of divisors of n. Prove that 


{1 + oc) {2 -- t}Iog n < log f(n) < {1+ o(1)} ae log n, 


where 7(7) is the number of divisors of 7. 


6641. Proposed by Theodore M. Alper, Stanford University, Stanford, CA. 


(a) For every positive integer n, is there an integer N, such that every finite 
group of order at least N,, has an abelian subgroup of order at least n? 

(b) For every positive integer n, does every infinite group have an abelian 
subgroup (possibly infinite) of order at least n? 


6642. Proposed by the editors. 


Let A be the maximum possible inradius of an arbitrary triangle lying in the 
closed set bounded by a regular pentagon of side-length one. 

(a)* Determine A up to an error of at most 107°. 

(b)* Determine A exactly. 
Cf. 6477 [1984, 588; 1989, 945] and 6478 [1984, 588; 1990, just below]. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Inradii and Width 


6478 [1984, 588]. Proposed by Louis Funar, University of Craiova, Rumania. 


Let r be the radius of the incircle of an arbitrary triangle lying in a closed 
convex figure F of width w, and let R be the radius of the incircle of F. Are the 
following inequalities valid? 

(i)* 7 < (supr)/w < 5, 

(ii)* +< (supr)/R < 1. 


Solution (partial) by C. A. Rogers, University College, London. Three of the 
inequalities are easy; the remaining one is, we believe, invalid. 

Since the width of the incircle of the triangle is 27, we clearly have 2r < w. 

Since the incircle of the triangle is contained in F, we clearly have r < R. 

Since the incircle of F contains an equilateral triangle with incircle of radius 
5R, we clearly have (sup r)/R > 3. 

The remaining inequality (sup r)/w > ; appears to be incorrect. To see this 
consider the special case when F is a regular pentagon ABCDE. We work on the 
assumption, which seems highly plausible, that the triangles, for which the largest 
inradius is attained, are isosocles triangles inscribed in the pentagon, the vertex 
where the equal sides meet being at one of the vertices, say A, of the pentagon. 
Let the second and third vertices of the triangle be F on CB and G on DE, the 
ratios CF/CB and DG/DE having a common value ¢, with 0 <t < 1. Simple 
numerical calculations show that the maximum value for r/w is attained for ¢ near 
0.27 and is less than 0.2441. Thus, provided our assumption is justified, we do not 
have (sup r)/w > = when F is a regular pentagon. 


Editorial comment. Although the inequality (sup r)/w > 7 seems to be incor- 
rect, it is true that (sup r)/w has a positive lower bound independent of F. For by 
Theorem 50 of H. G. Eggleston, Convexity, Cambridge Tract No. 47, 1958, we have 
R > w/3; since sup r > R/2, it follows that (sup r)/w > <. (This weaker inequal- 
ity was noted by the proposer.) 

To complete Rogers’ argument that the inequality (sup r)/w > 7 is not always 
true, one needs a rather precise determination of the maximal possible inradius of 
a triangle lying in the closed set bounded by a regular pentagon of width one, or at 
least a proof that this maximum is strictly less than 7. Accordingly we have posed 
this question as Advanced Problem 6642 above. (Since 6642 is stated in terms of 
side-length, we note that for a regular pentagon 


width 1 + cos(7/5) 5+ 75 
side-length  2cos(7/5) 4/419 — 2/5 


It remains to find the greatest lower bound of (sup r)/w as F ranges over all 
convex figures of width one and to determine whether or not the convex set 
bounded by a regular pentagon is the extreme case. We hope that some of our 
readers will consider this question as well. 


= 1.5388418. 


The only solution received before the customary deadline was by L. Kuipers (Switzerland), who gave 
a proof of the inequalities of the problem for the special case when F is a parallelogram. The solution 
given above was prepared by Professor Rogers at the request of the editors. 
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The Fejér-Jackson Sum and Legendre Polynomials 
6585 [1988, 880]. Proposed by Alexandru Lupas, Sibiu, Romania. 
Prove that if —1 <x < 1, then 
sin(k arc cos x) vl-—x » 1-Piy) dy 


x k 2 I, l-y yx-y’ 


where 


P(x) = 


1 d \" > 4)" 
2”"n! a | (3 
is the nth Legendre polynomial. 


Solution by the Western Maryland College Problems Group, Westminster, Mary- 
land. We will use induction. The case n = 1 is easily checked. If the assertion is 
true up to m — 1, the case for n reduces to showing that 


(1 + ry? fn P,ae) ~ P(t))(1 —t) "(x —1) 7? dt 


=(1 ~ x2) 17 sin(n arccos x), (1) 


i.e., to Showing that L,_ (x) = U,_,(x), where L,,_ (x) is the left-hand side of (1) 
and U,,_,(x) = sin(w arccos x)/sin(arccos x) is the Chebyshev polynomial of the 
second kind. We require the Christoffel-Darboux formula for Legendre poly- 
nomials 


n(Py (8) ~ DMA 91 = Yk + PACH) (2) 


(This formula (2) may be derived by summation from the familiar recurrence 
formula 


x(2k + 1)P,(x) = (kK + 1)P,, (x) + kP,_1(x).) 


If we make the substitution tf = x — (1 + x)s* and use (2), then L,_ (x) becomes 
n—-l 
1 
L, (x) = ¥, (2k + 1) f P,(x — (1 + x)s?) ds. (3) 
k=0 0 
Now from the classical generating function 
. k+1/2 — 41/241 _. 2\—1/2 
Y) P(t)t ='/7(1-—2xt+t?) “, 
k=0 
we obtain by differentiation 
S. (2k + 1)P,(t)t* = (1 — #2) (1 — 2at + 0?) °°”. (4) 


k=0 


Multiplying (3) by ¢”~', summing from n=1 to ©, changing the order of 
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summation, and using (4), we obtain 


etn, x) = y (2k + 1) y t"~ [P, (x — (1 + x)s?) ds 


k=0 n=k+1 


fra —t)7! x (2k + 1)P,(x — (1 +x)s?)t* ds 


=0 


1 _ 
= (1+) f/{1 — 2xr + 1? + 2¢(1 + x)s9}°/ ds 
0 


= (1 —2xt+ 1’). 


by an elementary integration. Since 


Ye? -1U,_ (x) = (1 — 2st + 0?) 
n=0 


(1) follows. 


Editorial comment. Most of our solvers were rather cavalier about interchange 
of limit processes, but David Callan provided a proof somewhat similar to the 
above that justified such operations by uniform convergence and the Moore-Osgood 
theorem. Some solvers demonstrated a knowledge of Jacobi polynomials, while 
others relied on the formula 


SS (Tx) + Toy (2)) 
a x) + X)), 
lea ve—-t  (2n+1)vVl+x*” an 
where T(x) = cosbn arccos x) is the Chebyshev polynomial of the first kind (see, 


e.g., Abramowitz and Stegun, Handbook of Mathematical Functions, Formula 
22.13.10, p. 786). 

Since |P,(t)| < 1 for —1 <t < 1, this provides, as the proposer remarks, a proof 
of the Fejér-Jackson inequality 


” sin k@ 
» 


k=1 


> 0, O<0<7. 


Richard Askey informs us that despite about 20 previous proofs, this one appears 
new! For a proof of Turan’s inequality by a similar method, see 6517 [1986, 305; 
1988, 264]. 


Solved also by Jean Anglesio (France), James P. Butler, Paul F. Byrd, David Callan, Louis N. 
Howard, A. A. Jagers (The Netherlands), Haseo Ki (Korea), Allen R. Miller, I. A. Sakmar (Turkey), 
and the proposer. 
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Learning Discrete Mathematics With ISETL. By Nancy Baxter, Ed Dubinsky, and 
Gary Levin, Springer-Verlag, 1989, xvii + 416 pp. 


DONALD L. MUENCH 
Department of Mathematics and Computer Science, St. John Fisher College, 
Rochester, New York 14618 


Next to the blizzard of lectures, workshops, minicourses and publicity surround- 
ing calculus reform, there is a brisk breeze surrounding discrete mathematics. 
Starting with Anthony Ralston’s controversial article [6] on the role of discrete 
mathematics in the curriculum and its threat to the privileged status of calculus as 
the pearly gateway to mathematical exaltation, there have been as many ideas 
about discrete mathematics as there are mathematicians. But, if you examine the 
textbooks on the subject, there are more similarities than differences, although 
there are books that approach the subject in radically different ways. Fred 
Roberts’ Discrete Mathematical Models [7] is one significant example. And, the 
book under review, Learning Discrete Mathematics with ISETL, by Nancy Baxter, 
Ed Dubinsky, and Gary Levin, is yet another noteworthy exception. This book 
stands apart from the others in its unique approach, in its grounding in a 
distinctive theory of learning, and in its use of a not-so-familiar (but steadily 
becoming better known) computer language as a powerful medium of mathemati- 
cal expression. 

As soon as one hears or sees the words “computer” and “mathematics” in close 
proximity, the reaction is either enthusiasm or alarm. The alarming unease is often 
accompanied by a sense of sadness—as though something valuable has been lost 
or defiled—and a concern that some important topic in the curriculum will be 
omitted or diminished in order to make room for “the programming”. 

In contrast, the enthusiasm derives from the conviction that the use of the new 
technology should and will radically alter the present curriculum in the direction of 
encouraging the kinds of mathematical skill and thinking that have either been 
missing or could not have been done effectively before. 

However, the computer should not be the focus of the debate. The computer is 
not the central element in the discussion; how students learn is what we should be 
talking about. Using the computer can motivate, help, and reinforce student 
learning. The idea is to concretize the abstract concepts, to make them tangible 
and real to the student. Only then can our students make the mental images that 
breathe life into abstractions. 

Because learning is central, it is significant that the title of this book is not 
“Discrete Mathematics with ISETL”. The “Learning” in the title is a clue that this 
book is different and has a definite orientation toward the student. This is reason 
enough to read it and act on it with your (essentially) free copy of ISETL, which 
unfortunately does not come with the book. But, you may get ISETL and 
documentation from Ron Pullins of West Educational Publishing. 

The approach taken by the authors is the result of basic research done by the 
first two authors [1], [2] on how students learn. Their theory, heavily based on the 
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work of Piaget, is used to develop the topics in the book, carefully constructing 
computer activities which concretize concepts such as propositional calculus, 
multilevel quantification, induction, relations, functions, graphs, and combina- 
torics. Their findings show that students CAN and DO learn mathematics by 
writing small computer programs. Anyone adopting this book needs to be aware 
that it calls for the active participation of both the student and the teacher. One 
does not read this book without sitting down at a computer terminal and DOING 
something with the concepts that are being explained. And the beauty of it is that 
no coercion is necessary; the students become the most important players and the 
influence of the teacher recedes into the background. They DO the work and 
interact with one another with very little prompting from the professor. They 
WANT to do it, and they ENJOY doing it. Oh, yes, they do get frustrated from 
time to time, but they don’t give up. 

Creative approaches such as this may very well revolutionize the way we teach 
because it emphasizes the role of the student as the primary agent of learning. 
Seymour Papert [5] has described our traditional lecture method as one which 
makes the students “passive observers” and not “active participants.” What 
students often do in class is transcribe words from the instructor’s tongue onto 
paper. Then, when they study, they employ the method of pattern-matching, 
hoping to find examples in the book or their notes which resemble the problem 
they are working on. And, when they imitate us and their textbook, we reward 
them. And then we complain that they really haven’t learned to be independent 
thinkers. 

So how do we remedy this situation? Because the main goal of our teaching is to 
facilitate learning, our teaching should be informed by the results of investigations 
by scholars into the way students learn. This knowledge should shape our instruc- 
tional strategies and influence the way we present mathematics in our textbooks. 
With the proper materials, written from the point of view of how students learn, 
mathematics may be learned more effectively. And, if computers are in the 
classroom laboratory (someday “laboratory” and “classroom” will be synonymous! ), 
there will be little lecturing and more interactive learning and discussion. 

What is going on in the classroom when a computer is being used? Certainly not 
the copying of notes that the instructor has written on a blackboard! What is 
happening is active learning. By performing carefully designed computer activities, 
the students develop mental images which they use to construct an abstract 
mathematical concept. They concretize abstract ideas through computer represen- 
tations. Using a language like ISETL, students learn about properties of mathe- 
matical objects by writing “property-testers.” These are ISETL statements which 
look like the familiar descriptions and definitions in precise mathematical nota- 
tion. The computer is the powerful means to the end of learning and developing 
mathematical maturity and sophistication. It is not the end itself. 

What is ISETL? Before an answer can be given, it should be said that the 
language and the software should be chosen for their suitability in promoting 
student learning. If it is easy to learn and powerful enough to enable students to 
form the mental images we want them to construct, and encourages experimenta- 
tion and discovery, then it may be considered in a course. ISETL is a powerful 
high-level language whose syntax is close to standard mathematical notation. It is 
easy to learn, read, and write. Derived from SETL [8] and developed by Gary 
Levin [3], with substantial input from the other authors, as a tool for the teaching 
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and learning of mathematical concepts, ISETL (Interactive SET Language) has the 
expressive power to construct concrete realizations of mathematical concepts. It 
supports boolean expressions and operations (and, or, not, implication), heteroge- 
neous sets and tuples (sequences that have no fixed size), iteration over sets, tuples 
and strings, and both universal and existential quantification. ISETL also supports 
the usual control statements (while-do, for-do and if-then-else). There are no 
graphics with the ISETL package, but a new version now under development will 
include it. 

ISETL has no procedures, only functions. Functions (funcs in ISETL) are 
first-class objects—that is, they may be used just like any other data object. 
Consequently, they can be elements of sets and sequences and can be input 
parameters in a function and be returned by a function. This powerful feature 
provides incredible flexibility to express concretely not only the idea of a function 
itself, but other abstract concepts as well. A more extensive review of ISETL can 
be found in [4]. 

Probably the most effective chapter of all in the book is the chapter on 
functions. This one chapter may be the best explanation and development of the 
idea of a function that has appeared in any textbook. Considerable effort is taken 
to lead the student to a thorough understanding of the ways in which functions 
may be represented: as a process, as a set of ordered pairs, and (for those whose 
domains are positive integers) strings and sequences. The ISETL exercises ask 
students to convert from one mode of representation to another and ask them to 
formulate the meaning of function composition in these different modes, thus 
leading them to the realization that a function is an object. 

In describing a function as a set of ordered pairs, the authors say only that each 
element of the domain must appear exactly once as the first component of one of 
the pairs. The student is then asked to make the precise mathematical definition. 
It is this reviewer’s experience that the students discover on their own several 
equivalent formulations of the definition. Here are some actual definitions that 
students have given, without the benefit of looking in some textbook (This is 
ISETL code, where “f” is a set of ordered pairs and u(1) and v(1) refer to the first 
components in the pairs u and v): 


(1) forall x in domains (f)|forall y1, y2 in image (f)| 
[x, yl] in f and [x, y2] in f imp /y1 = y2; 


(2) forall u,v in flu/ = v imp uQ)/ = vQ); 

(3) forall u,v in flu() = vQ) imp lu = v; 

(4) {v():v in f\(exists u in f| (u/= uv and u(1) = v(1)))} = € }; 
(5) {vQ) = uQ): v,u in flu/ = u} = {false}. 


Some of these look familiar, but others look strange. In all cases, they “belong” 
to the students who wrote them. One of the strange ones, (5), is not quite correct. 
A very interesting and spirited classroom discussion about (5) followed! This kind 
of exercise and discussion probably never would have taken place in the typical 
“traditional” classroom. 

So, whither the computer revolution in mathematical instruction? It looks as 
though new and exciting materials will continue to evolve for teaching not only 
discrete mathematics, but also calculus, differential equations, linear algebra, and 
abstract algebra. But, be warned: the use of these materials may be hazardous to 
your former way of teaching and learning. With careful attention to the design of 
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appropriate activities in the classroom and in the laboratory, they just may help 
students learn, think, create, and grow intellectually. 
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Generatingfunctionology. By Herbert S. Wilf, Academic Press, San Diego, 1990. vili 
+ 184 pp. 


RICHARD P. STANLEY 
Department of Mathematics, M.I.T., Cambridge, MA 02139 


Let g(n) denote the number of graphs with vertex set {1,2,...,} (without 
loops or multiple edges) which are regular of degree two. Then 
—4y—ix? 


x" e 
Ly vl—-x 

Who cannot be fascinated by this strange mixture of combinatorics and analysis? 
The theory of generating functions seeks to develop general methods for proving 
such identities and for using these identities to obtain further information about 
the coefficients (here g(n)). All introductory texts in combinatorics have some 
rudimentary information about generating functions, but Herb Wilf’s Generating- 
functionology is the first book suitable for undergraduates to be devoted exclusively 
to this topic. It performs an admirable job of conveying the essential features of 
generating functions. 

Three types of generating functions are considered in Generatingfunctionology : 
(a) ordinary generating functions, (b) exponential generating functions, and (briefly) 
(c) Dirichlet series. These types have the respective forms 


x” 
Lf(n)x", Lf), L f(n)nn. 
n>0 n>0 ° n>1 
In all cases it is important to regard these series as formal objects, without regard 
to questions of convergence. If a generating function does converge in a suitable 
region of the complex plane, then techniques from complex variable theory can 
often be used to obtain further information about the coefficients. But even 
without convergence the generating function can provide valuable understanding 
of the coefficients. These issues are treated well in Generatingfunctionology. 
A basic question which faces anyone wishing to investigate a sequence by the 
use of generating functions is the following: Which generating function should be 


nh 
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used? Many general situations appropriate to one type of generating function are 
discussed in Generatingfunctionology. In particular, the use of exponential generat- 
ing functions in situations where one has a structure on a set (or “labelled 
structure”) built up from “connected” pieces is explained using a colorful termi- 
nology due to Adriano Garsia. For instance, the number f(7) of involutions 
(permutations whose square is the identity) in the symmetric group S,, is given by 


E p(n) =e? 

n=0 n} 

The exponent x + x*/2 arises because an involution is built up from 1-cycles 
(fixed points) and 2-cycles (transpositions). A result known as the Exponential 
Formula provides a general framework for understanding such results. 

A nice overview is given in Chapter 4 of the basic applications of generating 
functions. These include finding averages and standard deviations, proving combi- 
natorial identities and congruences, and proving results on unimodality, convexity, 
etc. The treatment of combinatorial identities is especially lucid and takes the 
reader to the frontiers of current research by providing an introduction to the 
WZ-method of Wilf and Zeilberger. As an example of a combinatorial identity 
discussed in Generatingfunctionology, consider the sum 


n-E ("Rk pr* (mao) 
og \ 2k 
If we form the generating function y = L,,.)f,x”, then by straightforward manip- 
ulations involving little real thought we obtain 


: 2 1 
~ 3(1 — 4x) r 3(1 —x)/ 


It follows that f, = (2*"+' + 1)/3. This “mechanical” procedure for evaluating 
combinatorial sums is analogous to the use of Laplace transforms to solve certain 
classes of differential equations. 

The final chapter of Generatingfunctionology is devoted to analytic and asymp- 
totic techniques, and will be accessible only to students who have had a course in 
complex variables. The three sections on asymptotics provide a brief but useful 
introduction to the connections between analyticity and asymptotics. There is also 
a short section on one of the most interesting and elegant formulas in enumerative 
combinatorics—the Lagrange Inversion Formula, which gives a formula for the 
coefficients of the generating function f(u(t)), where f is an arbitrary generating 
function, and where for a given generating function A(t), we have that u(t) is 
defined by the functional equation u = tdé(u). Such generating functions are 
common in enumerative combinatorics, and the Lagrange Inversion Formula has 
wide applicability, especially in situations dealing with trees and similar recursive 
structures. For instance, if r(m) denotes the number of labeled (rooted) trees on n 
vertices and we set 


y 


t” 
u(t) — » r(n)—, 
nol n!} 
then a simple use of the Exponential Formula yields u(t) = te“. Lagrange 
inversion then produces the miraculous formula r(n) = n”~*. (There are now 
many other known proofs of this formula, which goes back to Borchardt, Sylvester, 
and Cayley.) Similarly, if a(n) denotes the number of ways to divide a convex 
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polygon with 2” sides into m — 1 quadrilaterals by drawing n — 2 non-intersecting 
diagonals, then the generating function u(t) = L,,. a(n)t*"*! satisfies u = t/ 
(1 — u*), and a simple application of Lagrange inversion yields the equally miracu- 
lous result 


a(n) = | ( 3n 

2nt+1\ny} 
The treatment of Lagrange inversion in Generatingfunctionology is probably the 
weakest part of the book. While the proof Wilf provides is short and elegant, it 
uses the residue theorem and is only valid for suitably convergent power series. On 
the other hand, a proof due to Jacobi is equally short and elegant but is completely 
formal and uses no deep machinery. There are also purely combinatorial proofs 
due to Raney and to Joyal-Labelle which provide great insight into the combina- 
torics of trees, lattice paths, and formal languages. While the proof and applica- 
tions of Lagrange inversion in Generatingfunctionology are disappointing, this is a 

small quibble about an overall quite engaging book. 

What more can be done with generating functions beyond the subject matter of 
Generatingfunctionology? To whet the appetite of the reader for further study of 
generating functions, consider the following three enumerative functions: Let f(n) 
be the number of ways to write the positive integer n as an ordered sum of positive 
integers. Let g(n) be the number of ways to write an n-element set as an ordered 
union of pairwise disjoint nonempty subsets. Let h(n) be the number of ways to 
write the integer m > 1 as an ordered product of integers > 1. Define the generat- 
ing functions 


F(x) =1+ > f(n)x", G(x) =1+ 9 a(n) —, H(x) =1+ Yi A(njn’s. 


n>1 n>1 n>2 


Then we have 


1 
F = , G = , H , 
(*) a I O) = 5G O) = 5 f(s) 
1-x 
where 
1 x” 
7 = > x", e= —, f{(s)= Yin™. 
— * n>O n>o n>1 


Surely these formulas can’t be a coincidence! There is an obvious necessity to 
develop a unifying theory which will “explain” the various types of generating 
functions and the situations to which they can be applied. Indeed, there are several 
such theories, based on such concepts as prefabs, incidence algebras, dissects, 
linked sets, and species. The various unifying theories of generating functions 
comprise one of many topics for study after completion of Generatingfunctionology. 
There are also a vast number of other uses and applications of generating 
functions, involving, for instance, the transfer-matrix method, elliptic functions, 
symmetric functions, Hilbert functions, convex polytopes, partially ordered sets, 
formal languages, lattice paths,... . Enjoy! 
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General, S*, L*. Fifty Per Cent Proof: An An- 
thology of Mathematics Humour. Keith Selkirk, 
William Wynne Willson. MAA, 1989, 87 pp, (P). 
[ISBN: 0-7487-0166-4] Chock-full of humorous sto- 
ries, quotes, cartoons, misstatements, and jokes. 
Good evidence of something most of us (all of us?) 
already knew—mathematics can be very funny, es- 
pecially when it isn’t trying to. MPR 


General, S, P, L. Bias in Quantifying Judgments. 
E.C. Poulton. Lawrence Erlbaum Associates, 1989, 
xxi + 304 pp, $49.95. [ISBN: 0-86377-105-X] Inter- 
esting but technical study of the various types of bias 
that can result when people make quantitative judg- 
ments. Includes both theory and practical illustra- 
tions, together with ways to avoid particular biases. 
Good set of references. RSK 


General, S**, L*. Paradozes of Gambling Be- 
haviour. Willem Albert Wagenaar. Essays in Cog- 
nitive Psychology. Lawrence Erlbaum Associates, 
1988, vii + 126 pp, $24.95. [ISBN: 0-86377-080- 
0] Psychological and statistical investigation of why 
people gamble, especially in the face of systematic 
losses. Based on extensive research with roulette, 
blackjack, and lottery players. One set of experi- 
ments clearly indicates that gamblers view chance 
and luck as different concepts (in roulette, e.g., 
chance determines the outcome of the wheel, but luck 
influences the player’s choice of a number). Also in- 
cludes a chapter on the legal problem of distinguish- 
ing between games of chance and games of skill. RSK 


General, L**, Envisioning Information. Edward 
R. Tufte. Graphics Pr, 1990, 126 pp, $48. A mag- 
nificant sequel to Tufte’s classic The Visual Dis- 
play of Quantitative Information (TR, March 1984), 
this volume features maps, time tables, diagrams, 
charts—even computer screens—in order to illustrate 
wise principles of information design: “self-effacing 
displays intensely committed to rich data.” A beau- 
tiful book, printed in six colors, whose own page lay- 
out is itself a pleasure to behold. LAS 


General, S, L. Figuring: The Joy of Numbers. 
Shakuntala Devi. Penguin Books, 1990, 157 pp, 


£3.99 (P). [ISBN: 0-14-011850-0] Number curiosi- 
ties and insider tricks for doing mental calculations 
with the four basic operations of addition, substrac- 
tion, multiplication, and division. LCL 


General, P*, L*. Using Writing To Teach 
Mathematics. Ed: Andrew Sterrett. MAA Notes 
No. 16. MAA, 1990, xvii + 139 pp, $15 (P). 
[ISBN: 0-88385-066-4] An important compendium 
of thirty-one diverse experiences in “writing to learn” 
mathematics—an overdue idea whose time has finally 
come. Authors report on very different approaches 
(from writing proofs to writing mathematical autobi- 
ographies; from keeping personal journals to prepar- 
ing formal papers), operating at various curricular 
levels. “I need to explain this to myself,” wrote one 
student, “so I can understand it.” LAS 


General, L. Guidelines for the Use of Calculators 
in Competitions. M. Kathleen Heid, et al. NCTM, 
1989, v + 24 pp, $9.60 (P). [ISBN: 0-87353-286-4] 
Practical suggestions on such issues as deciding how 
and when calculators might be used, and on writing 
appropriate questions (including examples). Essen- 
tial reading for all mathematics educators. LCL 


Elementary, T(13). Introductory Geometry: A 
Brief Course with Reasoning Skills. Alan Wise. Har- 
court Brace Jovanovich, 1990, viii + 132 pp, $6 (P). 
{ISBN: 0-15-546508-2] A guided workbook covering 
basic geometric concepts for college developmental 
mathematics courses. JNC 


Precalculus, T*(13: 1). Trigonometry. Charles 
W. Gantner, Thomas E. Gantner. PWS-Kent, 1990, 
xii + 388 pp, $29.50. (ISBN: 0-534-92158-2] A 
review of geometrical concepts precedes coverage 
of traditional topics in right triangle and analytic 
trigonometry. Includes a nice historical introduction 
and interspersed historical notes, Basic programs 
for numerical computations and graphing, and com- 
ments about calculator usage. JNC 


Finite Mathematics, T(13-14: 1), L. Finite 
Mathematics. Lawrence E. Spence, Charles Vanden 
Eynden, Daniel Gallin. Scott Foresman, 1990, xvi + 
591 pp, $34. [ISBN: 0-673-38582-5] A straightfor- 


867 


868 


ward presentation of the conventional material. Top- 
ics include linear programming, finance, probability 
and statistics, Markov chains, game theory. Good 
selection of examples and exercises. Index of appli- 
cations. JS 


Finite Mathematics, T(13: 2). Applied Math- 
ematics for the Management, Life, and Social Sci- 
ences. Lawrence E. Spence, Charles Vanden Eyn- 
den, Daniel Gallin. Scott Foresman, 1990, xvi + 
1049 pp, $40. [ISBN: 0-673-18837-X] Intuitive but 
honest. Stresses concepts over techniques; wide vari- 
ety of applications. Algebra review throughout; ex- 
ponential and logarithmic functions interspersed in 
several chapters to provide early introduction. Cal- 
culator access assumed, but not essential. Numerous 
exercises; practice problems with complete solutions 
at book’s end. Appealing appearance. Supplements 
and aids are available. JDEK 


Education, P. From Gatekeeper to Gateway: 
Transforming Testing in America. National Com- 
mission on Testing and Public Policy (Boston Col- 
lege, Chestnut Hill, MA), 1990, xiii + 59 pp, (P). 
An important and very timely polemic against the 
overuse, misuse, and abuse of multiple choice tests. 
Recommendations stress new objectives for testing— 
“to promote the development of all human talent” 
and to ensure greater public accountability. LAS 


Education, P, L. Soviet Studies in Mathematics 
Education. Transl: Joan Teller. NCTM, 1990, $25 
each (P), [ISBN: 0-87353-289-9] set. Volume 1: Sets 
of Mathematics Teaching Aids, M. Ya. Antonovskii, 
et al., xix + 377 pp; Volume 2: Types of Generaliza- 
tion in Instruction, V.V. Davydov, xviii + 438 pp. 
First two volumes in a series of translations of Soviet 
literature on research in the psychology of mathemat- 
ics instruction, especially on the dynamics of mental 
activity and principles of the learning process. Vol- 
ume 1, originally published in 1971, provides a theo- 
retical foundation for the use of instructional media 
in elementary mathematics lessons. Volume 2, orig- 
inally published in 1972, conveys—in the context of 
a dialectical materialist theory of cognition—an in- 
novative view of concept formation based on coun- 
terpoint between the “abstract” and the “concrete.” 


LAS 


Education, P*, L*. Making the Grade in Math- 
ematics: Elementary School Mathematics in the 
United States, Taiwan, and Japan. Harold W. 
Stevenson, et al. NCTM, 1990, iv + 34 pp, $9.50 
(P). [ISBN: 0-87353-292-9] A concise summary in 
plain language of the authors’ comparative research 
from three countries (Taiwan, Japan, U.S.) of chil- 
dren’s mathematics achievement, their attitudes 
about the role of work vs. ability, and their moth- 
ers’ role and attitudes concerning mathematics edu- 
cation. Dispels several widespread myths (e.g., that 
Japanese and Chinese success in mathematics is due 
to rote learning, superior intelligence, or unrelenting 
parental pressure), and documents several key dif- 
ferences (e.g., educational priority of mathematics in 
school, support by parents at home). LAS 
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Education, P*, L**. On the Shoulders of Giants: 
New Approaches to Numeracy. Ed: Lynn Arthur 
Steen. National Academy Pr, 1990, vii + 232 pp, 
$17.95. [ISBN: 0-309-04234-8] A collection of five 
potential “strands” for school mathematics that il- 
lustrate how “mathematical ideas can be developed 
from informal childhood exploration through formal 
school and college study.” Authors Thomas Ban- 
choff, James Fey, David Moore, Marjorie Senechal, 
and Ian Stewart illustrate through five essays (on di- 
mension, quantity, uncertainty, shape, and change) 
the potential of deep and powerful mathematics to 
stimulate rich and inviting school curricula. 


Education, P. Mathematics Contests: A Guide 
for Involving Students and Schools. Frederick O. 
Flener. NCTM, 1990, iv + 115 pp, $9.95 (P). 
[ISBN: 0-87353-282-1] Some rationale for mathe- 
matics competitions, some hints for coaches, descrip- 
tions of the various forms that competitions take 
(timed individual or team events, relays, oral events, 
power questions ...), and a nice sampling of ques- 
tions appropriate to these various forms. AWR 


Education, P, L*. A Sourcebook for College Math- 
ematics Teaching. Ed: Alan H. Schoenfeld. MAA, 
1990, ii + 68 pp, $10 (P). [ISBN: 0-88385-068-0] 
A timely collection of pointers to improve under- 
graduate mathematics instruction: goals, curriculum 
guides, resources, advising, instructional techniques, 
use of computers, and evaluation. Individual chap- 
ters are filled with references to research findings and 
other sources of supplementary information. “Our 
goal is to inform. This is the year of national dia- 
logue, and informed dialogue is the best kind.” LAS 


History, S, P*, L**. Norbert Wiener, 1894- 
1964. P.R. Masani. Birkhauser Verlag, 1990, 416 
pp, $63. [ISBN: 0-8176-2246-2] A fascinating, thor- 
oughly documented reflection on the intellectual and 
personal character of Norbert Wiener—a “universal 
thinker of colossal proportion.” Biography; math- 
ematics (Brownian motion, harmonic analysis, sta- 
tistical physics, cybernetics); medicine (physiology, 
prosthetics, molecular biology); defense (WWI, anti- 
aircraft fire control, war games); religion (sin and 
evil, myth, catharsis, redemption); literature and art; 
and personal prejudices and blind spots. Includes ex- 
tensive quotations from correspondence with many 
Wiener contemporaries (e.g., John von Neumann, 
T.S. Eliot, Walter Reuther), and many useful ref- 
erence lists (vita, Ph.D. students, Wiener bibliogra- 
phy, references, index to over 500 names, and sub- 
ject index). Presents Wiener’s ideas in relief against 
the backdrop of his peers—other great thinkers both 
present and past. LAS 


Logic, S(16-18), P, L. An Introduction to De- 
fault Logic. Philippe Besnard. Symbolic Computat. 
Springer-Verlag, 1989, xi + 208 pp, $39.50. (ISBN: 
0-387-51566-6] Emphasis on formal development of 
default logic with a subsidiary theme of artificial in- 
telligence applications. Begins with the need for de- 
fault reasoning followed by an overview of formal 
logic as a knowledge representation scheme. Followed 
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by asummary of first order logic with nonmonotonic 
extensions to it. Chapters on default logic from the 
core of the text. Includes a chapter on circumscrip- 
tion, and a final chapter on alternate logic formaliza- 
tions of nonmonotonic reasoning. References. RJA 


Logic, P. Lecture Notes in Mathematics-1388: 
Mathematical Logic and Applications. Eds: J. Shin- 
oda, T.A. Slaman, T. Tugué. Springer-Verlag, 1989, 
222 pp, $20 (P). [ISBN: 0-387-51527-5] Proceedings 
of the 87th Meeting on Mathematical Logic and its 
Applications held in Kyoto, Japan during August 3- 
6, 1987. List of meeting participants. RJA 

Logic, T(14-18: 1), S. Systems of Logic. Nor- 
man M. Martin. Cambridge University Pr, 1989, xi 
+ 320 pp, $44.50; $15.95 (P). [ISBN: 0-521-36589- 
9; 0-521-36770-0] Assumes reader has had an in- 
troduction to logic. Studies and compares from a 
primarily syntactic point of view the characteristics 
of different logical systems: structural elements of a 
system, derivation and proof, interpretation and sat- 
isfaction, entailment and equivalence. Chapter ex- 
ercises. Appendix of historical and bibliographical 
remarks. RJA 


Logic, P. Accessible Categories: The Foundations of 
Categorical Model Theory. Michael Makkai, Robert 
Paré. Contemp. Math., V. 104. AMS, 1989, viii + 
176 pp, $31 (P). [ISBN: 0-8218-5111-X] Introduction 
to categorical model theory defined by the authors as 
“the study of set-valued models of possibly infinitary 
first order theories by means of the conceptual tools 
of category theory.” KES 


Foundations, T(13-14: 1), L. A Transition to Ad- 
vanced Mathematics, Third Edition. Douglas Smith, 
Maurice Eggen, Richard St. Andre. Brooks/Cole, 
1990, xiii + 285 pp, $38. [ISBN: 0-534-12234-5] Ex- 
tensive reworkings in Chapter 1 and 5; new optional 
section in Chapter 1 expands on examples of proof 
techniques; section on graphs in Chapter 3 rewritten 
(First Edition, TR, April 1984; Second Edition, TR, 
March 1987). LC 


Foundations, T(16-18: 1), L. Constructivism in 
Mathematics, An Introduction, Volume 1. AAS. 
Troelstra, D. van Dalen. Stud. in Logic & Found. 
of Math.,V. 121. North-Holland (US Distr: Elsevier 
Science), 1988, xx + 356 pp, $33 (P). [ISBN: 0-444- 
70506-6] First volume of two-volume text. Cov- 
ers logical formalism for intuitionism, intuitionistic 
arithmetic, constructive treatment of real numbers, 
examples of constructive analysis. Assumes familiar- 
ity with classical predicate logic. KES 


Graph Theory, T(12-14), S*, L**. Graphs and 
Their Uses. Oystein Ore. Revised by Robin J. Wil- 
son. New Math. Lib., V. 34. MAA, 1990, vill + 
153 pp, $14.50 (P). [ISBN: 0-88385-600-X] A classic 
reborn, with updated terminology, modern notation, 
and new material on interval graphs, travelling sales- 
man problem, shortest route problems, and map- 
coloring. A brief contemporary bibliography points 
readers to sources of further information. A mar- 
velous resource for math clubs, freshman seminars, 
and independent study. (Original edition is NML V. 
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10, TR, June-July 1976.) LAS 


Combinatorics, T(16-17: 1), L. Generating Func- 
tionology. Herbert S. Wilf. Academic Pr, 1990, viii 
+ 184 pp, $29.95. [ISBN: 0-12-751955-6] Introduc- 
tion to generating functions and their applications in 
enumeration. For example, the generating function 
of a sequence can be used to find an exact formula, 
averages, recurrence relations, and prove facts about 
unimodality or convexity. In addition to the com- 
binatorial aspects of generating functions, this text 
discusses asymptotic methods. LC 


Combinatorics, P. Tools and Fundamental Con- 
structions of Combinatorial Mathematics. Heinz 
Lineburg. Bibliographisches Institut, 1989, 525 pp. 
[ISBN: 3-411-03194-8] The author’s underlying be- 
lief is that induction and recursion are the most im- 
portant tools of combinatorial mathematics. Thus 
he begins with Dedekind triples and studies them in 
depth, using them for instance to define finite sets. 
Topics covered include Gray codes, the symmetric 
group, the marriage theorem, independence struc- 
tures, and partitions of finite sets. LC 


Discrete Mathematics, T(13-14: 1, 2), L. Dis- 
crete Mathematics with Applications. Susanna S. 
Epp. Wadsworth, 1990, xx + 882 pp, $51.50. [ISBN: 
0-534-09630-1] Similar in outline to most of the 
new texts, the treatment here is distinguished by 
a leisurely approach and attention to detail. There 
are two chapters on logic, two on methods of proof, 
and one on sets before counting is first encountered 
in Chapter 6 on page 315. Further chapters on 
functions, recursion, algorithms, and relations before 
graphs are introduced in Chapter 11, page 677. Copi- 
ous examples and exercises with solutions and hints; 
index. JS 


Number Theory, T*(18), S*, P**. Class Field 
Theory. Emil Artin, John Tate. Adv. Book Classics 
Ser. Addison-Wesley, 1990, xxxviii + 259 pp, $29.95. 
[ISBN: 0-201-51011-1] A reprint of the 1967 publi- 
cation of Chapters 5-12 (global theory) of the notes 
from the Princeton seminar on Class Field Theory. 
Unfortunately it is a photoreproduction of the orig- 
inal typewritten text (with many handwritten sym- 
bols), but it is still a classic, typeset or not. “Special 
Preface” includes a list of eight related books pub- 
lished since the original printing. SB 


Number Theory, P. Automorphic Forms and Ge- 
ometry of Arithmetic Varieties. Eds: K. Hashimoto, 
Y. Namikawa. Adv. Stud. in Pure Math., V. 15. 
Academic Pr, 1989, xvi + 523 pp, $79.50. [ISBN: 
0-12-330580-2] A collection of papers by Japanese 
researchers on dimension formulas for spaces of auto- 
morphic forms. The articles largely overlap a series 
of symposia held during 1985 and 1986. GG 


Linear Algebra, T(14: 1). Elementary Linear Al- 
gebra. Harvey Gerber. Brooks/Cole, 1990, xiii + 498 
pp, $42. [ISBN: 0-534-11574-8] A straightforward 
introductory text which includes historical notes, ap- 
plications, and challenging supplementary exercises. 
Chapters cover systems of linear equations; matrices; 
determinants; R?,R° and R®; algebra of R”; vec- 
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tor spaces; linear transformations; eigenvalues; linear 
programming. JNC 


Linear Algebra, T*(14: 1). Linear Algebra, 
Third Edition. Michael O’Nan, Herbert Ender- 
ton. Harcourt Brace Jovanovich, 1990, ix + 461 
pp, $35. [ISBN: 0-15-551008-8] Emphasis on geo- 
metrical point-of-view with gradual increase in level 
of abstraction and rigor. Computational algorithms 
feature Gaussian elimination and reduction to row- 
echelon form. In black and white, slowly paced, hon- 
est, and correct, from the concrete to the abstract. 
Few frills. Big plusses are the non-threatening writ- 
ing style and the anticipation of customary hur- 
dles. Numerous problems both routine and text- 
extending, with answers to many including evens. 
Solutions manual available. A textbook for tose 
not in a hurry and without a need for excessive ab- 
straction. (First Edition, TR, May 1971; Extended 
Review, June-July 1973; Second Edition, TR, June- 
July 1977.) JDEK 


Linear Algebra, T(13-14). Linear Algebra: 
An Introduction with Concurrent Ezamples. A.G. 
Hamilton. Cambridge University Pr, 1989, x + 
328 pp, $22.95 (P); $69.50. [ISBN: 0-521-31042- 
3; 0-521-32517-X] The first 40% of the book cov- 
ers linear equations and vector geometry in two- and 
three-dimensions. The remainder is devoted to the 
concepts of linear algebra (basis, dimension, linear 
maps, eigen-theory, diagonalization, orthogonality, 
and quadratic forms) in R®. An interesting feature 
is the placing of examples and exercises on the left- 
hand pages and the text on the right-hand pages. SG 


Linear Algebra, T*(14-15: 1). Linear Algebra, 
Second Edition. John B. Fraleigh, Raymond A. 
Beauregard. Addison-Wesley, 1990, xiii + 571 pp, 
$35.16. [ISBN: 0-201-11949-8] Changes from First 
Edition (TR, April 1988) include expanded exercise 
sets (including ten-part True-False sections), exam- 
ples from calculus, a chapter devoted to complex 
scalars, gradual presentation of linear transforma- 
tions (T(x) = Ax in section 1.5, general transfor- 
mations in 2.9), orthogonality chapter reorganized. 
A unique feature of this text is that eigenvalues, 
eigenvectors, and their applications are introduced 
in Chapter 4 (and recur in Chapters 5 and 8), not 
relegated to the last topic of the course. Although 
not intended to be overly rigorous, the text and ex- 
ercises contain their fair share of proofs. SB 


Calculus, T(13: 2). Calculus for Management and 
the Life and Social Sciences, Second Edition. Don- 
ald L. Stancl, Mildred L. Stancl. Richard D Irwin, 
1990, xviii + 892 pp, $43.95. [ISBN: 0-256-08251-0] 
No specific revisions from First Edition (TR, Decem- 
ber 1988) are cited. Includes chapters on probability 
and differential equations, in addition to standard 
calculus topics. No trigonometric functions appear 
until page 700. Chapters 2-12 end with supplements 
containing enrichment material. Supplements to the 
text include solution manuals, computer manual and 


disks, and test bank. SB 
Calculus, T(13: 2-3). Calculus with Analytic Ge- 
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ometry, Fourth Edition. Robert Ellis, Denny Gulick. 
Harcourt Brace Jovanovich, 1990, xvii + 1125 pp, 
$44. (ISBN: 0-15-505687-5] A traditional approach 
with a few minor changes including the use of tangent 
lines and velocity to motivate limits, a new section 
on integration using tables, and a new chapter on 
complex numbers and functions. (First Edition, TR, 
August-September 1978; Third Edition, TR, October 
1986; Alternate Edition, June-July 1988.) JNC 


Calculus, T(13: 3). Calculus: One Variable, Sizth 
Edition. S.L. Salas, Einar Hille. Wiley, 1990, xiii + 
777 pp, $46.95. [ISBN: 0-471-61626-5] Main differ- 
ence between this and previous editions is the in- 
creased presence of applications scattered through- 
out the text. (First Edition, TR, June-July 1971; 
Second Edition, TR, October 1974; Third Edi- 
tion, TR, June-July 1978 and December 1978; Fourth 
Edition, TR, June-July 1982; Fifth Edition, TR, 
November 1986.) KS 

Calculus, T*(13: 2-3). Calculus with Analytic Ge- 
ometry, Third Edition. John B. Fraleigh. Addison- 
Wesley, 1990, xv + 1032 pp, $45.56. (ISBN: 0- 
201-50363-8] This edition includes a graphical ap- 
proach, historical notes, a streamlined treatment of 
limits, earlier treatment of trigonometric, exponen- 
tial, and logarithmic functions, coverage of Taylor’s 
theorem before infinite series, and calculator and 
computer exercises with references to Ezploring Cal- 
culus, an IBM compatible software package. (First 
Edition, TR, August-September 1980; Second Edi- 
tion, TR, January 1986.) JNC 


Calculus, T(13: 1, 2), S. Applied Calculus. John 
C. Hegarty. Wiley, 1990, xv + 608 pp, $42.95. 
[ISBN: 0-471-61240-5] An informal, intuitive devel- 
opment of the rudiments of differential and integral 
calculus; applications, which are many and brief, lean 
heavily to economics. Logarithmic and exponential 
functions are introduced late; trigonometric func- 
tions not at all. Computation, not rigor, is stressed: 
e.g., the fundamental theorem appears, but not the 
mean value theorem. Computer and numerical meth- 
ods are not emphasized. Most sections have large 
exercise sets, with many word problems. PZ 


Real Analysis, T?(14). Introduction to Real Anal- 
ysis. Robert L. Brabenec. PWS-Kent, 1990, xiv + 
294 pp, $35. [ISBN: 0-534-92162-0] Essentially a 
standard first course text in real analysis. The au- 
thor writes in a “user-friendly” style which perhaps 
makes the material more accessible to many students. 
However, the author is too casual in many ways, of- 
ten to the point of error. For example: the first two 
instances of sequences are incorrectly used. MPR 


Real Analysis, P. The Ergodic Theory of Discrete 
Groups. Peter J. Nicholls. London Math. Soc. Lect. 
Note Ser., V. 143. Cambridge University Pr, 1989, xi 
+ 221 pp, $34.50 (P). [ISBN: 0-521-37674-2] A pre- 
sentation of the classical work in this area. Also an 
introduction to the recent theory of measures on the 
limit set of a discrete group. Work of S.J. Patterson, 
D. Sullivan, and others. MLR 


Complex Analysis, (17-18), P. Elements of the 
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Theory of Elliptic Functions. N.I. Akhiezer. Transl. 
of Math. Mono., V. 79. AMS, 1990, vii + 237 pp, 
$93. [ISBN: 0-8218-4532-2] A translation of the 
1970 Russian edition. This book, intended for engi- 
neers, presents the basics of the classical theory of el- 
liptic functions: modular, Weierstrass, theta, and Ja- 
cobi functions; transformations of elliptic functions, 
and elliptic integrals. Includes tables of elliptic inte- 
grals. No exercises. SG 


Complex Analysis, P. Convolutional Calculus. 
Ivan H. Dimovski. Math. & Its Applic., V. 43. 
Kluwer Academic, 1990, xi + 184 pp, $79. [ISBN: 0- 
7923-0623-6] Starting with Mikusirski’s approach 
to Heaviside’s operational calculus, the convolution 
is used as a means of unifying and extending these 
methods to first and second order differential opera- 
tors. This is a book by an author who has been a key 
contributor to the new ideas here expounded. AWR 


Complex Analysis, T(17-18:1). Boundary Value 
Problems. F.D. Gakhov. ‘Transl: I.N. Sneddon. 
Dover, 1990, xix + 561 pp, $12.95 (P). [ISBN: 0-486- 
66275-6] Written on the basis of lectures given in 
1947 so that “clarity and simplicity of style ...has 
taken preference over mathematical rigour,” this 
book follows the spirit of historical development. A 
chapter on the Cauchy integral sets the stage for the 
Riemann Boundary Value Problem (Chapter II) and 
Singular Integral Equations (Chapter III), followed 
by the Hilbert Boundary Value Problem and Singu- 
lar Integral Equations (Chapter IV). There is a final 
chapter on Generalized Boundary Value Problems. 
(1966 Addison-Wesley edition, TR, December 1967; 
Extended Review, November 1968.) AWR 


Differential Equations, P, L. The Asymptotic So- 
lution of Linear Differential Systems: Applications 
of the Levinson Theorem. M.S.P. Eastham. Lon- 
don Math. Soc. Mono. New Ser., V. 4. Clarendon 
Pr, 1989, x + 241 pp, $63. [ISBN: 0-19-853299-7] 
Levinson’s 1948 theorem identified asymptotic solu- 
tions (for large z) of a broad class of differential equa- 
tions of the form y’(z) = f(z)y(z). This monograph 
shows how many of the asymptotic results that have 
been obtained since 1948 can be deduced directly 
from the Levinson theorem by certain well-defined 
transformation techniques. LAS 


Partial Differential Equations, P. Recent Top- 
ics in Nonlinear PDE IV. Eds: Masayasu Mimura, 
Takaaki Nishida. Math. Stud., V. 160. North- 
Holland (US Distr: Elsevier Science), 1989, vii + 
243 pp, $97.50 (P). [ISBN: 0-444-88087-9] Twelve 
papers from the fifth meeting on nonlinear partial 
differential equations held at Kyoto University, Jan- 
uary 1988. MPR 


Partial Differential Equations, P. Scattering 
Theory, Revised Edition. Peter D. Lax, Ralph S. 
Phillips. Pure & Appl. Math., V. 26. Academic 
Pr, 1989, xii + 309 pp, $44.95. [ISBN: 0-12-440051- 
5] Changes from the First Edition (TR, Decem- 
ber 1967): Epilogue outlines new directions in the 
subject and results settled since the publication of 
the previous edition and includes a new list of refer- 
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Numerical Analysis, $(17-18), P, L. Finite Dif- 
ference Methods (Part 1), Solution of Equations in 
R” (Part 1). Eds: P.G. Ciarlet, J.L. Lions. Hand- 
book of Numerical Analysis, V. 1. North-Holland 
(US Distr: Elsevier Science), 1990, vii + 652 pp, 
$107.25. [ISBN: 0-444-70366-7] The first volume 
of a series entitled “Handbook of Numerical Anal- 
ysis” consisting of self-contained, accessible yet rig- 
orous expository articles. This volume contains three 
articles covering finite difference methods for linear 
parabolic equations, splitting and alternating direc- 
tion methods, and methods for solving least squares 
problems. SM 


Operator Theory, T, P. Positive Linear Sys- 
tems: The Method of Positive Operators. M.A. Kras- 
nosel’skij, Je. A. Lifshits, A.V. Sobolev. Sigma Ser. 
in Appl. Math., V. 5. Heldermann Verlag, 1989, 
vii + 354 pp, DM 88 (P). [ISBN: 3-88538-405-1] 
Methods for studying systems and problems involv- 
ing equations in spaces which are partially ordered 
by some cone. First chapter introduces basic notions 
and facts; the remaining three chapters deal with 
applications to spectral properties, iteration proce- 
dures, and other topics. Includes some exercises. LC 


Functional Analysis, P. Fourier Analysis on 
Groups. Walter Rudin. Classics Library. Wiley, 
1990, ix + 285 pp, $26.95. [ISBN: 0-471-52364-X] Pa- 
perback reproduction of Rudin’s 1962 classic. MLR 


Analysis, P. Analysis and Partial Differential 
Equations. Ed: Cora Sadosky. Lect. Notes in Pure 
& Appl. Math., V. 122. Marcel Dekker, 1990, xxvii 
+ 742 pp, $115 (P). [ISBN: 0-8247-8302-6] Collec- 
tion in honor of Mischa Cotlar’s seventy-fifth birth- 
day containing papers in harmonic and complex anal- 
ysis, functional analysis and operator theory, and 
partial differential equations. Also contains complete 
bibliography of Cotlar’s works, and two nice sum- 
maries of his contributions (pre- and post-1955). KS 


Algebraic Geometry, S(18), P. Lecture Notes in 
Mathematics-1400: Mized Motives and Algebraic K- 
Theory. Uwe Jannsen. Springer-Verlag, 1990, xiii 
+ 246 pp, $26.50 (P). [ISBN: 0-387-52260-3] Part 
I develops mixed motives for absolute Hodge cy- 
cles; Part II discusses algebraic cycles, K-theory, and 
extension classes; while Part III goes further into 
K-theory and cohomology. Appendices, bibliogra- 
phy. JS 

Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1414: Algebraic Geometry and Complez Anal- 
ysis. Ed: E. Ramirez de Arellano. Springer-Verlag, 
1989, 180 pp, $17.60 (P). [ISBN: 0-387-52175-5] 
Proceedings of an August 1987 workshop held in 
Patzcuaro, Mexico. GG 


Differential Geometry, P. Lecture Notes in 
Mathematics-1416: Géométrie Symplectique et Méca- 
nique. Ed: C. Albert. Springer-Verlag, 1990, 289 pp, 
$26.50 (P). [ISBN: 0-387-52191-7] Seventeen papers 
from a conference organized by the Universities of 
Avignon, Lyon, Marseille, and Montpellier held in 
May 1988. MPR 
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Topology, T(18), S, P. Topological Fields. Seth 
Warner. Math. Stud., V. 157. North-Holland (US 
Distr: Elsevier Science), 1989, xiii + 563 pp, $110.50. 
[ISBN: 0-444-87429-1] Book contains a comprehen- 
sive treatment of the fundamentals of topological 
field theory, but also goes far enough so that the 
reader is prepared to tackle current literature. In- 
cludes ample exercises, historical notes on each topic 
covered, and a comprehensive bibliography. KS 


Dynamical Systems, S(17-18), P*, L*. Chaotic 
Evolution and Strange Attractors. David Ruelle. 
Cambridge U Pr, 1989, xi + 96 pp, $12.95 (P); 
$39.50. (ISBN: 0-521-36830-8; 0-521-36272-5] Uses 
time series analysis to investigate chaotic behavior 
of deterministic dynamical systems. This book is a 
series of short chapters, making it an easy book to 
work through. No exercises. Intended for scientists 
in a wide range of fields, it assumes a relatively so- 
phisticated understanding of differential equations. 
Provides a concise introduction to strange attractors 
which is suitable for those familiar with nonchaotic 
dynamical systems. SM 


Dynamical Systems, T(17-18: 2). Nonlinear 
Differential Equations and Dynamical Systems. Fer- 
dinand Verhulst. Universitext. Springer-Verlag, 1990, 
ix + 277 pp, $22 (P). [ISBN: 0-387-50628-4] In- 
tended to bridge the gap between introductory grad- 
uate ordinary differential equations texts and re- 
search literature. Reasonably terse, but achieves its 
goal. Plenty of examples and exercises with solu- 
tions. JO 


Probability, S, P*, L**. The Empire of Chance: 
How Probability Changed Science and Everyday Life. 
Gerd Gigerenzer, et al. Cambridge University Pr, 
1989, xviii + 340 pp, $44.50. (ISBN: 0-521-33115-3] 
A narrative account of the reconciliation of Fortuna, 
the muse of unpredictability, with Scientia, the guid- 
ing hand of science. Three historical chapters are 
followed by three in-depth explorations of the evo- 
lution of probabilistic ideas in biology, physics, and 
psychology; two concluding chapters survey the role 
of numbers (hence chance) in the modern world (from 
baseball to medicine) and the implications of chance 
for inference, determinism, and other philosophical 
issues. Effective non-technical exposition with virtu- 
ally no formulas. LAS 


Stochastic Processes, T(17-18: 1, 2), S, P, L*. 
Stochastic Processes. J.L. Doob. Classics Library. 
Wiley, 1990, vii + 654 pp, $32.95 (P). [ISBN: 0-471- 
52369-0] Paperback reprint of 1953 original edition 
as part of Wiley’s series of reprints of its own classic 
publications. This text, the first systematic survey of 
stochastic processes, permits “no compromise” with 
the mathematics of probability. Covers Markov pro- 
cesses, martingales, and stationary processes. A con- 
cluding supplement provides a concise survey of the 
necessary background in measure theory. LAS 


Stochastic Processes, T(18: 1), P. Stochastic 
Integration and Differential Equations, A New Ap- 
proach. Philip Protter. Applic. of Math., V. 21. 
Springer-Verlag, 1990, x + 302 pp, $49. [ISBN: 0- 
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387-50996-8] An advanced text dealing with the in- 
tegration of stochastic processes. Assumes a prior 
knowledge of stochastic processes through the study 
of martingales. The novelty of this work lies in its 
definition of a semimartingale as a “good” integrator 
of stochastic processes. SM 


Stochastic Processes, T(17: 1), S, P. The El- 
ements of Stochastic Processes with Applications to 
the Natural Sciences. Norman T.J. Bailey. Clas- 
sics Library. Wiley, 1990, xi + 249 pp, $24.95 (P). 
[ISBN: 0-471-52368-2] Reprint of a text originally 
published in 1964, based on a set of lectures given by 
the author at John Hopkins Department of Biostatis- 
tics. Stochastic models are covered from the heuris- 
tic approach of applied mathematics, rather than 
through rigorous probability theory. Knowledge of 
probability, statistics, matrix algebra, complex vari- 
able theory, and differential equations is assumed. 
Covers generating functions, Markov chains, random 
walks, birth-and-death processes, queues, diffusion 
processes, and non-Markovian processes. Problems 
at the end of each chapter; not many applied prob- 
lems. JJ 


Elementary Statistics, T*(14-17: 1, 2), C. 
Practical Business Statistics with StatPad. Andrew 
F, Siegel. Richard D Irwin, 1990, xxx + 962 pp, 
$49.95. [ISBN: 0-256-07313-9] Designed primarily 
for MBA-level statistics courses. In addition to the 
usual topics, including multiple regression and non- 
parametrics, the book also has chapters on report 
writing, time series, and quality control. Contains 
many interesting real-world examples. Special fea- 
ture is the inclusion of a disk containing StatPad, sta- 
tistical software for use with the Lotus 1-2-3 spread- 
sheet, and exercises based on a data set which can be 
solved using StatPad or some other statistical pack- 
age. A 138-page appendix contains the StatPad ref- 
erence manual and tutorial. RSK 


Elementary Statistics, T(14: 1, 2). Introductory 
Statistics, Fifth Edition. Thomas H. Wonnacott, 
Ronald J. Wonnacott. Wiley, 1990, xvi + 711 pp, 
$44.95. (ISBN: 0-471-61518-8] Revision of the 1985 
Fourth Edition (First Edition, TR, October 1969; 
Second Edition, TR, August-September 1972; Third 
Edition, TR, March 1978). Includes new sections on 
graphs and common sense, expected value in bidding, 
the bootstrap, and robust estimation. Claimed ex- 
tensive use of Minitab is minimal. Each chapter con- 
cludes with a “Final Challenge,” a practical problem 
based on a real situation. RSK 


Elementary Statistics, T(13-16: 1, 2). Mod- 
ern Data Analysis: A First Course in Applied Statzs- 
tics. Lawrence C. Hamilton. Brooks/Cole, 1990, xv 
+ 684 pp, $43.25. [ISBN: 0-534-12846-7] Stan- 
dard introductory text on applied statistics for non- 
mathematics majors. Does not require calculus. Em- 
phasizes applications over theory, presenting many 
interesting data sets and graphical techniques. Cov- 
ers typical topics: graphing data, summary statis- 
tics, introduction to probability, random sampling, 
the normal distribution (binomial and Poisson, also), 
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hypothesis testing and confidence intervals on means 
and proportions (but not variances), chi-square 
goodness of fit type tests, two-sample tests, nonpara- 
metrics, and regression (simple and multiple). Un- 
fortunately, the text contains no computer instruc- 
tion on how to use a statistical package for graphing 
and data analysis. There is a companion text called 
Statistics with STATA for this. The text only con- 
tains a short section on designing experiments and 
analysis of variance. JJ 


Statistics, S*, L*. Statistics in Psychology: An 
Historical Perspective. Michael Cowles. Lawrence 
Erlbaum Associates, 1989, xvi + 218 pp, $36. [ISBN: 
0-8058-0031-X] Survey of the historical development 
of classical statistical techniques found in basic texts 
on statistics for psychology, including interesting dis- 
cussions of the personalities and controversies sur- 
rounding these developments. Should be of inter- 


est to anyone studying statistics. Good set of refer- 
ences. RSK 


Statistics, P, L. Handbook of Statistical Meth- 
ods for Engineers and Scientists. Ed: Harri- 
son M. Wadsworth, Jr. McGraw-Hill, 1990, x + 706 
pp, $76.50. [ISBN: 0-07-067674-7] Twenty sections 
by 25 authors beginning with basic statistics. Em- 
phasis on quality control; also includes survey sam- 
pling, Bayesian analysis, experimental design, regres- 
sion, simulation, robust and nonparametric methods, 
and time series. Good first reference for engineers. 
Includes section references. TH 


Statistics, S, P. Lecture Notes in Computer 
Sctence-420: Statistical and Sctentific Database 
Management. Ed: Z. Michalewicz. Springer-Verlag, 
1990, v + 256 pp, $24.70 (P). [ISBN: 0-387-52342-1] 
Contains 16 research papers presented at the Fifth 
International Conference on Statistical and Scientific 
Databases held at the University of North Carolina, 
April 3-5, 1990. The papers address a wide range 
of issues related to storing and managing large col- 
lections of numerical data, especially statistical data 
related to the fields of geography, economics, and the 
physical sciences. GMS 


Programming, T(14-18: 1), S. An Introduction 
to Functional Programming Through Lambda Calcu- 
lus. Greg Michaelson. Intern. Comput. Sci. Ser. 
Addison-Wesley, 1989, xv + 320 pp, (P). [ISBN: 0- 
201-17812-5] Begins with the origins of functional 
programming and the differences between it and im- 
perative programming. Investigates pure \-calculus 
before proceeding to boolean values, operations, 
numbers, conditionals, and functions, along with rep- 
resentations of each. Lists, trees, evaluation orders, 
and termination make up the second third. Program- 
ming in standard ML and Lisp come last. Exercises 
and answers; bibliography. RJA 


Programming, T(13-18), S. Practical Prolog. 
Richard J. Young. Van Nostrand Reinhold, 1989, 
xxi + 488 pp, $32.95 (P). [ISBN: 0-442-25934-4] A 
thorough, well-organized introduction to MProlog. 
Layout and explanations conducive to learning. No 
routine exercises. Three appendices of predicates. 
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Companion software available. RJA 


Programming, T(13). BASIC: An Introduction 
to Computer Programming, Fourth Edition. Robert 
J. Bent, George C. Sethares. Ser. in Comput. Sci. 
Brooks/Cole, 1990, xiv + 384 pp, $35.25 (P). (ISBN: 
0-534-12642-1] Revisions include expanded intro- 
ductory chapter on problem solving, a new section 
on menu-driven programs and table processing, in- 
creased emphasis on the application of string vari- 
ables, improved examples and exercises (First Edi- 
tion, TR, March 1979). LC 


Programming, T(15-18: 1), S. LISP, Lore, and 
Logic: An Algebraic View of LISP Programming, 
Foundations, and Applications. W. Richard Stark. 
Springer-Verlag, 1990, xviii + 278 pp, $38 (P). 
[ISBN: 0-387-97072-X] A combination of both the 
practical and the theoretical. Contains historical 
notes throughout. Syntax, programming style, and 
semantics of computation are followed by the alge- 
bra of interpreters, the lambda calculus, and the Lisp 
approach to artificial intelligence. Chapter exercises; 
answers to selected questions; bibliography; index. 
RJA 


Computer Systems, P, L. A Gentle Introduction 
to TX: A Manual for Self-study. Michael Doob. 
TpXniques, No. 12. TeXUsers Group (POB 9506, 
Providence, RI 02940), 1990, iv + 88 pp, $15 (P). 
As advertised—a concise, careful introduction for 
beginners, far less intimidating (and far less com- 
plete) than Knuth’s TXbook, to which page refer- 
ences occur throughout the margins of this primer. 
Characters, shapes, mathematics, alignment, errors, 
boxes—just enough to do simple things. If you are 
just beginning to use TpX, read this first. LAS 


Theory of Computation, S, P. First-Order Logic 
and Automated Theorem Proving. Melvin Fitting. 
Texts & Monographs in Computer Science. Springer- 
Verlag, 1990, xv + 242 pp, $35. (ISBN: 0-387-97233- 
1] Text for researchers in computer science who are 
interested in the applications of formal logic to the 
topic of automated theorem proving. It covers work 
on proof methods, unification, and resolution so im- 
portant to the area of logic-based programming lan- 
guages. It also discusses other topics including equal- 
ity, semantic tableaux, and heuristics. GMS 


Artificial Intelligence, T(16), S. Object-Oriented 
Simulation with Hierarchical, Modular Models: In- 
telligent Agents and Endomorphic Systems. Bernard 
P. Zeigler. Academic Pr, 1990, xvii + 395 pp, 
$49.95. [ISBN: 0-12-778452-7] Describes a simu- 
lation system called DEVS (Discrete Event Simula- 
tion System). The system, written in a language 
called Scheme, is intended for use in simulating and 
studying intelligent systems in a realistic manner. It 
describes the system as well as the object-oriented 
model upon which it is based. It includes numerous 
examples and case studies. GMS 

Artificial Intelligence, P. The Acgutsttion of 
Strategic Knowledge. Thomas R. Gruber. Perspect. 
in Artif. Intellig., V. 4. Academic Pr, 1989, xxiv 
+ 311 pp, $39.95. [ISBN: 0-12-304754-4] Describes 
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a computer-assisted approach to obtaining human 
knowledge and formulating it for use in knowledge- 
based systems. The ASK program is described and 
details of its use presented. Includes techniques for 
knowledge elicitation, machine learning, knowledge 
engineering. Contains a chapter on evaluation of 
this second-generation artificial intelligence research. 
Bibliography. RJA 


Artificial Intelligence, P. Ezemplar-Based Knowl- 
edge Acquisition: A Untfied Approach to Concept 
Representation, Classification, and Learning. Ray 
Bareiss. Perspect. in Artific. Intellig., V. 2. Aca- 
demic Pr, 1989, xii + 169 pp, $24.95. [ISBN: 0-12- 
078260-X] Describes the Protos knowledge acquisi- 
tion system, a learning apprentice that forms cate- 
gories of common experiences and classifies new ex- 
periences into them. Overview of basic issues and 
approaches to them. Design and evaluation of Pro- 
tos. Ends with related research and future work. 
Appendices; glossaries; bibliography. RJA 


Computer Science, S(16-18), P. Predicate Cal- 
culus and Program Semantics. Edsger W. Dijkstra, 
Carel S. Scholten. Texts & Mono. in Comput. Sci. 
Springer-Verlag, 1990, xi + 220 pp, $39. (ISBN: 
0-387-96957-8] A predicate transformer is a func- 
tion from boolean structures to boolean structures. 
Main topics include the calculus of boolean struc- 
tures, properties of predicate transformers, seman- 
tics of repetitions, and use of predicate transformers 
to define programming language semantics. RJA 


Computer Science, T(14-18: 1, 2), L. The Logi- 
cal Basis for Computer Programming. Zohar Manna, 
Richard Waldinger. Addison-Wesley. Volume 1: De- 
ductive Reasoning, 1985, xix + 618 pp, $25.56 (ISBN: 
0-201-18260-2]; Volume 2: Deductive Systems, 1990, 
xiii + 642 pp, $27.16. [ISBN: 0-201-18261-0] Vol- 
ume 1 begins with an introduction to mathematical 
logic, both propositional and predicate; treats the- 
ories with induction and includes axiomatic presen- 
tations of the main structures in computer science 
(strings, lists, trees, sets, etc.). Volume 2 is more 
formal in its development and treats four main top- 
ics: well-founded induction, propositional logic, uni- 
fication, and predicate logic. A deductive-tableau 
system is given for proving the validity of sentences. 
Chapter problems (solutions can be ordered). List 
of related textbooks and selected references. Symbol 
and general index. RJA 


Computer Science, P. Object-Oriented Concepts, 
Databases, and Applications. Eds: Won Kim, Fred- 
erick H. Lochovsky. Addison-Wesley, 1989, xv + 
602 pp. [ISBN: 0-201-14410-7] Divided into five ar- 
eas: object-oriented concepts embedded in object- 
oriented programming languages and semantic data 
models; applications; database systems; architec- 
tural issues; directions for future research and de- 
velopment. Index. RJA 


Applications, T(16-17: 2). Mathematical Meth- 
ods for Sctentists and Engineers: Linear and Non- 
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linear Systems. Peter B. Kahn. Wiley, 1990, xix 
+ 469 pp, $54.95. [ISBN: 0-471-62305-9] Empha- 
sizes perturbation techniques and asymptotic meth- 
ods. Nonlinear problems and associated models in 
diverse fields. Long time in production. Classroom 
tested; informal and readable, in repetitive style, 
with the same examples recurring throughout. Pre- 
requisites include calculus through differential equa- 
tions, some mechanics, some linear algebra, but no 
complex variables. Develops tools of analysis for 
treatment of problems without closed-form solutions. 
For advanced undergraduates or beginning graduate 
students in physics, chemistry, engineering, and bi- 
ology. Numerical and graphical solutions. Software 
packages from Lotus 1-2-3 and Math CAD. Modest 
number of exercises, with answers where appropriate, 
throughout the text. JDEK 


Applications (Economics), T(15-17: 1), L. Dif- 
ferential Equations, Stability and Chaos in Dynamic 
Economics. W.A. Brock, A.G. Malliaris. Adv. Text- 
books in Econ., V. 27. North-Holland (US Distr: El- 
sevier Science), 1989, xvi + 389 pp, $49. [ISBN: 
0-444-70500-7] A text designed to take advanced 
students of economics through an elementary study 
of differential equations. The first half covers the 
usual topics: existence, uniqueness, dependence on 
initial conditions, linear systems, linearization and 
Lyapunov analysis of critical points. The second 
half covers applications in several areas of economic 
theory, including micro- and macro-economic poli- 
cies and investment theory. The last chapter is an 
introduction to chaos in economic systems. Exten- 
sive bibliography. Exercises at the-end of each chap- 
ter. SM 

Applications (Physics), S**(13-16), P*, L**. 
The New Ambideztrous Universe: Symmetry and 
Asymmetry from Mirror Reflections to Superstrings, 
Third Revised Edition. Martin Gardner. WH Free- 
man, 1990, xiv + 392 pp, $19.95. [ISBN: 0-7167- 
2092-2] Third revision of Gardner’s classic 1964 
nontechnical exposition of the fall of parity. The 1979 
Second Edition (TR, May 1979) added five chap- 
ters on the violation of time-invariance symmetry. 
This edition, in addition to minor changes through- 
out, adds five more chapters on twistors, monopoles, 
spin, and superstrings. Absorbing; enlightening; lu- 
cid; witty; inventive. An exemplar of science writing 
at its very best. LAS 


Reviewers 


RJA: Richard J. Allen, St. Olaf; SB: Steve Benson, St. Olaf; 
JNC: Judith N. Cederberg, St. Olaf; LC: Laura Chihara, 
St. Olaf; SG: Steven Galovich, Carleton; GG: George Gilbert, 
St. Olaf; TH: Timothy Hesterberg, St. Olaf; JJ: Jason Jones, 
St. Olaf; RSK: Richard S. Kleber, St. Olaf; JDEK: Joseph 
D.E. Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; 
SM: Steve McKelvey, St. Olaf; JO: Jeff Ondich, St. Olaf; 
MLR: Margaret L. Reese, St. Olaf; MPR: Matthew P. Richey, 
St. Olaf; AWR: A. Wayne Roberts, Macalester; KS: Karen 
Saxe, St. Olaf; GMS: G. Michael Schneider, Macalester; JS: 
John Schue, Macalester; KES: Kay E. Smith, St. Olaf; LAS: 
Lynn Arthur Steen, St. Olaf; PZ: Paul Zorn, St. Olaf. 


Addisor-Wesley 
gives your students 


Differential Equations: 

A Modeling Approach 

Frank R. Giordano and Maurice D. 
Weir, U.S. Military Academy and 
U.S. Naval Postgraduate School, 
respectively 

Now your students can clearly see 
how differential equations apply to 
realistic problems. This new text by 


Giordano and Weir focuses on mathe- 
matical modeling, showing students a 


natural progression of related ideas, 
and offering motivation to take the 
next step along the way. Unlike other 
texts that use a “solutions technique 
approach, Differential Equations: A 
Modeling Approach puts into per- 
spective how the various solutions 
techniques students learn actually fit 
together — the “why” as well as the 
“how-to.” 


Discrete Algorithmic 
Mathematics 

Stephen B. Maurer and Anthony 
Ralston, Swarthmore College and 
SUNY-Buffalo, respectively 


Turn to Maurer and Ralston for 
discrete mathematics with a basic 
difference: a unique algorithmic 
approach. The authors present dis- 
crete mathematics as a unified topic, 
by viewing it through an algorithmic 
“prism.” A wide range of applica- 
tions, flexible coverage of core 
discrete math topics, and extensive 
coverage of linear algebra also add to 
this text's substantial value. 


A 
Vv 


Addison-Wesley 
1 Jacob Way ¢ Reading, MA ¢ 01867 
(617) 944-3700 


From the publisher that's serious about math 


nn 


the best 
Of mnath 


Vv 
ALSO NEW FOR ‘91! 


Developmental 
Mathematics 


Basic Mathematics, Sixth Edition 
Introductory Algebra, Sixth Edition 
intermediate Algebra, Sixth Edition 
Mervin L. Keedy and Marvin L. Bittinger, 
Purdue University and Indiana University- 
Purdue University at Indianapolis, 
respectively 


Concepts and Applications of 
intermediate Algebra, Second Edition 
Marvin L. Bittinger, Mervin L. Keedy, and 
David Ellenbogen, /ndiana University- 
Purdue University at Indianapolis, Purdue 
University, and St. Michael's College, 
respectively 


Elementary Algebra 
Intermediate Algebra 

Mark Dugopolski, Southeastern 
Louisiana University 


Statistics 


Introductory Statistics, Third Edition 
Neil A. Weiss and Matthew Hassett, 
both of.Arizona State University 


Statistics Methods for Business 
and Economics, Fourth Editidn 
Donald L. Harnett and Ashok Soni, 
both of Indiana University 


Vv 
OTHER TITLES OF INTEREST 


Precalculus: Functions and Graphs 
A. Robert Marshall, Cuesta College 


Mathematics for Elementary 

School Teachers: A Problem Solving 
Approach 

Joseph Newmark, CUNY College of 
Staten Island 

Calculus: A Complete Course 

Robert A. Adams, University of British 
Columbia 


The Power to 
Project Your Ideas 


jo ogegw ocenentafsasnatectecnssnt econ ened 


Since inventing the graphic calculator 
in 1985, Casio has created an entire family 
of exciting classroom products. Five 
different graphic calculator models are 
available, each offering hundreds of 
powerful, built-in functions. 

With our printers, you can create 
oH-7000G_ = “hard copy” of graphs and calculations 
for photocopying onto test papers or transparencies. 
The FA-80 Data Interface lets you store unlimited 
data on audio cassettes. 


Store 
unlimited data 
on audio cassette. 


Create 
printouts 


Pre-load students’ 
calculators with 
lesson data. 


The possibilities are endless. That’s why the NCTM | 
curriculum calls for the use of graphic calculators. Man The OH-7000G lets the instructor 
. 8 P . y lead the c tone . ’ 
new textbooks are being published especially for use cad the Class via d standard 
th this technology overpnedd projector, 
wi , 


So, take a bold, new step and enter Casio’s “Electronic 


Classroom” — a place where the solution is always clear. Cc a S : Cc 


Creators of tlk 
Graphic Caltculatc 


570 Mt. Pleasant Avenue, Dover, New Jersey 07 


For more information about Casio’s family of graphic calculators, and to place an order, 
call one of the educational distributors listed below. 


ABM 
800-234-3396 
(In TX 214-869-0721) 
(In MO 816-861-1113) 
(In CA 213-946-6680) 


Allied Communications 
800-843-3660 
(In PA 800-843-3660) 


Allied National 
800-999-8099 
(In MI 313-543-1232) 


Calculators Inc. 
800-533-9921 
(in MN 800-533-9921) 


Carolina Wholesale 
800-521-4600 
(In NC 704-598-8101) 


Colborn School Supply 
800-548-7031 
(In CO 303-778-1220) 
(In MT 406-245-3158) 


Copco 
800-446-7021 
(In OH 800-589-3006) 


Dale Seymour Publications 
800-872-1100 
(In CA 800-222-0766) 


Douglas Stewart 
800-356-8370 
(In WI 608-221-1155) 


F.ALL. 
800-272-0272 
(In NJ 201-891-9466) 


El Dorado Trading 
800-227-8292 
(In CA 415-494-1995) 


Entertainment Marketing 
800-347-2001 
(In TX 713-995-4433) 


Global Products 
800-633-0633 
(In IL 708-397-4944) 


Longino Distributors 
800-633-6224 
(In NC 704-873-3282) 


NASCO 
800-558-9595 
(In WI 414-563-2446) 


When was the last 
time a computer 


program helped 
you think about 
mathematics? 


Call or write for our latest software catalog. 
For PC or Macintosh. 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


Nemesis Distributing 
800-940-7407 
(In FL 305-477-8822) 


Servco Pacific 
(In HI 808-841-7566) 
‘Tam’s 
800-421-5188 
(In CA 800-244-5624) 


Taylor Electric 
000-558-6970 
(In WI 800-242-8940) 


Techline 
800-777-3635 
(In VA 703-389-0857) 


Valley Business Machines 
(In UT 801-969-6303) 


Vistatech 
800-847-985 1 
(in NY 212-254-985 1) 
(In CA 213-602-0277) 


CASIO. 


New From 
Martin 
Gardner 


MATHEMATICAL MAGIC SHOW 
Martin Gardner 


Martin Gardner published his first book in 
1935. Since then he has charmed, puzzled, 
and delighted countless readers in his more 
than 409 books, among them a novel. Here 
is what reviewers have said about this book. 


Highly recommended, but be warned— 
mathematical games can be addictive. 


David Jones in New Scientist 


Mathematical Magic Show begins with 
a chapter on nothing, and finishes with 
a chapter on everything. In between 
we visit most of the prime sites of 
recreational mathematics—game_the- 
ory, factorial...puzzles, playing cards, 
finger arithmetic, Mébias bands, poly- 
ominoes, perfect numbers, the knight's 
tour, trees, and dice. Gardner always 
has new facts and ideas to add interest 
to even the most well-trodden areas. 
For example, he extends his discussion 
of the knight's tour to bring in the cook’s 
tour (a cook travels three squares for- 
ward and then one square right or left) 
and then goes on to include camels, 
asps, and giraffes. The chapter on dice 
has some very useful hints on cheating 
at craps, and how not to get caught. 


Harvey Mellar in Times Literary Supplement 


Over the 30 or so years that (Mar- 
tin Gardner's) column ran in Scientific 
American, he built up an enthusiastic 
readership which included practically 
every professional mathematician that 
| have met, and no doubt the countless 
many | have not. There must be a great 
many professionals whose own inter- 
est in the subject was awakened by his 
rare gift of being able to put across to 
the outsider the deepest developments 
in an extremely difficult and abstract 
subject, and to convey the enjoyment 
and excitement that lies within mathe- 
matics proper. 


Keith Devlin in The Guardian 

312 pp., Paper, 1990, ISBN 0-88385-445-1 
List: $17.50 MAA Member: $14.50 
Catalog Number MAGIC 


ORDER FROM 


[> The Mathematical Association 
of America 


1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


NEW IN PAPERBACK FROM PRINCETON 


Georg Cantor 
His Mathematics and Philosophy 
of the Infinite 


Joseph Warren Dauben 


One of the greatest revolutions in mathematics 
occurred when Georg Cantor (1845-1918) promul- 
gated his theory of transfinite sets. This revolution 
is the subject of Joseph Dauben’s important 
study—the most thorough yet written—of the 
philosopher and mathematician who was once 


called a “corrupter of youth” for an innovation that 
is now a vital component of elementary school 
curricula. 

“Historians of mathematics can only be grate- 
ful for the effort Professor Dauben has expended 
to create the synthesis of Cantor scholarship 
found in this book. But the book can, and I hope 
will, be read with profit by a far more extensive 
audience.” 


Mathematics and 


Plausible Reasoning 


Volume I: Induction and Analogy 
in Mathematics 

Volume II: Patterns of Plausible 
Inference 


G. Polya 


Here the author of How to Solve It explains how to 
become a “good guesser.” Marked by G. Polya’s 
simple, i and use of clever examples 
from a wide range of human activities, this 
two-volume work explores techniques of guessing, 
inductive reasoning, reasoning by analogy, and the 
role they play in the most rigorous of deductive 
disciplines. 

Volume I: $12.95 Paper ISBN 0-691-02509-6 
Volume II: $12.95 Paper ISBN 0-691-02510-X 


—Thomas Hawkins, Historia Mathematica 


Now in paper: $14.95 ISBN 0-691-02447-2 AT YOUR BOOKSTORE OR 


Princeton University Press 


41 WILLIAM ST. ¢ PRINCETON, NJ 08540 © (609) 258-4900 
ORDERS: 800-PRS-ISBN (777-4726) 


Being first is almost second nature at McGraw-Hill. Whether your students are enrolled in developmental | 
math, precalculus, or calculus, McGraw-Hill's texts will inspire your students to learn. | 


NEW FOR 1991 


DISCOVERING CALCULUS WITH THE HP-28 AND 


The Streeter Seri THE HP-48sx 
BASIC CAL Smith/ Minton 
OEOMETRY ee WITH DISCRETE MATH AND ITS APPLICATIONS, 2 
Streeter /Hutchison /Hoelzle Rosen 
BEGINNING ALGEBRA, Form B NOMS ee 
Streeter /Hutchison/Hoelzle Simmons ’ 
INTERMEDIATE ALGEBRA, Form B 
Streeter ‘Hutchison /Hoelzle INTRODUCTION TO PARTIAL DIFFERENTIAL 
COLLEGE ALGEBRA WITH TRIGONOMETRY | Pinsky 
Rees /Sparks/Rees FUNCTIONAL ANALYSIS, 2/e 
PRECALCULUS: A Bridge to Calculus Rudin 
Runyan/Runyan MATRIX THEORY 
EXPLORATIONS IN CALCULUS WITH A Goldberg 
COMPUTER ALGEBRA SYSTEM BRIDGE TO ABSTRACT MATHEMATICS, 2/e 
Small /Hosack Morash 


For more information, please contact your local McGraw-Hill 
representative or write: McGraw-Hill College Division, Comp Processing 
and Control, P.O. Box 448, Hightstown, NJ 08520. 


Back-To-Basics INSTRUCTIONAL Books 


PERSONAL MATHEMATICS AND COMPUTING 
Tools for the Liberal Arts 

Frank Wattenberg 

This book is a self-contained tutorial that can assist any reader with a background in 
high school mathematics in learning how to apply personal computing to enhance his 
or her understanding of modern quantitative methods. The New Liberal Arts 

492 pp. $29.95 (December) 


Original in Paperback 
INVESTIGATIONS IN ALGEBRA APPROACHING PRECALCULUS 
Albert A. Cuoco — | Matnematics Discretey 
This erudite and witty book will be an Explorations in a Computer Environment 
invaluable resource for high-school math Philip G. Lewis 
teachers and their students. It provides —_ tawis demonstrates how concepts of 
an in-depth opportunity for doing discrete mathematics can be introduced 
mathematics, not just learning about through the computer using the Logo™ 


Ne nour ence J. Davidson, Educator, | language, making them easier to teach 
author of Apple Logo Reference Manual — and more fun to learn. 


350 pp. $45.00 cloth, $29.95 paper 557 pp. $35.00 cloth, $19.95 paper 
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US Poste! Service 
STATEMENT OF OWNERSHIP, MANAGEMENT AND CIRCULATION 
ired by 39 U.S.C. 3685) 


1A Tate of Publication 
THE AMERICAN MATHEMATICAL MONTHLY lofololzis|s|slo| 
requency of jesus 7 y rs 3 5 
monthly except bi-monthly June/July and Aug/Sept Library $120 
Indiv. Member $30 


Studies in Numerical 
Analysis, 
Gene H. Golub, Editor 


MAA Studies in 
Mathematics #24 


415 pp. Cloth, 1985, 
ISBN-0-88385- 126-1 
List: $46.50 

MAA Member: $34.50 


This volume is a collection of papers describing 
the wide range of research activity in numerical | 
analysis. The articles describe solutions to a 
variety of problems using many different kinds 

of computational tools. Some of the computa- 
tions require nothing more than a hand-held 
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Random Intervals 


Joyce Justicz, EDWARD R. SCHEINERMAN, AND PETER M. WINKLER 
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Abstract. Fix a large number n and let {x,, y;,...,X,,> y,} be 2m points chosen independently from 
some fixed continuous probability distribution on the real line. Each pair {x;, y,} determines a random 
interval [min(x,, y;), max(x;,, y;)]. We examine the structure of the resulting family of intervals, and in 
particular answer the following questions: how large a subcollection of pairwise disjoint intervals can 
one expect to find? And, what is the probability that there is an interval in the family which intersects 
all the others? 


Prelude. Before beginning, we invite the reader to test his intuition on the 
following problem (it won’t be any worse than ours was!). The numbers from 1 to 
2n, with n large, are paired at random, each pair being regarded as the endpoints 
of a real interval. What is the probability that among these n intervals there is one 
which meets all the others? 


1. Introduction: Random Intervals. In studying any type of combinatorial struc- 
ture it is useful, and sometimes quite illuminating, to have models for “random” 
structures of that type. The most famous example is the random graph model of 
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Erdés and Rényi [2] in which edges are chosen independently with fixed probabil- 
ity. Random graph theory now forms a substantial subject of study in itself. 

The combinatorial structures with which we concern ourselves here are those 
based on intersection properties of families of intervals. The most obvious defini- 
tion of a “random interval” is the interval between two random numbers x and y; 
for conveniencé we denote that interval by [x, y] even when y <x, so that by 
definition [x, y] = Ly, x]. | 

We create a family F of random intervals in the following way: fix a number n 
and a continuous probability distribution on the real line. Choose values 
X1,Vi>+++»X,, y, independently and let F consist of intervals J,,..., 7, where 
I; = [x py J. 

Since intersection properties of F depend only on the order of the points 
X1,---,Y,» it is clear that the model is insensitive to the choice of distribution. In 
fact, an equivalent discrete model is obtained by choosing at random one of the 
(2n)! assignments of x,,...,y, to the integers from 1 to 2n. In the continuous 
model the most convenient distributton seems to be the uniform distribution on 
the unit interval [0, 1]; in that case we may think of the intervals as having arisen 
from random points (x,, y;) in the unit square. 

To any family F of intervals we may associate a graph G = (V, E) as follows: 
the vertices of G correspond to the intervals of F, and two vertices constitute an 
edge of G just when their corresponding intervals have non-null intersection. 
Graphs arising in this fashion are called interval graphs; when F is a family of 
random intervals as described above, they are called random interval graphs [8]. 


Fic. la. Eight random points and their intervals. 
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Fic. 1b. Random interval graph and interval order. 


We preserve somewhat more of the properties of a family F of intervals by 
assigning to F a partially ordered set P; here the elements of P correspond to the 
intervals of F with x < y just when the interval corresponding to x lies entirely to 
the left of the interval corresponding to y. Partially ordered sets arising in this 
fashion are called interval orders. Note that G may be obtained from P by 
declaring {x, y} to be an edge of G whenever x and y are incomparable in P. 

For further information about both interval graphs and interval orders, we refer 
the reader to Fishburn’s excellent book [4]. 

FiGurE la depicts 8 random intervals (arising from 8 points chosen in the unit 
square) and FiGurE 1b depicts the corresponding interval graph and interval 
order. 


2. Chains and Antichains.... or cliques and independent sets. 

Given a collection of n random intervals, what is the size of a largest family of 
pairwise intersecting intervals? ...or pairwise disjoint intervals? In poset language 
we are asking for the sizes of the largest antichain (pairwise incomparable 
elements) and of the largest chain (pairwise comparable, and therefore totally 
ordered, elements). In graph language we seek the sizes of the largest clique 
(pairwise adjacent) and the largest independent set (pairwise nonadjacent). 

The first question (pairwise intersecting) is easier to answer. 

Intervals of reals satisfy the “Helly property,” that is, every collection of 
pairwise-intersecting intervals has a nonempty intersection. Thus, when endpoints 
are uniformly distributed on [0, 1], it is enough to find the point x € [0, 1] which is 
contained in the maximum number of intervals in the collection. The probability 
that an interval contains x is 1 —x* — (1 —x)* = 2x — 2x*. This is maximized 
when x = 1/2, and the expected number of intervals which contain 1/2 is n/2. 
With a bit more work (the interested reader can consult [8]) one can obtain the 
following result: 


THEOREM 1. Let the random variable A,, denote the size of a largest set of pairwise 
intersecting intervals in a family of n random intervals. There exists a function f(n) 
such that 
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and n n 
lim Pr{ = -f(n) <A, <5 + F(n)| = 1. 


We turn now to the second question: What is the largest collection of pairwise 
non-intersecting intervals (longest chain) in a family of n random intervals? It 
turns out that this question is closely related to the following well-known problem, 
due to Stanislaw Ulam: What is the length of the longest increasing subsequence 
of a random permutation of 1,...,n? 

Ulam’s problem, posed in [10], was tackled by Hammersley in a famous paper 
[6] in which the answer was shown to be asymptotic to c¥n for some constant c. 
Later, a combination of the efforts of Schensted [9], Logan and Shepp [7] and 
Versik and Kerov [11] determined that the constant is, in fact, 2. 

Ulam’s “random permutation” may be constructed by choosing nm random 
points from the uniform distribution in the unit square, and numbering the points 
from left to right (i.e., according to their x-coordinates). The permutation arises in 
reading the label-numbers from bottom to top. 

The points determine a partially ordered set Q under the product ordering, 
where (x;, ¥,) < 9 (x;, y;) just when x; <x, and y, < y,. An increasing subsequence 
of the permutation corresponds exactly to a chain in Q; geometrically, a subset of 
the points extending from southwest to northeast in the square. Let us denote by 
X,, the random variable whose value is the size of the largest such subset. 

As we have seen, the same set of points in the unit square determines a family 
of random intervals, but in the interval order P, we have the stronger condition 
(x;, y;) <p (x; y,) iff max(x;, y,) < min(x,, y,). Let us denote by Y, the size of a 
largest chain in P; it is the asymptotic behavior of Y, which we wish to determine. 
Since every chain of P is a chain of Q, we have Y, < X,. 

For example, consider the point e in FiGureE 1a. The points greater than e in Q 
are those to the northeast (namely, b, c, d and h). We can also describe geometri- 
cally the points which lie above e in P. Draw a vertical and horizontal line through 
the point e. Also, draw a vertical and horizontal line through e’s reflection across 
the positively sloped diagonal of the square. These four lines divide the unit square 
into 9 rectangles. (See FicurE la.) The upper right-hand square contains the 
points which correspond to intervals to the right of e (namely, d and h). 

Hammersley [6] used the method of subadditivity to prove that X,/ vn ap- 
proaches a constant in probability, and in fact a nearly identical argument can be 
made to achieve the same result for Y,/ vn. (See, for example, Bolobds and 
Winkler [1] where Hammersley’s results are extended to higher dimension.) 

In the case of Ulam’s problem the actual value of c was obtained only with 
great difficulty, and in fact the values of the constants in higher dimensions remain 
unknown. Luckily, a special feature of the interval case allows us both to prove the 
existence of the constant and determine its value with relative ease. The difference 
is that in our interval problem a maximum-length chain can be built in “greedy” 
fashion from the bottom up, whereas the equivalent process in Ulam’s problem 
fails by a constant factor, in the limit, to attain maximal length. 


THEOREM 2. Let Y, denote the maximum number of pairwise disjoint intervals in a 
family of n random intervals. Then 
_ Xn 2 
lim 


n-o Vn 


Vir 


in probability. 
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This means for every ¢ > 0, there exists an n, so that for all n > no, 
Y, 2 
Pr 


vn Vor 

Proof. Let us establish a Poisson process on the plane of density 1. We select an 
infinite chain C = {(u,,v,), (U5, v>),...} of points of the process in the following 
manner: (u,, U,) is the point in the positive quadrant which minimizes max(u,, v,), 
and thereafter, (u,, v,) is the point above (u,_,, U,_,) which minimizes max(u,, U,). 
In terms of intervals, C represents the chain built from the bottom by always 
selecting the interval with least possible upper endpoint; it is easily seen by 
induction that in any finite collection of intervals such a chain has maximum 
possible length. 

Now, for any positive real s, the region {(x, y):0 <x, y < s} =[0,s]* is with 
probability exactly exp{—s*} unoccupied by a point of the Poisson process. It 
follows that if S is the random variable whose value is max(x,, y,), then the mass 
density of S is given by the function 


<e}> 1-8 


d 2 2 
f(s) = 7 —e*) =2se~° 


whose expected value is 
oe la 
/ 2s2e-* ds =| t'/2e-' dt = 1 (3/2) = —. 
0 0 2 


The differences 
max(u,,U,) — 0,max(u,,v,) — max(u,,v,),max(u3,v3) — max(u,,v>),... 


are independent and identically distributed with mean yz /2. It follows from the 
law of large numbers that for any « > 0, 


vr max(x,,, y,,) T 
(1 ss < - <Q +e)> 
with probability at least 1 — «, for every sufficiently large m. 

Now let r(n) be the least r such that [0,r]* contains exactly n points of the 
Poisson process; these points then determine a family of n random intervals, as 
described above, and we may therefore identify Y, with the largest m such that 
(U,,>U,,) lies in the square [0, r(m)]*. Since the Poisson process has density 1, we 
will have 


(1 —e)vVm<u(n) <(1+e)vn 
with probability at least 1 — «, for sufficiently large n. 

Let m, =|(1 — eX2/ Vr)vn} and m, =|(1 + eX2/ Vir)vn |; then for large 
enough n, we have that (u,,, , U,,,) will lie inside the square [0, r(n)]° and (u,,, Un,) 
outside the square, with probability at least 1 — ¢. We conclude that 


(Qe) <2 c(+e)t 
— €) = < - < + €) = 

Vr = vn Vr 
with probability at least 1 — ¢, proving the theorem. 


Note: The Poisson process enables us to place the random variables Y,, all in the 
same sample space; thus, with the help of the strong law of large numbers, we may 
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obtain the slightly stronger result that 
Yn 2 
noe Un Vee 
with probability 1. 


3. A Matter of Degree. In a graph, the degree of a vertex x, denoted d(x), is 
the number of edges incident with x. The minimum and maximum degrees of a 
graph are denoted by 6 and A respectively. What can we say about the minimum 
and maximum degree of a random interval graph? 

The minimum degree question is answered in [8]. We simply repeat the result 
here: 


THEOREM 3. If 6, denotes the minimum degree of a vertex in the interval graph 
generated by n random intervals, then for fixed k > 0, 


lim Pr(6, <kvn ) = 1 — exp{ —k?/2}. 


It follows that the average minimum degree approaches yn7/2. 

The maximum degree question is more interesting, partly on account of the role 
played by a point of degree n — 1. Recall that the diameter of a graph is the 
maximum over all pairs (x, y) of vertices of the least number of edges in a path 
from x to y. It is easy to show that with probability approaching 1, the diameter of 
our random interval graph is 2 or 3; and, for an interval graph G, diam(G) < 2 iff 
there is a vertex adjacent to all others. In fact, the following are equivalent for any 
collection of n intervals: 


. the interval graph G has a vertex of degree n — 1; 

G has diameter at most 2; 

. the interval order P has an isolated point; 

. there is an interval in the family which meets all the others; and 

. there is an interval in the family which meets both [u,v] and [ p, q], where 
[u,v] is the interval with leftmost right endpoint and [ p, q] the interval with 
rightmost left endpoint. 


NR WN 


We are thus moved to ask: What is the limiting probability that in a family of 
random intervals, there is an interval which intersects all the others? 

One normally expects such limits to be either 0 or 1, and in fact in many classes 
of structures (such as the random graphs of Erdés and Rényi) there is a “0-1 Law” 
—in that case proved by Fagin [3]—which guarantees that first-order statements, 
such as “there is a vertex which is adjacent to all others,” must have trivial limiting 
probability. Exceptions to this sort of behavior are sometimes quite startling, as in 
the famous “probleme de rencontres” in which the probability that a random 
permutation has no fixed point approaches 1 /e. 

Here we shall obtain an answer which not only ruins a possible 0-1 Law for 
random interval graphs, but adds insult to injury in a surprising way. 

Let us consider first the Poisson model, except that this time for convenience, 
let the process be of density n on [0,1]*. Let S be the minimum, over all points 
(x, y) of the process, of max{x, y}; thus S = max{u, v} where [u, v] is the interval 
with leftmost right endpoint mentioned above. Similarly let R be the position of 
the rightmost left endpoint. Then the “big interval,” i.e., the interval which 
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intersects all others, will exist just when there is a point of the process in the union 
of the rectangles [0, S] x [R,1] and [R, 1] x [0, X]. This will occur with limiting 
probability 


: —2nxy _ —nx? _ —ny? —n(x*+y7) —_— _ 
lim i i e (l-—e e +e ) dx dy 3: 


It was when the authors asked themselves how fast the probability converged to 
2/3 that the startling truth emerged: 


THEOREM 4. For all n > 1, the probability that in a collection of n random 
intervals there is one which intersects all the others is exactly 2/3. 


Proof. One can show, using the uniform model on [0,1]’, that the above 
probability is 


1—4n(n - Df ft — x? —y?— 2xy)” * — 1 deay, 


which a patient reader will reduce to the constant 2/3. Fortunately, there is a 
relatively painless combinatorial proof, given below, which is both more intuitive 
and more powerful. 

In this proof we employ the “discrete” model, in which integers between 1 and 
2n are paired at random. Once the intervals have been selected, we label the 
endpoints A(1), B(),..., A(n — 2), B(n — 2) recursively as follows: 

Refer to the endpoints {1,..., m} as the left side, and {n + 1,...,2n} as the right 
side. Let A(1) =n and let B(1) be its mate. Suppose we have assigned through 
A(j), B(j). We attach the labels A(j + 1) and B(j + 1) by the following rules: 


- If B(j) is on the left side: Let A(j + 1) be the leftmost point on the right side 
which has not yet been labeled. Let B(j + 1) be its mate. 

- If B(j) is on the right side: Let A(j + 1) be the rightmost point on the left 
side which has not yet been labeled. Let B(j + 1) be its mate. 


If A(j) < B(j) we say that this interval “went to the right”; otherwise, it “went to 
the left”. Note that we are labeling endpoints of intervals from the center 
outwards, starting from the left when the last interval went to the right, and 
vice-versa. Endpoints marked A(-) are called inner points and those marked B(-) 
are called outer. (See FiGureE 2.) 


Left Side Right Side 
; 
i 


; é fe) d 


A(6) A(3) AG) AQ) AQ) * BQ) BO) AQ) 


B(6)  B(3) 


Fic. 2. Labeling intervals. 
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It is easy to prove by induction that immediately after the labels A(j) and B(j) 
have been assigned either: 


1. an equal number of points have been assigned from the left and right sides 
(in case A(j) < BCj)), or 

2. two more points have been labeled on the left than on the right (in case 
A(j) > BCj)). 


Now when the labels A(m — 2) and B(n — 2) have been assigned, there remain 
four endpoints a < b <c <d which are unlabeled. Note that the three ways of 
pairing them are equiprobable. Our claim is that if a is paired with either c or d, 
then that interval intersects all others, but if @ is paired with b, then no interval 
intersects all others. This will prove the result. 

We consider the only two possible cases: 


1. a and b are on the left and c and d are on the right, and 
2. only a is on the left. 


In either case, all points labeled with A(-) are between a and c for otherwise one 
of a or c would have been labeled. It follows that either [a,c] or [a, d] meets all 
other intervals. 

On the other hand, suppose [a,b] and [c,d] are intervals in the collection. 
Neither is a candidate as an interval which intersects all others since [a,b] N 
[c,d] = ©. Suppose some interval [e, f] (where e < f) intersects all others. Sup- 
pose e and f have received the labels A(j) and B(j). 

In case (1), where a and b are on the left, we know that the endpoint [e, f] 
labeled A(j) is between b and c. Thus [e, f] cannot intersect both [a,b] and 
Lc, d]. 

Now consider case (2), where just a is on the left. Since [e, f] meets [c, d] we 
have f > c, hence f is an outer point (f = B(j)). Further, e is an inner point and 
therefore [e, f] went to the right. However, the last labeled pair, {A(nm — 2), 
B(n — 2)} must have gone to the left since (in this case) we assigned more labels 
on the left than on the right. Thus, for some k, with j < k <n — 2 we have that 
[ A(k), B(k)] went to the left, but [A(k — 1), B(k — 1)] went right. Thus A(k) <n 
and A(k) < A(j) since A(k) is a later-assigned, left-side inner point. It now 
follows that [B(k), A(k)] is disjoint from [ A(j), B(j)] = [e, f], a contradiction. 

With slightly more care one may use this construction to show that for any 
k <n, the probability that in a:family of n random intervals there are at least k 
which intersect all others is 


independent, again, of n. 
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Playing the Numbers 


You can play mathematical sequences as musical notes on your computer. If the 
sequence is periodic, the melody might be quite interesting, and it certainly puts a 
new dimension into the subject. 

For example, fix a modulus m. The sequence of Fibonacci numbers 


0,1,1,2,3,5,... (modulo m) 
is periodic. You can listen to the Fibonacci numbers (mod m) by running the 
following little Basic program (enter your favorite modulus m at the prompt). 


10 CLS: INPUT "MODULUS"; M : A=0: = 1 
20 C = A+ Bz C = C = M*¥INTCC/M) 
30 SOUND 130*2 (C/12), 2 : A = B: B = C: GOTO 20 


The tune ‘m = 51,’ for instance, is rather appealing. Try other periodic mathemati- 
cal sequences, scoring several of them together, varying the frequencies and 
durations, etc. See problem E3410 on p. 916 of this issue for information about the 


periods. 
—U. Phonious 


The Dark Side of the Moebius Strip 
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The Moebius strip has now been around for over a century. Everybody knows 
how to make one, as a real-world object as well as abstractly. And yet, as a 
geometrical configuration in three-dimensional Euclidean space, there is still no 
satisfactory model for it. Furthermore, most of the attempts in this direction have 
remained relatively unknown. The following is an account of what I found after 
some search, and some research, about these matters. In particular: a smooth flat 
model, as had been also found by Sadowsky [1930, no. 4]; two smooth flat algebraic 
models, one defined by Wunderlich [1962], and one found independently by myself 
[1988]; some results concerning the shortest Moebius strip, by Barr [1964], who also 
reports results of Gardner; some by myself, first reported here; a formulation of a 
variational problem—the Moebius strip of least elastic energy—by Sadowsky 
[1930, no. 5]; and, finally, my own conjecture, as it emerges from all the facts listed 
here plus some observations of real-world Moebius strips. 


1, What exactly is a Moebius strip? On one hand, it is often defined as the 
topological space attained by starting with a (closed) rectangle, endowed with the 
“usual” topology, and identifying two opposite.edges point by point with each 
other, so that each vertex gets identified with the one diagonally across. This 
“abstract Moebius strip” serves in topology as the canonical example Of a nonori- 
entable manifold. 

On the other hand, there is a physical model of the abstract strip, and it is 
usually denoted by the same term. Its inventor Moebius [1865] (also spelled 
“Mobius”, but “Mobius” is not an acceptable spelling), described it as follows: a 
paper rectangle that is sufficiently long and narrow is bent and twisted so that its 
two shorter edges can be glued together in the required manner. 

Between the abstract topological space and its physical model lie some concepts 
of intermediate degrees of abstraction. For example, besides a physical model for 
the topological strip, one can ask for a geometrical model in three-dimensional 
Euclidean space. Such a model will be provided by a subset of 3-space, that is 
homeomorphic to the topological strip. A simple approach to constructing such an 
“embedding” of the strip in 3-space consists of imitating the physical twisting that 
takes place when the paper strip is produced. Fix a line in three space, and pick a 
segment that is coplanar with the line, perpendicular to it and disjoint from it. 
Rotate the plane, and the segment with it, around the fixed line. At the same time, 
in the rotating plane, rotate the segment around its midpoint. If the second 
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rotation is carried out at half the angular velocity of the first one, the segment will 
have completed half a turn around its center when it has gone once around the 
line, and thus will meet itself with the required twist. If the fixed line is chosen as 
the z-axis of a coordinate system, the initial position of the segment is chosen to be 
[R — 1, R — 1] on the x-axis, for some R > 1, and the angles by which the two 
rotations have progressed are ¢ and 5t, respectively, then the equations 


x(s,t) =(R+scos3t)cost y(s,t) =(R+scos35t)sin¢, 
z(s,t) =ssin5t, 


for -—1 <s <1, 0<t < 27, define the embedding. The fact that for all s the 
functions at (s, 277) agree with their values at (—s, 0) reflects the identifying of the 
opposite sides of the rectangle. Besides being homeomorphic to the topological 
Moebius strip, this model has the advantage of using only analytic functions of s 
and ¢. In fact, it is not only analytic—it is algebraic: it is possible to eliminate s 
and ¢t from the expressions for the coordinates, and to represent the strip as part 
of an algebraic surface, that is, as the set of points where a polynomial in the 
coordinates takes on the value 0. 

There is, however, also a drawback to this model. The nature of this drawback 
will become clear after taking a closer look at the physical model, the paper strip. 
It was produced by bending and glueing, but no stretching was used: any curve 
drawn on the paper rectangle would have became a space curve of the same 
length. A mathematical model that reflects this property of the paper strip would 
be an isometric embedding. The following considerations show that the model 
defined above fails on that score. The center line of the rectangle is the line s = 0, 
and its image under the embedding is the circle of radius R around the origin in 
the x-y plane. Its length is 277R, and an appropriate rescaling would have matched 
this value with the length of the rectangle in the (s, t)-plane. However, the other 
lines of the form s = constant go into curves of greater length. To see this, find the 
velocity at which a point on the strip moves when s is fixed, and ¢ is regarded as 
“time.” The velocity can be split into two components, one in the plane through 
the point and the z-axis, and one perpendicular to it. Their magnitudes are 
R+scos5t and $s, respectively. The total velocity is, therefore, strictly greater 
than the first part, whenever s is not 0. Since integration of the first part from 
t= 0 to t = 27 yields just 27 R (note that the cosine term contributes nothing to 
the definite integral), the space-curve corresponding to nonzero constant s is 
strictly longer than the circle of radius R. The model is therefore not a valid 
representation of any Moebius strip that can be made out of a paper rectangle. 
Could it fit a strip made out of a differently shaped piece of paper? Again the 
answer is no, but in order to see this, a local consequence of the absence of 
stretching will have to be considered. 

A differentiable surface can be approximated in the neighborhood of any one of 
its points by a plane, the tangent plane at the point. If the surface is twice 
differentiable, it can be approximated to a higher degree of approximation by a 
quadric. Among twice differentiable surfaces, those that are obtained by bending, 
but not stretching, paper, are characterized by the property that the approximating 
quadrics will be planes or parabolic cylinders. Surfaces with this property, that 
holds if and only if the matrix of the second derivatives of z with respect to x and 
y is singular, are called “flat surfaces.” It is one of the classical results of 
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differential geometry, that among twice differentiable surfaces, a piece of plane 
can be mapped isometrically only onto flat ones. 

To the point (s = 0, t = 0) of the rectangle there corresponds the point (x = R, 
y = 0,. z = 0) in space. In a neighborhood of this point, the Taylor expansion 
shows that z = 3(x — R)y/R is true modulo third and higher powers of the 
distance from between (x, y) and (R,0). The strip is, therefore, locally approxi- 
mated by a hyperboloid at that point. Therefore, the surface is not flat, and 
consequently, the model fails to represent any paper model of the Moebius strip. 


2. The simplest flat surfaces are planes and cylinders. A Moebius strip consist- 
ing of three planar parts and three cylindrical parts, smoothly joined together, was 
found by Sadowsky [1930, no. 1]. I had no access to his papers, and before I read 
about them in Wunderlich [1962], I found such a model myself. This model is best 
described in physical terms. Consider a long cylinder of radius r, and a long paper 
rectangle of width w. Imagine the rectangle lying on a table, and the cylinder lying 
on the rectangle at an angle, that is, so that the axis of the cylinder forms an acute 
angle of, say, a with the long sides of the rectangle. Clearly one can get hold of 
one end of the rectangle, and pulling it up, wrap the rectangle tightly half a turn 
around the cylinder, and then let it continue at an angle of 7 — a to the axis, on 
the plane that is parallel to the table, a distance of 27 above it. 

Each one of the long sides of the rectangle will become a space curve consisting 
of a piece of a helix, and two straight-line pieces, each tangent to the helical piece 
at one of its end. By rolling out the cylinder onto a plane, the helical piece would 
become the hypothenuse of a right triangle with one side of length mr, and an 
angle of @ facing it. Its length is therefore zr /sin a. Now project the picture down 
onto the table. Each of the two space curves becomes a pair of segments forming 
an angle of 7 — 2a, whose missing tip is replaced by a sine curve. Note that the 
missing tip consists of two sides of an isosceles triangle with an angle of 7 — 2a 
facing a base of length wr cot a, and its length is 7r/sin a, the same as the helical 
part of the space curve. Since the segment that was on the higher plane does not 
change its length when it is projected onto the table, and the other segment stays 
put, the whole space curve retains its length when it is first projected onto the 
table, and then its rounded tip is replaced by continuing the segments until they 
meet. 

There is a “physical” consequence of this fact: when a (long) rectangle is folded 
once on the table, a cylinder, say of radius r, can be inserted into the fold, so that 
the rectangle wraps tightly around the cylinder for half a turn, and then continues 
in the horizontal plane that, lies 27 above the table, directly over its previous 
position on the table. 

A flat smooth Moebius strip will now be built from three of these configura- 
tions. Choose three angles a, B and y, that sum to 7; then 7 — 2a, 7 — 2 and 
m7 —2y also add up to zr. For the corresponding radii choose r, r and 2r 
respectively. Start with a rectangle lying on the table. Put a cylinder of radius r on 
it, so that its axis forms an angle of a with the long sides of the rectangle, and 
wrap the rectangle half a turn around it. The rectangle continues now on a plane 
that lies 2r above the table. Now repeat the procedure with the second cylinder, 
sufficiently far from the first one so as not to interfere with it (in “reality” a slab of 
thickness 2r will have to be inserted under the part of the rectangle that is r above 
the table, to keep it there). After this operation the rectangle continues on a plane 
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4r above the table. In the third step, where a cylinder of double radius is used, 
wrap downwards, causing the strip to return to the plane of the table, where it can 
be made to meet the first part that never left the table. 


v 


The surface that was constructed here is easily seen to have a continuously 
varying tangent plane: it is a smooth surface. Since each half-turn wrapping flips it 
around once, and it meets itself after three of them, it is nonorientable, and 
represents the Moebius strip. On the debit side, it is not analytic at the “seams” 
where cylinders meet planes. In fact, the second derivatives that are needed for 
defining the second-order Taylor series approximation are not defined at these 
points; still, the surface is “flat” in the extended sense: it is locally isometric to a 
piece of a plane. 

The construction can be modified so that higher derivatives will exist. All that is 
required, is that the circular cross-sections of the cylinders will be replaced by 
cross sections of the form 


where s is chosen so that the (generalized) helix will have the right length of 
mr/sin a, and the surface will have 2n — 1 derivatives. By using more complicated 
cross sections, derivatives of all orders may be made to exist, but this will also not 
yield analyticity; as long as there are planar portions of the model, it cannot be 
analytic. 
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3. Let us now have a closer look at the dimensions of the rectangle out of which 
the strip was made. The cylindrical parts of the surface will appear on the 
rectangle as three disjoint strips, two of width wr and one of width 27rr, that form, 
with the long sides of the rectangle, angles of alternating signs, and of magnitudes 
a, B and y, respectively. Let the width w of the rectangle be chosen as the unit of 
length. Clearly, the length of the rectangle must exceed the sum of the cotangents 
of a, B and y. The average of the angles is 7/3, and for positive acute angles the 
function cotangent is convex. By the Jensen Inequality, the average of the cotan- 
gents is at least the cotangent of the average, or cot(7/3), which is 1/ V3. The 
length of the rectangle therefore exceeds y3. 

Conversely, starting with any rectangle whose length exceeds V3 times its width, 
one can, for a = B = y = 7/3, and sufficiently small r, carry out the construction 
described above, and obtain a smooth flat Moebius strip. The question of how’ 
short a Moebius strip can be, is not completely solved by these considerations: for 
rectangles with length /width < V3 only this particular construction fails, and so a 
length to width ratio of more than V3 has been shown to suffice, but not to be 
necessary. In fact, in a certain sense there are arbitrarily short Moebius strips, if 
the assumption of a smooth embedding is relaxed, so as to admit “planar folded 
Moebius strips.” These are again best described in physical terms. While the 
models that correspond to what can be made out of paper, without stretching and 
tearing, are flat, in the sense of preserving the lengths of curves, they do not have 
to be smooth, or even one-to-one. If the latter condition is violated by the surface 
cutting through itself, the model cannot represent a paper surface. Consider, 
however, a sheet of paper folded flat along a line. It is reasonably approximated by 
a mapping of a rectangle onto the plane, and this mapping is at some points 
two-to-one. The folded paper may then be bent, yielding a two-to-one mapping 
onto nonplanar surfaces. The extension to finitely many folds is obvious. Martin 
Gardener has constructed folded Moebius strips in which that side of the rectangle 
that gets identified with its opposite is arbitrarily long compared to the two other 
sides. The bound y3 is thus replaced by 0. After the description of Gardner’s 
model, the bound v3 will make a come-back, in a slightly different role. Start with 
a rectangle of dimensions W and L. Pick a positive odd integer n, and divide the 
rectangle into n parallel strips of dimensions W/n by L. Folding along the dividing 
lines back and forth, accordion-fashion, an n-fold covered strip of length L and 
width W/n is obtained. 

No matter how large W is, compared to L, for sufficiently large n, this strip will 
be narrow enough to permit its being bent and glued into a Moebius strip. Note 
that the fact that n is odd permits the different layers of the folded strip to be 
glued to each other individually, in a manner that is equivalent to the identification 
of sides required by the configuration of the Moebius strip. 


4. Compare Gardner’s folded model with another folded model, namely, the 
limiting configuration obtained from the three-plane-three-cylinder model, when 
the parameter r approaches 0. The latter, that can be constructed by folding a 
rectangle three times, is, by definition, the limit of a family of smooth flat models. 
The value of a parameter like length /width for such a model is the limit of its 
values for the flat smooth models that approximate it. Inclusion of such folded 
models won’t change the least upper bound of a parameter. This is not the case for 
Gardner’s model: it cannot be approximated by flat smooth surfaces. More 
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generally: Any paper surface in the making of which a folding-line is bent or folded, 
is a finite distance away, in an appropriate sense of distance, from all smooth flat 
surfaces. This follows from the fact that every smooth flat surface is a ruled 
surface. For every point, one of the lines that pass through the point on the paper 
remains a straight line as the paper is bent to make the required surface. This 
condition is violated when the paper is folded, and then refolded, with the second 
fold cutting through the first one: let the acute angle between the two fold-lines be 
a. Then a lines of slope 6 through the point of intersection of the two folding-lines 
will be broken at an angle of 2 max(a, 8), which is at least twice the angle between 
the folding-lines. If, in making the surface, the folding line is bent, the straight 
lines through any of its points will also be bent by an angle bounded from below. 
Thus Gardner’s model does not disprove the conjecture that the length of a smooth 
flat Moebius strip exceeds ¥3 times its width. The model that Barr describes in 
Chapter 3 of his book [1964], where a paper square is folded along one diagonal, 
and then along the other, also does not imply the violation of this bound, since its 
making involves the folding of a fold. If there is a counterexample to the 
conjecture, it won’t be found by examining flat folded models. Indeed, the length of 
a flat folded Moebius strip without refolded folds is at least ¥3 times its width. The 
bound is attained by the configuration consisting of three equilateral triangles on top 
of each other, the first connected to the second along one side, the second to the third 
along another side, and the third to the first along the third side. 

This is proved by first eliminating all models in which the center-line of the 
rectangle becomes, after folding, a polygon of more than three sides. There remain 
only the models where it becomes a triangle, with the three fold-lines bisecting its 
three exterior angles. On the rectangle, the folding lines cut the center lines at 
angles a, —B and y. The cotangents must add up at least to the length over the 
width, and the proof concludes like above. Note, however, that here the bound is 
attained, because there is no extra room needed for any cylinders. 


5. Three decades after the work of Sadowsky, the step from flat smooth models 
to flat analytic, and even algebraic models was accomplished by Wunderlich [1962]. 
Chicone [1984], who was evidently not aware of Wunderlich’s article, studied the 
problem, and came up with a family of flat analytic surfaces in 3-space, homeomor- 
phic to the Moebius strip, but he failed to check whether they contain parts 
isometric to a strip obtained from a rectangle. To understand what is missing in 
Chicone’s model, consider the embedding of a cylindrical (“‘untwisted”) band in 
the surface of a cone. It is easy to find a homeomorphic embedding, but it cannot 
be trimmed down to an isometric image of a cylindrical band, since there are no 
closed geodesics on a cone. Having seen Chicone’s manuscript, but also unaware of 
the work of Wunderlich, I constructed a different flat algebraic model, also 
isometric to a rectangle [1988]. 

Wunderlich’s model and my own are both based on the same observation: the 
bisecting line of the rectangle that is parallel to the long sides, becomes a (closed) 
geodesic on the surface. If a curve is a geodesic on a flat smooth surface, and the 
curve has no straight parts, then the surface is the rectifying developable of the 
curve. This is a flat surface, which forms the envelope of all the rectifying planes of 
the curve. The normal to the surface at any point of the curve will be the principal 
normal to the curve at that point. Therefore, a simple, closed curve without 
straight parts, whose principal normal changes its direction by 180 degrees when 
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the curve is traversed once, will yield the required model: its rectifying developable 
will contain a band around the curve, isometric to a Moebius strip made from a 
rectangle; and if the band is narrow enough it will not intersect itself. A simple way 
to make sure that the principal normal will flip around, is to make the curve 
symmetric under a rotation by 180° around an axis that meets the curve at two 
points, one where the principal normal is parallel to the axis, and one where it is 
perpendicular to it. The second point must be a point of zero curvature of the 
curve. Wunderlich chose a parametrization in which this point is approached when 
the parameter ¢ of the curve approaches +. The coordinates of the curve are 
given in his model as rational functions (that is, quotients of polynomials) of the 
parameter. In my model, the point of zero curvature corresponds to ¢t = 0, and the 
fact that the curve is closed is interpreted as the curve having a periodic 
parametrization. This suggested trying for a parametrization by trigonometric 
polynomials. The simplest curve that has the required properties was found to be 


x=sint, y=(1- cost)’, z = sint(1 — cost), 


which is indeed an algebraic curve; it is the intersection of the surfaces 


ze=xy, and 8y=x°+ 6x*y + y’. 

As was pointed out to me by Gregory Brumfiel, the Seidenberg-Tarski Theorem 
[1954] can be used to show that not only the curve, but also the strip obtained in 
this manner is part of an algebraic surface. 


6. Let us now take another step in the direction from the abstract to the real 
world, and include some physical properties of paper in our considerations. If the 
substance of paper is assumed to be elastic, with a finite value for the constant in 
Hook’s law, it would be possible to stretch it. Paper that cannot be stretched, yet 
can be bent if a finite force is applied, is well approximated by an infinitely thin 
plate with an infinite Hook’s constant, so that bending any piece of it into part of a 
cylinder of radius r requires an amount of energy proportional to the area of the 
piece divided by r?. Since every small part of a flat surface is locally approximable 
by a cylinder with radius equal to the finite one among the two radii of curvature 
(the other one must be infinite), the total energy expended in making the surface 
out of paper is the integral of r~* over the surface. Left to itself, the paper will 
take on the shape of minimum energy among all shapes in 3-space that are 
compatible with the metric structure of the surface. 

As an example of this phenomenon, consider a cylindrical band, made out of a 
paper rectangle without twisting. The integral of 1/r over any closed geodesic on 
the band is forced to remain 277. Under this constraint, the minimum of the 
integral of the square of 1/r is attained when r is constant, which holds when the 
surface is a circular cylinder. Indeed, this is the shape the surface takes on “in 
the real world,” and if gently deformed, it returns to this shape. Paper Moebius 
strips also appear to have a clear stable shape, if the paper is not too soft, and the 
width of the strip is not too small. Sadowsky [1930, 5] approached the search for a 
analytical Moebius strip from this angle. He did not find an explicit form for the 
shape of minimum energy, but he simplified the expression for the energy: he 
reduced it to a one-dimensional integral, with respect to arc length s along the 
curve whose rectifying developable is the strip. The integrand can be expressed in 
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terms of the curvature k(s) of the curve, its torsion 7r(s) and the function 


WS) = ae 
T— —k— 


ds ds 


as follows: 
5 2k* + w 
[(e + 77) us log 2k — w ds, 
where w is the width of the strip, and s ranges over its length. 

Neither Sadowsky nor Wunderlich solved this variational problem—in all likeli- 
hood it is still open. On the basis of observation of physical models, Wunderlich 
states that “the shape appears to depend only slightly on the width w.” This may 
be true for small values of w; in fact, Sadowsky showed that in the limit, for the 
“infinitely narrow band,” the energy is approximately proportional to the integral 
of (k* + 77)?/k?, and it is plausible that the solution to the minimum problem for 
finite w approaches the (also unknown) solution of this simpler problem when w 
goes to zero. Note, however, that for a fixed curve, when w grows, and approaches 
the minimum of 2k*s along the curve, the energy becomes infinite. The depen- 
dence of the solution on w can, therefore, be expected to become much more 
pronounced as w becomes larger. My own observations of physical Moebius strips 
confirm this, and furthermore, lead to the following conjecture: 


For fixed length L, and width w less than L/ V3, there exists a unique minimum- 
energy smooth flat Moebius strip in 3-space. As w approaches L / v3 from below, the 
strip approaches the configuration of three equilateral triangles on top of each other, 
each two connected along one side. If w exceeds L/ ¥3, there is no smooth Moebius 
strip in 3-space, isometric to the corresponding rectangle. 
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Mixing Property of the Pedal Mapping 


PETER UNGAR 
109 Lee Road, Scarsdale, NY 10583 


In a recent article in the Monthly [2] Peter Lax proved a mixing property of the 
pedal mapping P which maps the set of angles of a triangle into the angles of 
the pedal triangle. His result tells us that if we start with almost any triangle, the 
sequence of successive pedal triangles contains triangles with shapes arbitrarily 
close to all possible shapes. 

In this note we represent the pedal mapping as a shift. We could have stopped 
there and referred to books on ergodic theory for proofs of mixing properties of 
shift mappings but we thought it would help many readers to give brief derivations 
of what is needed here, using only the elements of measure theory and probability 
theory. 

Lax follows Kingston and Synge [1] in representing a triangle of given shape as a 
point x in an equilateral triangle E = A PQR. The barycentric coordinates of x 
are the angles of the triangle in units of 7. In this representation the pedal 
mapping is the result of the following operations. 

Connect the midpoints of the sides of E. Denote the four congruent triangles 
into which & is subdivided by Ep, Ey, Ex and Ey where E> is adjacent to P ete. 
On the interior of a corner triangle the pedal mapping P is the dilatation by a 
factor 2 which maps the interior of the corner triangle onto the interior of &. On 
the interior of E,,, P is a reflection in the centroid of E combined with a 
dilatation by 2. The boundaries of the small triangles correspond to right angled 
triangles and degenerate triangles for which the pedal triangle is degenerate and P 
is undefined. 

We want to consider iterates of P. The mapping P is not defined on the 
boundaries of the four small triangles. Its square is not defined on the preimages 
of these, which are the boundaries of the subdivisions of Ep,..., E,,, etc. We get 
rid of these loose ends by restricting P to the set E’ obtained by removing from E 
its boundary points and the boundary points of all triangles obtained by successive 
subdivisions. The removed points form a set of measure 0. 

Let us choose units so that the measure of E’ is 1, so that we can employ the 
language of probability theory. 

The mixing property we are going to prove is the following. 


Let S be the intersection of E' with a triangle obtained by repeated subdivisions of E. 
For any point x © E' denote by N(x, S,n) the number of points x, P(x),...,P”~ (x) 
that lie in S. Then for almost all x and every S, 


N(q,S,n 
lim N(4, Sn) = measure of S. 
no n 


898 


1990] NOTES 899 


Once we have proved this the same statement immediately follows if S is a 
union of finitely many sets of the above kind. The convergence theorems on 
Lebesgue integrals could be used to extend the result to any measurable set S but 
we shall not do that; one can get that extension immediately from the argument of 
Lax who uses Birkhoff’s ergodic theorem. 

We label Ep, Ey, Eg and Ey with the digits 0, 1, 2 and 3. Each of these 
triangles is subdivided into 4 smaller triangles. These will be numbered by a 
sequence of 2 digits. Small triangle ij is that subtriangle of i which is mapped into 
j by P. The triangles in the next level of subdivision are labeled by 3-digit base-4 
numbers, etc. Thus for each point x in our probability space E’ we have a base-4 
sequence x,x, °:: which represents the nested sequence of triangles of successive 
subdivisions which contain x. 

Let us recall what is meant by saying that a real number is normal. In this 
paragraph only, let x be a real number. The property of normality is an assertion 
about the digits behind the decimal point, so assume for simplicity that 0 <x < 1. 
Let b be an integer > 1. Let 0.x,x,... be the infinite expansion of x in base b. 
For any block B, of k base-b digits, B, = (a,,...,a,), let N,(x, B,) be the 
number of occurrences of the block B, in the sequence x,,..., x, If N(x, B,)/n 
—+ b-* as n > © for every k and every B, then x is said to be normal to base b. 
We say x is normal if it is normal to every base. Borel showed that almost all real 
numbers are normal. ; 

In our work x is a point in E’ instead of a real number. The x, are random 
variables defined on our probability space FE’. Moreover, they are independent 
since the measure of a subtriangle specified by any given ndigit sequence is (+)”. It 
follows just as in the case of digit sequences representing real numbers that with 
probability 1 the digit sequence of x contains every block of length k with 
frequency 47“. 

The digit sequence of Px is obtained by discarding the leftmost digit of the 
sequence of x. A point x is in the subdivision triangle 21301, say, if the first 5 
digits of its representation are these. The image of a point under the n-th iterate 
of P is in that triangle if the five digits starting with the n + 1-st are as above. 
Borel’s theorem tells us that with probability 1 the frequencies of occurrences of 
all possible k-digit sequences tend to 1/4*. This completes the proof of the mixing 
property of P. 

For completeness we include a short proof that almost all numbers are normal.’ 
It suffices to show that the numbers which are not normal to any base form a set of 
measure 0. To show this it suffices to show that for almost all numbers the 
frequency of any digit in the infinite representation in base b approaches 1/b. 

Indeed, let B, be a block as in the definition of normal numbers. The base-b 
integer a = a,,..., a, is a possible digit in the base-b* representation of x. If we 
know that for almost all x the frequency of any digit approaches b~* in base b* 
we can draw the following conclusion about the frequency of occurrences of B, in 
the expansion to base b. 


"For a careful and clever treatment which gives very precise information about the difference 
between the number of occurrences of a digit minus the expected number, see William Feller’s 
Introduction to the Theory of Probability, vol. 1, ch. 6. 
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For almost all x the following is true: If we look only at blocks of k digits preceded 
by 0,k,2k,... other digits, the frequency of B, in the base-b representation of x 
approaches b~*., 


Since each digit is a random variable independent of the preceding digits, a 
similar statement must also be true for blocks preceded by 1,k + 1,... digits, etc. 
So we can deduce that the frequency of any block of length k approaches the 
required limit for almost all x if we can prove this for any digit a to any base b. 

Let X,, be the random variable which has value 1 if x, = a and is 0 otherwise. 
Then X,, X,,... are independent random variables with mean 1/b and variance 
(b — 1)/b* < 1. The number of occurrences of a among the first n digits of x is 


N,, has mean n/b and variance < n. Chebyshev’s inequality enables us to conclude 
P(IN, /n — 1/b| > n7%4) <n7¥?, (1) 


Consider a number x such that the frequency of the digit a does not approach 
1/b in its base-b representation. Then 


> 0. (2) 


We observe now that (2) must also hold if the limsup is taken over the subse- 
quence of the integers n which are perfect fourth powers, n = q*. This is because 
N,/n can change between (q — 1)* and q* by at most 4q°/n = 4/q and this 
approaches 0. Consequently (2) implies 


N« 1 
q’ ob 


(3) 


for infinitely many q. By (1) the probability that (3) holds for even a single value of 
q>Qis <1/07+1/(0+1)?+-:: . Since this tends to 0 as OQ > ~, the 
probability of the set of x such that the frequency of a in the representation of x 
does not approach 1 /b is 0, which is what we wanted to prove. 
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Integral Representation of a Finite Spike 


R. E. MiroLLo 
Department of Mathematics, Boston College, Chestnut Hill, MA 02167 


S. H. Strocatz! 


Department of Mathematics, MIT, Cambridge, MA ‘02137 


Consider the function f defined by the following integral: 


7 (u+cosa)(1 + ucosa) 


flu) = da, (1) 
J, (1 +u2 + 2ucosa)”” 
where u > 0. We will show that 
_f2, u=1 
flu) = {0 u # il. (2) 


Thus the innocent-looking integral in (1) represents a discontinuous function which 
is zero everywhere except for a finite spike at the point u = 1 (FiGurE 1). We ran 
across this integral in the course of a stability calculation for the equilibrium states 
of a certain many-body dynamical system. 


Fic. 1 


The most straightforward proof of (2) involves three steps: (a) Show f(1/u) = 
uf(u). Thus it suffices to prove (2) for u < 1. (b) Verify f(u) = 0 for 0 < u < 1 by 
a series expansion in powers of u. (c) Compute f(1) = 2 as an elementary definite 
integral. 

There is a more appealing proof of (2). It involves the fact that the integrand in 
(1) may be expressed explicitly as an exact differential. To see this, rewrite the 
numerator in (1) as u + ucos*a + u? cosa + cosa = usin? a + cosa(1 + u* + 


'The research of S. H. Strogatz was supported by an NSF Mathematical Sciences Postdoctoral 
Research Fellowship, DMS 8605761. 
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2u cos a), using the identity u = usin? a + ucos’ a. Thus (1) becomes 


7 u sin? a COs a 
pu) =f” | Ss *@ + — S82 | ae 


-o|(1+u2+2ucosa)” (1+u2+2ucosa)” 
7 d 
=f Faq Lub a)] da 


=y,(7) —y,(-7), (3) 
where 


sin @ 


(4) 


y,(a) = ——————73 : 
(1 + u? + 2ucosa)”” 


When u # 1, y,(7) = y,(—77) = 0, and hence (3) implies that f(u) = 0. Now 
assume u = 1. Then (1 + u* + 2ucos a)'/* = ¥2(1 + cos a)'/* = 2cos(a/2), and 
so y,(a) = (sin a)/(2 cos(a/2)) = sin(a/2) if a # +7. Hence lim,., ,,y (a) = 
+1. Thus (3) implies f(1) = 1 — (—1) = 2, as required. 

There is a simple geometric interpretation of these results (Figure 2). Consider 
two circles of unit radius, and let u be the distance between their centers. Choose 
a coordinate system with the origin at the center of one of the circles, and the 
x-axis along the line joining the centers. Given the point P = (u + cosa,sin a), 
define the point Q by the projection shown in FiGurE 2. 


Then the function y,(a@) may be interpreted as the y-coordinate of Q, since by 
similar triangles y(Q) = y(Q)/||OQ|| = y(P)/||OP|| = sin a/( + u? + 
2u cos a)'/* = y,(a). For u # 1, Q executes a closed path as a runs from —7 to 
a. Thus the net change in y,(q@) along the path is zero, which shows f(u) = 0 
when u # 1. However, note that for u near 1, the points P, P’ are splayed apart by 
the mapping PQ (Ficure 3). In fact, for u = 1, QO is not defined when 
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a= +7. Asa- +7, Q > (0,+ 1) and so y,(a) changes by 2 as a runs from 
— to mw. Thus f(1) = 2. Note that the mapping P — Q cannot be extended to a 
continuous map at u=1, a = +7. It is this discontinuity which underlies the 
curious result (2). 


On the Irrationality of 7? 


D. DESBROW 
Department of Mathematics, University of Edinburgh, Edinburgh, EH9 3JZ Scotland 


1. Introduction. In 1761 Lambert [7] showed, using continued fractions, that 
tan a is not rational when a is a nonzero rational from which it follows immedi- 
ately that 7 is irrational. Both these results were new; the latter being but a 
prelude to Lindemann’s celebrated proof of 1882 that 7 is transcendental. 

That in fact 7? is irrational by much the same reasoning with continued 
fractions was certainly known to Legendre [8, Note IV] by 1808. In 1873 Hermite 
[2] showed the irrationality of m* essentially from the nontrivial observation that, 
for the positive integer n, 


1 qr \2nt1 1 ayn aT 
—(5| fax ) COs yee 
is a polynomial in 77/4 with integer coefficients and degree the integer part of 
n/2. It is then easy to deduce, on the assumption that 7° is rational, the untenable 
existence of a positive integer less than 1. Variations on this theme are much 
favoured in text-books discussing the irrationality of 7. Furthermore Hermite’s 
method has been taken up and extended by others; most generally by Inkeri and 
Estermann. In [5] (but see also [4] for a similar, though earlier, version in English) 
Inkeri shows that there is no complex number a@ with a* and a~'tana both 
rational (except possibly 0 if one allows a~' tan a = 1 when a = 0 on the grounds 
of continuity). Later Estermann [1], unaware of Inkeri’s result, proved an equiva- 
lent form very elegantly. 

The purpose here is to show yet another path through this well-trodden terrain. 
Specifically we offer a novel proof of the following theorem which subsumes 
Inkeri’s result. 
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THEOREM. There is no nonzero complex number a with a* and atanha both 
Gaussian rationals. 


By way of explanation, a complex number is a Gaussian rational if both its real 
and imaginary parts are rational. 


We have the following corollaries. 


(i) w* (and so 7) is irrational. 
(ii) tanh a (and so e“) is not a Gaussian rational when a is a nonzero Gaussian 
rational. 


Clearly (i) follows from the choice a = vi and (ii) is immediate, noting only that 
z e*—-i| 


tanh — = 
oD e+ 


for each complex z not an odd integer multiple of zi. 

There are of course other proofs of the irrationality of a. Two in particular are 
both noteworthy and ingenious. One, again by Hermite [3, p. 74] but less elegant 
than his method outlined above, uses a sequence of power series expansions. The 
other by Niven [9] uses an integral; cleverly modifying Hermite’s integral method 
as above. Both ideas have been widely developed. Most notably in relation to our 
theorem and its corollaries, Inkeri [5] (again [4] should be helpful) uses Hermite’s 
power series approach to give an alternative proof of his result cited above while 
Niven’s integral approach has been used by Iwamoto [6] and Wachs [12] to 
establish the irrationality of m* and by Novak [10] to show that e® is not a 
Gaussian rational when a is a non-zero Gaussian rational. Seemingly unknown to 
Novak, his result was already proved by Schneider [11] who used, however, an 
interpolating series for the exponential. 

The proof offered here uses no more than the Taylor expansion of cosh ayz 
about 1 and some ideas prompted by [1] and [11]. 


2. Proof of the theorem. Let a be a nonzero complex number. We begin with 
the observation that w = cosh ayz is an entire solution of 


4zw" + 2w' — a2w = 0. (1) 


[Here /0 = 0 and yz = jre’’/* when z = re’ with r > Oand -7 <6 <7] 
For any complex z, 


w = cosh ayz = Fare" | (2) 
0 


so that w is certainly entire and the rest is routine verification. 
By Taylor’s theorem therefore, for some complex do, @,,a5,... and all complex 
z, real nonzero p, 


w = cosh ayz = Yaz — 1)’, (2) 
0 
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where for each n, 


a. = 
" QOnri 


1 cosh ayz 
[an & (3) 
c(z-—1) 
and C is the circle with centre 1 and radius p’. 
Substituting the expansion (2) into the equation (1) we are led immediately to 
the recurrence 


a 
a, = cosha, a, = — sinha, 
0 1~ 5 (4) 
4(n+1)(n +2)a,,,+2(2n4+ 1)(n + 1l)a,,, —a’a, = 0. 


n 


Now suppose that a* and atanha are both Gaussian rationals. Then a, = 


cosh a # 0 and there are positive integers k and K such that ka* and Ka tanha 
are Gaussian integers. For each n let 


bh, =n!(2k)" Ka, /ao. 
Immediately from (4) we have the recurrence 
bo, =K, b, =kKatanha, 
big tk(2n + 1)b,,, —k?a*b, = | 


It is evident from this that, for each n, b, and b, ,, are Gaussian integers not both 
0, so that 


|b,,| + load Z 1. (5) 


On the other hand for each n, using (3), 


Kn\(2k)” 
» | - Ae 


n 


cosh ayz 
J n+1 dz 
c(z-1) 
Kn!(2k)"eleW +e”) 


lay|p7” 


21a, 
< 


by the usual estimates, noting that |cosh z| < e!?! for each complex z. 
Now for each n > 1 choose p = n so that 


lDnl < — 


~ layin2” 


KniA"  -K [ A\" 
a 


< — 
lal 


where A = 2ke7'*!. Thus as n > ~, b, — 0; contradicting the inequality (5) and so 
proving the theorem. 
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The Width of a Rose Petal 


S. C. ALTHOEN AND M. F. WYNEKEN 
Department of Mathematics, University of Michigan, Flint, MI 48502 


“Don’t hurry, don’t worry. You’re only here for a short visit. So be sure to stop 
and smell the roses” (cf. [1]). Roses are a beautiful class of polar graphs too often 
passed by too quickly for lack of an interesting remark. (However, see [2] for a 
thorough discussion of roses and computer graphics.) We investigate three aspects 
of the famous cosine roses 


r= cos née, n = 2,3,4,..., (1) 
and two generalizations: the wonderful yet less well-known “double” roses 


r=a+bcosnd, n = 2,3,4,..., 0<a<b, (2) 
which generalize the limagon, r = a + bcos 9; and the family 


r® = cos née, n = 2,3,4,..., e> 0, (3) 
which generalizes the lemniscate, r* = cos n0. 


Graphs. First let’s look at a few graphs: 


r= cos30 r= cos 46 


Fic. 1 


907 
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ge oP 


r= 1+ 2cos30 r= 1+ 2cos4é6é 
Fic. 2 

r® = cos36 r!78 — cos 36 
Fic. 3 


On the one hand, the little petals in FiGURE 2 grow longer as a — 0. Thus 
r = cos2k6 has 4k petals, while r = cos(2k + 1)@ has only 2k + 1 petals. On the 
other hand, these little petals shrivel up as a ~ b. Thus r= 1+ cosn@ is an 
n-petalled rose for n = 2,3,.... 


Petal widths. The graphs of equations (1) and (3) are symmetric about the line 
containing the polar axis and look generally like the graph in Ficure 4 for 
—T/2n <O0<7/2n. 


Fic. 4 
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A natural problem is to find the maximum width 2y, of the petals of r = cos n@. 
Although one can approach this problem by first converting to rectangular coordi- 
nates, it is most satisfying to attack the problem intrinsically. To this end consider 
FIGURE 5. 


Fic. 5 


The angle & is the angle from the radius vector to the tangent and, as is shown 
in calculus, tan W = r/(dr/d@). For (x,, y,,) to be a maximum we must have the 
tangent line / parallel to the polar axis. Thus, we require & = 7 — 0, so tangy = 
tan(7 — 6) = —tan@ and so 

r 


dr /d0 


= —tand@ 


or 


g— =0 4 
+ ft — =. 
r+ tang (4) 


In the case of the four-leaf rose r = cos 26, (4) yields 


1 — tan’ 0 
2tan @ = cot26 = —————_, 
2 tan @ 


so tan @ = ¥1/5. Then the width is 2y, = 2rsin 0 = 2cos26 sin@ = 276 /9. 

In general, if r = cos n8, (4) yields n tan @ = cot né which is difficult to solve 
because the formula that expresses cotn@ as a rational function of tan @ 1s 
complicated. Another approach to finding the width uses the fact that cos @ is a 
polynomial in cos 6. Because cos 76 is the real part of e'”*: 


cos nO = Re(cos @ + isin 0)” 


Re ) (al sin‘ @ cos"~* @ 


O<k<n 


i 


y (sn | —1)” sin?” @ cos”~2” @ 


O<m«<n/2 


> (om DC — cos? 6)” cos”~ 2” g. 


O<m<n/2 
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We shall denote this polynomial by P,, so that P,(cos @) = cos n@. Observe that 
when n is even each term of P, has even degree, and when n is odd each term of 
P, has odd degree. Then, at least in principle, one can find the petal width of 
r = cosné@ by doing the following. First differentiate the equation y =r sin @ = 
cos n@ sin 8 with respect to 6 and set dy /d@ equal to zero. Denoting the smallest 
positive critical value by 6,, we have 


cos 6, cos n6,, = nsin 6, sin n6,,. (5) 


If we then add ncos@,cosné, to both sides of this equation and apply the 
difference formula for cosine, we obtain 


nN 
cos 6, P. (cos @,) = ———P cos @,), 
n a n) n +] 2 — 10 1) 


which is a polynomial equation in cos 6, of degree n + 1 and with the degrees of 
terms all of the same parity. In the case when n is odd we can view this as a 
polynomial equation in cos’ 6, of degree (n + 1)/2, and in the case when n is 
even we can divide both sides by cos @, and then view this as a polynomial 
equation in cos’ 6, of degree n/2. One could, in theory, solve these equations 
exactly in terms of radicals for n = 2,3,...,8. For arbitrary n, Newton’s method 
could be applied to find the approximate petal width. 

The reader can also do the width calculation with the equation r = a + bcos n6, 
0 <a <b. For example, for n = 2 the maximum petal width of the large petal in 
FicureE 6 is (2V6 /9\b + a)?/*b~'/*. The width of the little petal is (2V6 /9) 
(b — a)?/*b~'/*. We leave the case r* = a + bcos20,0 <a <b, as an exercise. 


Fic. 6 


Asymptotic results. In the search for asymptotic results, standard calculus limits 
have interesting geometric applications. First observe that 0 < 76, < 7/2, so 
6, — 0 as n > ©, Then from (5) we obtain 


(n6,)' cos 6,'cos nO, = (nO) ‘nsin@, sin né, 
sin 6, - 
= sin 16, ; 
0 


n 


thus a~'cos a = sina or 

a~' = tana, (6) 
where a is a limit point of the sequence {n6,},.,. From (6) we observe that a 
cannot equal 0 or 77/2 so that a must lie in the open interval (0, 7/2), and must in 


fact be the limit of the entire sequence since the graphs of a~‘ and tan q@ intersect 
in only one place on this interval. The corresponding x-coordinates x, = 
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cos n6, cos 8, will converge to cos a, and a calculator reveals a = .860333589... 
and cos a = .652184623.... 

Now consider the equation r* = cosn6é, e > 0. In solving for the smallest 
positive critical value 6, , equations (5) and (6) become 


n 
cos@, .cosné, , = — sind, ,sinnd@, , (5,) 
> > E > > 


and 
ea,' = tana, (6,) 


E 


where n6, , > a, aSn — ©. Let x, , be the x-coordinate of the critical value 6, ,, 
sO X,, = (cos no, ~)'7* COS 8, «5 and let x, be the limit of x, as n > %, so 

= (cos a)'7. It is interesting to compute the limit of x, as « — 0. From 6, ) 
one can see not only that a, — 0 as e — 0, but one can also obtain implicitly the 


derivative of a, with respect to «. Hence, we may apply |’H6pital’s rule as follows: 


_ Incosa, 
lim Inx, = lim 
e—0O e—0O E 
da 
= lim — tana 
e—0O ° d 
—tan a, 
= lim 


e>0 a, sec? a, + tana, 
—sina,/a, 


iim ——_——_—_— 
e-0 seca, + (sina,/a,) 


N|— 
Ne 


and so x, > 1/ve ase > 0. 

We note that these calculations can also be applied to the other intersection 
points of a~' with tan a. These intersection points relate to the x-coordinates of 
the horizontal tangents of the other. petals in the limit as n > 

The reader can easily generalize to the equations r* = a + bcosn0,0 <a <b, 
a+b=1, e >0, to again obtain lim, _,, x, =1/ve. If a+b> 1, the graph 
blows up as e — 0, and if a + b < 1, the oraph shrinks to the pole as « —> 0. 
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The Sum of Absolute Deviations and the Sum of 
Squared Deviations 


Davip L. FARNSWORTH 
Department of Mathematics, Rochester Institute of Technology, Rochester, NY 14623 


This note points out a similarity in the sum-of-squares and the sum-of-absolute- 
values settings of data analysis. In the usual sum-of-squares setting, the sum of the 
squared deviations from any real number ¢ can be written as a quadratic function 
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of t: 


f(t) = Via -t) = Ye, - 8) + a(t - x)" (1) 
i=] i=1 
for n data values x, The difference of f(t) and its minimum is n(t — x), a 
succinct, clearly nonnegative expression whose value is readily perceived to be zero 
at t = x. These results, which appear in many statistics textbooks, are obtained by 
completing the square or by differentiating with respect to f¢. 

The sum of the absolute values of the deviations, g(t) = Lx, — ¢l, attains a 
minimum value at t = m, the sample median (see, e.g., [1, p. 209] or [2, p. 400]). 
We now present a decomposition g(t) = Y|x, — m| + D,(t) + Dt), which paral- 
lels (1) in that both D, and D, will be transparently nonnegative always, and equal 
to zero when ¢t = m. Let 


N(r) = #{x, > m} —#{x; < m}, ift<m 
t{x, <m)} —#{x, > my}, ift >m, 


and let )}* denote a sum over all x, strictly between ¢ and m. Then 
n 
g(t) = Yo l|x; —m| +N |t —m|+2>*|x; - ¢l. (2) 
i=] 
Since N, > 0, all three terms on the right are nonnegative, and since the sum )™ is 
vacuous for t = m, the second and third terms are zero for t = m, as required. 
To prove (2), assume t < m, and note that for each x,, 


|x, — m| — |t — ml, ifx,<t 
|x, — tl = (|x, -— m| —|t — ml + 2|x,-tl, ift<x,<m 
Ix; — m| + |t — ml, ifm <x,. 


Result (2) now follows by summing over all x;. For ¢ > m the proof is virtually 
identical, and if t = m, (2) holds trivially. 
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The Lebesgue Integral Without Measure Theory 


A. VAN DAELE 
Department of Mathematics, University of Leuven, Leuven, Belgium 


Very often, a course in measure theory is the first occasion where students learn 
about the Lebesgue integral on R and R”. However, sometimes it is convenient to 
have some notions of Lebesgue integration theory on R” (like completeness of 
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_*(R”) and the dominated convergence theorem) without going into measure 
theory. We mainly think of students in physics and engineering. There are several 
more or less well-known procedures to develop the Lebesgue integration theory on 
R” without introducing too many notions of general measure theory (see, e.g., 
[1, 2]). In most cases they start from the Riemann integral and use the notion of 
null sets. In this note we will show how some of the well-known techniques can be 
used to provide a short and fairly complete treatment of Lebesgue integration on 
R”. | 

We will only deal with the integral on R. But the same method can be applied to 
R” (and in fact to many other more general cases). For more details see [3]. 

The starting point is the Riemann integral on continuous functions. 


1. DEFINITION. Denote by K(R) the complex vector space of continuous com- 
plex functions with compact support in R. If f < K(R), we use J(f) for the 
Riemann integral of f. 

Sufficient knowledge of the Riemann integral on continuous functions is as- 
sumed. 


2. DEFINITION. Let f: R — C be any complex function. Define || f|| € [0, 2] by 


h, < K(R),h, > Oand |f(t)|< ) A,(t) for all te R}. 


n=1 


fll = nt y I(h,) 


n=1 


Remark that infinite sums are allowed. There are no convergence problems 
because we are working in the extended positive real line. We have the following 
properties. 


3. PRoposiITION. Let f, f,, f,,... be complex functions on RK and let A © C. Then 
i. if f © K(R), then lf || = IC f), 

ii. AF Il = IAT IIfIl 

iii. if |f(O| < LP _ f,@) for all t € R, then |If\l < UP _yllfi ll. 


The proof of this result uses standard techniques of measure theory; see, e.g., 
[1]. Dini’s theorem is used in the proof of (i) and the other properties are more or 
less immediate consequences of the definition. From (iii) follows that || f]| < |lg|| if 
|f| <|g| and that || f+ gll < |Ifll + llgll. In particular, || - || is a semi-norm. 


4. DEFINITION. Denote by -'(R) the set of complex functions f on R such that 
for all « > 0 there exists a g © K(R) such that || f — g|| < e. Define I(f) = lim J(f,,) 
if f © _7'(R) and if (f,,) is a sequence in K(R) such that ||f — f,|| — 0. 

It is very easy to see that -'(R) is a complex vector space containing K(R) as a 
subspace and that f, |f|e“\(R) if fe7(R). It is also easy to show that 
lim /(f,,) exists and does not depend on the choice of the sequence. Moreover, the 
definition of J on .#'(R) extends the Riemann integral on K(R). The functional / 
remains linear and positive on -'(R) and still Jf) =||f|| holds on -7'(R). 
These results follow from the corresponding results for the Riemann integral 
because J is the unique continuous extension of the Riemann integral to -'(R). 

We now come to the main properties. 


5. THEOREM (Completeness of -'(R)). If (f,,) is a sequence in .#'(R) such that 
lf, —f,|| ~ 0 as n,m — ©, then there is an f € £*(R) such that ||f — f,|| > 0. 
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Proof. Let (f,,) be a Cauchy sequence in -“'(R). By taking a subsequence if 
necessary, we may assume that ||f,,, —f,||< 1/2” for all n. Let f be any 
complex function on R such that f(t) = lim f,(¢) if lim f(t) exists. The function f 
may have any value at the other points of R. Choose n € N. If ¢ © R and lim f(t) 
exists, we have 


f(t) — f(t) = x (Fert) —f,(t)) and 


f(t) - fi O| < Lo | fear) -— AO). 
On the other hand, if this last sum is finite, then the first one will also converge 
and so lim f(t) exists. Therefore, if t © R and lim f,(¢) does not exist, then this 
sum is infinite and the inequality is trivial. So our inequality is true for all ¢ € R. 
By (iii) of the proposition we get 


°° 1 
If - fill < » II feat — fy < ant 


From this it follows that f € .'(R) and that f, > f in 7'(R). 
By similar methods we can prove the monotone convergence theorem. 


6. THEOREM. Suppose that (f,,) is a sequence in .£'(R) such that 0 <f, <f, 
< ++: and that sup I(f,,) < %. If fis a function on R such that f(t) = lim f(t) for 
all t, then f € #'(R) and I(f) = lim I(f,). 


Proof. For all n and all t © R we have 


f(t) —firlt) = Lo feat) — fi(4)) 
k=n 
and by the proposition 


lf fall < Lo Whew — fall 


k=n 
= » I fet — fi) — lim I( fin) _ I(f,)- 


The right-hand side converges to 0 if n > ©. It follows that f © ./'(R). So also 
lf —f,ll = 10 -—f,) = 1 f) — ICf,,) and we obtain I(f) = lim J(f,,). 

By a standard procedure the dominated convergence theorem can be obtained 
from the monotone convergence theorem (see, e.g., [1, 2]). 


7. THEOREM. Suppose that (f,,) is a sequence in -2'(R) and that f is a complex 
function such that f(t) = lim f,(t) for all t © R. Suppose that g = '(R) and that 
If,(t)| < g(t) for all t € R. Then f € #*(R) and I(f) = lim ICf,). 


Proof. By considering real and imaginary parts we may assume that the func- 
tions f, f,, f.,... are real. If we apply the monotone convergence theorem to the 
sequence (g + sup, <,<1 fen We Obtain that sup, ., f, © (R). If we apply 
the monotone convergence theorem once more to the sequence (g — sup, .; f,) 
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we find that lim sup f,, € 7'(R) and so f € “'(R). Moreover we obtain I(f,) < 
I(sup,, <; f,) if n <k so that 


lim sup I( f,,) = inf ( sup (fe) < inf (1( sup fi) =I(f). 


Similarly, or by applying the same formula to —f, and —f, we get I(f)< 
lim inf /(f,,). Therefore, J(f) = lim I(f,,). 


We only used techniques that are well known in integration theory. Neverthe- 
less, using some simple new ideas (as in the proof of the completeness of -'(R)), 
we have obtained a shorter and simpler treatment of the main results on the 
Lebesgue integral on R. We finish this note by showing that also .7(R) can be 
treated in such a short way. In fact, the same is true for all _7?(R) with 1 < p < 
(see [3]). 


8. DEFINITION. Let || fll2 = F277 for all f: R- ce. 


In the following proposition we have collected all the necessary results on || - ||2 
to obtain _“*(R) and its properties. 


9. ProposiTION. Let f, 8, f,, fo,... be complex functions on R and A € C. 
i. If |lfll2 < © and ||gll2 < © then || fell < Ilfllallglle, 

ii. |[Af Ilo = IAI If lle, 

iii. if |f(@| < LF_,If,@| for all t, then |lflle < LF_ fille. 


Proof. If ||f\l2 < 1 and |lgll. < 1 we find 
I fell = lf llel Il < SILA? + lel? < ACUIAI I + Well) < 1. 


In the general case we just have to multiply f and g by the appropriate constants 
to prove (i). Again (ii) is trivial. To prove (iii), observe that 


2 
el < CUI < YE Wfllellfylle = ( EIfall 


if all || f, || < c. Hence (iii) follows in all cases. 

It is clear that now we can proceed by defining -“*(R) as the closure of K(R) 
with respect to this new seminorm. And by the same arguments, -/7(R) will be a 
complex vector space and -/7(R) will also be complete. Moreover, the product of 
two functions in .“*(R) will be in -'(R) because of the definition and the 
Cauchy-Schwarz-like inequality proved in the proposition. 
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ELEMENTARY PROBLEMS 


E 3409. Proposed by Ioan Tomescu, University of Bucharest, Bucharest, 
Romania. 


Suppose G is a connected k-chromatic graph which is neither a complete graph 
nor a cycle on m vertices with m = 3 (mod 6). Prove that in any k-coloring of G 
there exist two vertices of the same color having a common neighbor. 


E 3410. Proposed by Peter Freyd, University of Pennsylvania, Philadelphia, PA. 


(a) Given a positive integer m, let f(m) be the period-length of the Fibonacci 
sequence taken modulo m. Prove that f(m) < 6m for all m and that equality 
holds for infinitely many values of m. 

(b) Prove the analogous assertion for the Lucas sequence with 6 replaced by 4. 
(The Fibonacci sequence {F,}"_., satisfies Fy = 0, F; = 1, and F.,,=F,+ F,_, 
for n > 1; the Lucas sequence {L,}"_, satisfies Lp = 2,L, =1,L,4, =L, +L, 1 
for n = 1.) 


E 3411. Proposed by Donald E. Knuth and Boris Pittel, Stanford University, 
Stanford, CA. 


Find a closed formula for 


1 
» oe 
k,+k,+ ++: +k,=m ky '(k,!+ ky!) °°: (k,!+ k!+ °°: +k,,!) 


916 
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and 
1 


kK, +k 5+ oe +k,=m 


where both sums are extended over all n-tuples of nonnegative integers with 
sum m. 


E 3412. Proposed by Eric Wepsic, Boston, MA. 


Suppose that we place n balls (numbered from 1 to n) into n urns (numbered 
from 1 to n) in the following way: The ith ball is placed into an urn chosen 
randomly from the first i urns. Let P(n, k) be the probability that an urn chosen at 
random from the n urns contains exactly k balls G.e., nP(n,k) is the expected 
number of urns containing exactly k balls). Find lim, _,,, P(n, k) for fixed k. 


E 3413. Proposed by Robert McNaughton, Rensselaer Polytechnic Institute, 
Troy, NY. 


Suppose n,, 7”, are positive integers and m,, m, are integers such that m,/n, < 
m,/n,. Let V be the semigroup under componentwise addition formed by all pairs 
(p,q), where p and q are integers, q > 0, and m,/n, < p/q <m,/n). Call a set 
B a generating set for V if B C V and every element of V is equal to a finite sum 
of elements of B, repetitions allowed. 

(a) Prove that V has a generating set B in which q < max(n,,n,) for every 
(p,q) €B. 

(b) Prove that V has a unique minimal generating set, i.e., one contained in 
every other generating set. 


E 3414. Proposed by Gerald Myerson, Macquarie University, New South Wales, 
Australia. 


Suppose we construct a sequence of rectangles as follows. We begin with a 
square of area one. We then alternate adjoining a rectangle of area one alongside 
or on top of the previous rectangle. The figure shows the first five rectangles in the 
sequence. 

Find the limiting ratio of length to height. 
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SOLUTIONS OF ELEMENTARY PROBLEMS 
Looking at the Regular Polyhedra 


E 3167 [1986, 650]. Proposed by Ernesto Bruno Cossi, Universidade Federal do 
Rio Grande do Sul, Brazil. 


Let P. be any one of the five regular polyhedra inscribed in a unit sphere. For 
each polyhedron P, determine the smallest and the largest number of vertices of P, 
which can be seen from a point on a concentric sphere of radius R > 1. 


Solution by Matthew Cook (student), University of Illinois at Urbana-Champaign, 
and the editors. Presumably the problem asks for the complete evaluation of the 
two functions f, and F, on (1,0) defined as follows: f,(R) is the minimal number 


of vertices of P. visible from a point on a concentric sphere of radius R, while 


L 


F(R) is the maximal number of vertices of P, visible from a point on a concentric 
sphere of radius R. (The unit sphere is considered to be transparent; only the 
surface of P. has “hiding power.”) Clearly both f,; and F, are non-decreasing step 
functions taking only a finite number of values. 

Suppose P, is the regular tetrahedron, P, is the cube, P, is the regular 
octahedron, P, is the regular dodecahedron, and P,; is the regular icosahedron. 
For simplicity let us refer to the center of the unit sphere as the ongin. 

The first two cases are comparatively trivial. For the regular tetrahedron 
f,CR) = 3 and F,(R) = 4 for all R, while for the cube f,(R) = 4 and F,(R) =7 
for all R. In both cases the minimum number f,(R) occurs on a ray from the origin 
through the centroid of a face, while the maximum number F,(R) occurs on a ray 
from the origin through a vertex. 

The planes of the faces of each regular polyhedron divide the exterior of the 
polyhedron up into a finite number of parts. In the interior of any such part the 
number of vertices visible remains constant. One could go beyond the question 
raised in the problem and ask for the complete description of all of these parts and 
the number of vertices visible in the interior of each one. This description is rather 
easy in the case of the tetrahedron and cube. 

In the case of the octahedron we could take the vertices to be the points 
(+1,0,0), (0, + 1,0), and (0,0, + 1). Then the planes of the eight faces have the 
equations x + y +z = +1. In the interiors of the various parts into which these 
planes divide the exterior of the octahedron, either three, four, five or six vertices 
are visible. 

Using this information it is easy to see that f,(R) and F,(R) each take on two 
distinct values. Thus f,(R) = 3 for1 <R< v3 (equality occurring on a ray from 
the origin through the centroid of a face) and f,(R) = 4 for V3 < R < © (equality 
occurring on a ray from the origin through the midpoint of an edge). Similarly 
F,(R) = 5 for 1 < R < y3 (equality occurring on a ray from the origin through a 
vertex) and F(R) = 6 for V3 < R < ~ (equality occurring on a ray from the origin 
through the centroid of a face). 

Now we turn to the dodecahedron, for which the situation is somewhat more 
complicated. Although it is hard to visualize the regions of space formed by 
extending a dodecahedron’s faces, it is not hard to categorize them. But first, it will 
be very useful to observe that if three faces A, B,C are arranged so that A is 
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adjacent to B, and B is adjacent to C, but A is not adjacent to C, then whenever 
all three are visible, we must have at least one of two situations: Either the face 
adjacent to A, B, and C must be visible or the two other faces adjacent to B must 
be visible. (See FicurE 1.) 


Fic. 1 


In terms of the picture, we claim that when D is not visible (but A, B, and C 
are) we can see E and F. We shall only show that E must be visible, since a 
symmetric argument would do the same for F. 

Let P be the point of intersection of the extensions of faces A, E, F, C, and D, 
directly out from the center of B. Let L be the line of intersection of the 
extensions of & and D, and let M be the line of intersection of the extensions of 
A and C. Clearly, L and M both pass through P. 

The region in which neither EF nor D is visible is the V-shaped trough formed by 
two half planes bounded by L, one containing E and the other containing D. The 
region in which both A and C are visible is the V-shaped trough formed by two 
half planes bounded by M, one’s extension containing A and the other’s extension 
containing C. These troughs touch only at P. Hence there is no region in which A 
and C are visible but D and E are not. This completes the proof of our claim. 

Now we can start categorizing the regions of space formed by the extensions of 
the faces of the dodecahedron, with the criterion of number of visible faces. There 
is only one kind of region in which just a single face can be seen; it is a five-sided 
pyramid extending from the surface of the face to the point of intersection of the 
extensions of the five adjacent faces. (P was such a point.) Since the regions are 
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defined by which faces are visible within them, each different kind of region will be 
defined by the pattern of visible faces. Clearly there is only one kind of region in 
which exactly two faces can be seen, since they must be adjacent faces. Such a 
region consists of the space between two nearby pyramids of the kind described 
above. 

There is only one kind of region in which exactly three faces are visible, since of 
the two possible contiguous arrangements of three faces, one necessitates the 
visibility of even more faces, as was shown in the claim above. 

If four faces are visible, there must be two which are adjacent not to each other 
but to a third visible face. This is the premise of our initial claim, so if we are to 
see only one face besides these three, we know which it must be. Thus there is only 
one kind of region in which four faces are visible. 

If exactly five faces are visible, then we must be able to see one face from each 
of five of the six pairs of opposite faces. Let us denote the hidden face of the 
diagram above by 4’, B’,..., so that A’ is opposite A, B' is opposite B, and so on. 
Now suppose that D and D’ are the opposite faces neither of which we see. Of the 
other ten faces, five are adjacent to D, and five are adjacent to D’. We must be 
seeing at least three contiguous faces from one of these two groups. Without loss 
of generality, suppose we see faces A, B,C. Then, by our claim, since we do not 
see D, we just see FE and F. Thus there is only one possible arrangement for five 
visible faces. 

Finally, for six faces, there are two different kinds of regions corresponding to 
the only two possible arrangements of six visible faces. Clearly, no more than six 
faces can be seen at once from any viewpoint: at most one face from each of the six 
pairs of opposite faces can be seen. 


peOa 


(ii) (iii) (iv) 


comiey 


(v) (vi) (vii) 


Fic. 2 
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Number of visible faces: 1 2 3 4 5 6 6 

Arrangement of visible faces: (i) Gi) Gi) Gv) W) Wi) ~~ Wii) 
from FIGURE 2 

Number of visible vertices: 5 8 10 12 14 15 16 

Minimum distance from origin: 1 1 1 1.512 1.777 1.777 4.236 

Maximum distance from origin: 1.777 1.512 4.236 © 00 00 00 


Compiling this data, we see that 
f4(R) =5  forl <R< 1.777 
f,(R) = 10 for 1.777 < R < 4.236 
f,(R) = 12 for 4.236 < R < 
and that 
F,(R) =10 for1l < R < 1.512 
F,(R) = 12 for 1.512 < R < 1.777 
F,(R) = 15 for 1.777 < R < 4.236 
F,(R) = 16 for 4.236 < R < , 


The case of the icosahedron is still more complicated and we shall not present it 
here. Visual inspection of physical models and a modicum of reasoning gives us 


3<f(R)<8, 6<F,(R) < 11. 


Thus fully half of the twelve vertices can be seen somewhere on the concentric 
sphere of radius 1 + e€, and all but one of the vertices can be seen from a suitable 
vantage point on the concentric sphere of radius 10, say. 


No solutions were received before the customary deadline. The above solution was prepared by Mr. 
Cook at the request of the editors. 


A Discrete Maximum Problem 


E 3253 [1988, 258]. Proposed by Sydney Bulman-Fleming and Edward T. H. 
Wang, Wilfrid Laurier University, Waterloo, Canada. 


(a) Let n and ¢ be natural numbers with n odd and t <n. Suppose x; is an 
integer not divisible by n for i= 1,2,...,t. Determine the maximum possible 
number of pairs (i, j) with 1 <i <j <¢t such that x, + x, = 0 (mod n). 

(b) Let k, n, and t be natural numbers with n relatively prime to k and t <n. 
Suppose x, is an integer not divisible by n for i =1,2,...,t. Determine the 
maximum possible number of k-tuples (i,,i,,...,i,) with 1 <i, <i, < ++: <i, 
<t such that x, +x, + ++: +x; = 0 (mod n). 


Solution of (a) by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. The 
answer is |¢*/4]. Without loss of generality we may assume 1 <x, <n — 1. Put 
n = 2r+ 1, where r > 0. (The problem is vacuous when n = 1.) Let A be the 
number of x; such that 1 <x;<r. Since x; +x; =0 (modn) if and only if 
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x; = 2r+1-—-x,, the maximum number of admissible pairs (i, j) for a given value 
of A is A(t — A). If t is even, we have A(t — A) < t*/4, with equality, for 
example, when ¢/2 of the x; have the value 1 and the ¢/2 remaining x; have the 
value n — 1. If t is odd, we have A(t — A) < ((t — 1)/2H(t + 1)/2}, with equality, 
for example, when (t — 1)/2 of the x; have the value 1 and the (t+ 1)/2 
remaining x, have the value n — 1. In either case the maximum number of 
admissible pairs (i, j) is |t7/4]. 

The assumption t <n is unnecessary, but the assumption that n be odd is 
essential. For if m is even, we could take all the x; to be n/2 and obtain ¢(t — 1)/2 
admissible pairs. 


Editorial remark on Part (b). When n > k > 2 the maximum number of admissi- 
ble k-tuples (i,,i,,...,i,) appears to occur (at least approximately) when 
(kK — 1)t/k] of the x; have the value 1 and the ¢ — |(k — 1)t/k]| remaining x, have 
the value n — k + 1. In that case there are 


[Ie Da \t — |(k — 1)t/k]} ~ (tye 


admissible k-tuples. Again the assumption t <m seems unnecessary, but the 
assumption (n, k) = 1 appears essential. For if (n,k) = d > 1, we could take all 
the x, to have the value n/d and obtain 


(>a 
k})~ ki 


Part (a) was solved also by the proposers. No solutions were received for part (b), which should have 
been marked with an asterisk. 


admissible k-tuples. 


A Sequence Generated by Translation and Reversal 


E 3254 [1988, 258]. Proposed by David Singmaster, Polytechnic of the South 
Bank, London, England. 


If a, and c are given positive integers, define a sequence {a,} by letting a, ,, be 
the positive integer obtained by reversing the digits in the ordinary decimal 
expansion of a, +c (nm =0,1,2,...). For example, if a, = 18 and c = 2, the 
sequence begins 18, 2, 4, 6, 8, 1, 3, 5, 7, 9, 11, 31, 33, 53, 55, 75, 77, 97,.... 

(a) If c = 1, show that the sequence {a,} is eventually periodic and that all such 
sequences have the same ultimate cycle. 

(b) If c = 2 and 1 < a, < 99, show that the sequence {a,} is eventually periodic. 

(c)* If c = 2, show that the sequence {a,} is eventually periodic for any choice 
Of do. 


Solution to (a) found by all respondents. Let a have k > 2 digits. The successor 
of a has no more than k digits, and if a ends in 9, then its successor has fewer 
than k digits. Since iteration brings a 9 to the rightmost position (in at most 17 
steps), the process ultimately reaches the cycle (1,...,9) independent of ap. 


Solution to (b) by J. C. Binz, University of Bern, Switzerland and Walther Janous, 
Ursulinengymnasium , Innsbruck, Austria (independently). For c = 2 and1 < ay) < 99 
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Because b’0 < 10° and yé <5- 10°, we have yé + b’0 < 6: 10°, which implies 
that yé + b’0 = ef, where e < 5 and f has s digits. In particular, d’(c + y)6 + b’0 
=di(c+e)f, with c +e <9. Furthermore, (e + c)d + 2 < 10- 10°, so (e + c)d 
+ 2=(e+ y)6, with e + y < 9a single digit. We now have 

Ong2= (file t+c)d + 2)' = (fle + y))' =8(y + e)f=8(y + y)d + 5, 
as desired. 

This leaves the case that cd =5-10°—1=49---9 (s+ 1 digits). Now 
n<5-1075~! — limplies b < 10°~! — 1, and thusn + 2=(b + 1)-5- 10541. 
With y,6 playing the roles of c,d for n+-2 as above, we have y = 0 and 
6 =0-:-01(s digits). The two-step iteration is 


a,.. = ((a, + 2)' + 2)' = ((d’'8d + 2 + b'0)' + 2)’ = ((d'95 + b'0)' + 2)’. 
Because 6 has as many digits as b’0, we have 96 + b’0 = 9b'1, and 
An.. = (1b9d + 2)’ = (1(b + 1)08)’ = 606 + (6 + 1)'0, 
as desired. 


Editorial comment. Bondesen also pointed out that, although every starting 
value proceeds to a cycle when c = 2, there are arbitrarily large cycles; in fact, the 
value 1075*! + 10° + 1 occurs in a cycle of length 9 - 10° for s = 1,2,3,..., so 
that in particular 1011 occurs in a cycle of length 90. Further L. L. Foster noted 
that a, = 1011 is the smallest value of a, not ultimately leading to the cycle of 
length eighty-one given in the above solution of (b). 

The proposer noted that when c = 3, already the starting values a, = 1 and 
a, = 2 lead to cycles of different lengths. 


Complete solutions were received from A. Bondesen, L. L. Foster, and A. Riese. Parts (a) and (b) 
only were solved by J. C. Binz, W. Janous, R. S. Lehman, S. Pedersen (Denmark) and C. H. Webster. 


The Lattice of Connected Subgraphs of a Graph 


E 3281 [1988, 655]. Proposed by Sandi Klavzar and Marko Petkovsek, University 
of Ljubljana, Ljubljana, Yugoslavia. 


Given a graph G, let C(G) be the inclusion ordering on the vertex subsets of G 
inducing connected subgraphs. Prove that C(G) is a lattice if and only if G is 
connected and every cycle of G induces a complete subgraph of G. 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. Suppose C(G) is a lattice. Note that a single vertex trivially induces a 
connected subgraph. If a and b are any two vertices of G, then their supremum in 
C(G) induces a connected subgraph containing both a and b. Thus all vertices of 
G belong to the same component, and G is connected. If a and 5b are distinct 
vertices contained in a cycle of G, then there are two sets of vertices S$, and S, 
that induce connected subgraphs and intersect in {a, b}. The infimum of two sets in 
C(G) is contained in their intersection. Since each of a and b is a lower bound for 
S, and S, and C(G) is a lattice, the infimum must equal {a, b}. Hence {a, b} 
induces a connected subgraph, which implies that ab is an edge and the cycle 
induces a complete subgraph. 
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Conversely, suppose that G is connected and every cycle of G induces a 
complete subgraph. We first establish the existence of infima. Let {S,} be any 
nonempty collection of vertex subsets of G that induce connected subgraphs. If 
(| S; contains just one vertex or no vertices, then this is the infimum. Otherwise, 
suppose {a, b} C 1 S;. Fixing a particular S,, let P be a minimal subset of vertices 
that induces a connected subgraph such that {a,b} c P C S;; this set must induce a 
chordless path from a to b. | 

We claim that P c ()S;,. Otherwise, there is some k such that P Z S,, but then 
S, also contains a path Q from a to b. Let u be the vertex of P before its first 
vertex not in Q, and let v be the next vertex of P that also belongs to Q. Then u 
and v belong to a cycle, and the hypothesis on G implies that uv is an edge. This 
chord of P contradicts its minimality, thus establishing the claim. 

Since P C 1) S;, we conclude that a and b are in the same component of the 
subgraph induced by 1) S;. With a, b chosen arbitrarily, this subgraph is connected. 
It follows that (S, is the infimum of {S,}. Finally, let {7,} be the collection of all 
common upper bounds for {S,}. Since the set of all vertices is such an upper bound, 
{T.} is nonempty. The supremum of {S,} is the infimum of {7,}, and we conclude 
that G is a lattice. 


Editorial comment. The graphs in which every cycle induces a clique are also 
known as “‘Husimi trees.” For general graphs, the ordering C(G) has been studied 
in other contexts. 


Salved also by R. J. Hendel, A. Pedersen (Denmark), P. M. Shimura (student), and the authors. 


The Angle Bisectors of a Triangle 


E 3282 [1988, 762]. Proposed by D. M. Milosevié, Pranjani, Yugoslavia. 


Given a triangle ABC, suppose that w,,w,,w. are the lengths of the angle 
bisectors, s is the semiperimeter, r is the radius of the incircle, and R is the radius 
of the circumcircle. Prove that 


w2+we+w2<s?—r(R/2-7r), 
with equality if and only if the triangle is equilateral. 


Solution by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. Starting 
from Stewart’s formula for the length of an angle bisector 


w, = be — a*bc/(b + c)* = be — a2bc/(2s — a)’, 
we get 
w2 + we + w2 = be + ca + ab — abc{ f(a) + f(b) + f(c)}, 


where f(t) =t/(2s — t)* for 0 <t < 2s. Since f(t) = (85 + 2t)/@s — t)*, the 
function f is strictly convex on the interval (0, 2s). Thus 


f(a) + f(b) + flc) = 3f({a + b + c}/3) = 3f(28/3) = 9/(8s) 
and 
w2+wet+w2<bet+ca+ab—9/(8s), 


with equality if and only if the triangle is equilateral. Now equating two well-known 
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formulas for the area of a triangle gives s*r* = s(s — aXs — bs — c) or 
sr? = (s —a)(s —b)(s —c) =s? —(a+b+c)s* + (be + ca +t ab)s — abc 
= —s?+ (be + ca + ab)s — abc 
or be + ca + ab =r? + 5? + abc/s. Thus 
we +wetw2 <r? ts? —abc/(8s). 
Using the familiar relation abc/s = 4rR, we get the claimed inequality. 


Editorial comment. Stewart’s formula can easily be derived using the law of 
sines and the law of cosines. The proposer remarked that the above inequality and 
Euler’s inequality 2r < R immediately give w? + wz + w?2 < s*, with equality only 
if the triangle is equilateral. (Cf. 5.1 and 5.34 of O. Bottema et al., Geometric 
Inequalities, Groningen, 1969.) H. W. Guggenheimer mentioned that the inequality 
we + we +w2 <s? was part of E1628 [1963, 891; 1964, 687]. 

Solved also by J. Anglesio (France), R. Bellot-Rosado (Spain), E. Braune (Austria), J. Fukuta 
(Japan), L. Kuipers (Switzerland), I. Paasche (W. Germany), R. Prielipp, I. A. Sakmar (Turkey), 


V. Schindler (E. Germany), J. M. Stark, M. Vowe (Switzerland), and the proposer. Three incorrect 
solutions were received. 


A Double Impossibility 


E 3331 [1989, 524]. Proposed by Walter Rudin, University of Wisconsin, Madison. 
Exhibit a positive continuously differentiable function f on [0, 0) such that 
f(x) >f*(x4) (O<x<-), 

or prove that-no such function exists. 


Solution by Eugene Levine, Adelphi University, Garden City, NY. No such 
function exists, since if f is such a function we obtain the contradiction 


1 ‘(x 1 1 1 

foe pv , pz F re - 

f(9) “0 0 f*(x) f(0) = fC/F(0)) sf (0) 
Editorial comment. Several solvers proved more general results. One possible 

extension is that, given a >0, e>0, and m2>0O, no positive differentiable 

function exists satisfying f(x) > af'*t*(x) on[m,~). Also, several solvers pointed 


out that the problem has the same answer if f7(x) is understood to mean f(f(x)) 
instead of (f(x))?. 


Solution by William S. Zwicker, Union College, Schenectady, N.Y. under the 
alternative interpretation. Assuming that f?(x) means f(f(x)), we prove that there 
is no positive differentiable function f on [0, 0) which satisfies f’(x) > f( f(x)) for 
0 <x < o, Assume f is such a function. Since f’(x) > f(f(x)) > 0, f is strictly 
increasing, so that f(x) > f(f(x)) = fC f(O)) > 0. This means that f’(x) is bounded 
away from 0, and so lim,_,,, f(x) =~. Hence lim, _,,, f(f(x)) =, and so 
lim, _,.. f(x) = %. If we set h(x) = f(x) — (x + 1), then h’(x) also approaches », 
which implies that h(x) approaches «. Thus we may choose an x with f(x) > x + 
1. Now applying the mean value theorem to the interval [x, f(x)], we obtain a 
point c in the interval (x, f(x)) for which f(f(x)) = f(x) + f(a f(x) — x) > 
f'(cK f(x) — x) = FUMIO) — x) > fFOI FOX) — x) > ffx), which is a 


contradiction. 
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Solved also by 105 others and the proposer. Most solvers (including the proposer) interpreted the 
problem as Levine did, but over 10% pointed out the ambiguity. 


A Deranged Recurrence 


E 3335 [1989, 525]. Proposed by D. E. Knuth, Stanford University, Stanford, CA. 


Solve the recurrence 
Xp =4,X, =), Xy49 =Xn41 +%X,/(n + 1) forn > 0 
both exactly (in terms of familiar functions of n) and asymptotically. 


Solution I by Jungmin Woo (student), Yonsei University, Seoul, South Korea. The 
solution is x, =(n + Ila + (b — 2a)URIM-D*/k!]~(n + Dia + (b — 2a)/e). 
Let y, =x,/(n + 1) and z, =y, —y,_, for n> 0, z) =a. Then y, =a, y, = 
b/2, z,=b/2—-—a, and (n+ l)y, =ny,_,+ yY,-2 for n> 2. This yields 
(n + 1)z,, — —Oy,-1 — Yn—2) = —Z,~-1, SO 


w= bailed) (Sao 


for n > 1. Since y, = yyL"_,z;, we have y, =a + (b — 2a)L7_(-1)'**/@ + DI. 
This yields x, as claimed above, since L?_(-1)'*'/G + D!= LZto(— D*/K!. 
Solution II by Hans Kappus, Rodersdorf, Switzerland. This is a second order 
homogeneous linear recurrence, which means that x, = Ay, + Bz, for linearly 
independent solutions {y,},{z,} and suitable constants A, B. With y, =n + 1 
serving as one solution, put z, =w,/n! to obtain the recurrence w,,, = 


(n + 2Xw, + w,, 1), the numbers u, = w,_, thus satisfying the well-known recur- 
rence D,,, =(n + 1XD,+D,,,) for the derangement numbers D, = 
niv”_ (—1)*/k!. Hence z, =D,.,,/n! satisfies the given recurrence. Using the 


initial conditions, we obtain x, = a(n + 1) + (b — 2a)D,,,/n!. Since D, =n! /e 
+ O(1/n), we have x, =(n + Dla + (b — 2a)/e] + OU/(n + DY. 


Solution II by J. M. Borwein and P. B. Borwein, Dalhousie University, Halifax, 
Nova Scotia. The generating function A(y) = Lx,y”*'/(n + 1) satisfies the dif- 
ferential equation A’ — a — by = y(A' — a) + yA obtained by multiplying the re- 
currence by y”*? and summing over n > 0. The initial condition is A(O) = 0. By 
standard techniques for solving first-order differential equations, we obtain A(y) 
=(a—b)+a/ —y)+ (b — 2a)e~”/(1 — y). For n > 0, the coefficient of y”** 
in the power series expansion of A is a + (b — 2a)U745(—1)*/k!, which yields 
the same formula for x, as above. Asymptotics can be determined as above. 


Solved by the proposer and seventy-seven readers. One incorrect solution was received. 


A Triangle from a Tetrahedron 


E 3337 [1989, 641]. Proposed by Murray Klamkin, University of Alberta, Edmon- 
ton, Alberta. 


Suppose the two longest edges of a tetrahedron are a pair of opposite edges. 
Prove that the three edges incident to some vertex of the tetrahedron are 
congruent to the sides of an acute triangle. 
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Solution by Jesus Ferrer, Universidad Complutense, Madrid, Spain. Let O, A, B,C 
be the vertices of the tetrahedron, and suppose that OA and BC are the longest 
edges. If the edges incident to each vertex fail to be the side lengths of an acute 
triangle, then the law of cosines implies the following inequalities: 


OA? > OB* + OC’, BC* > OB? + AB’, («) 
OA? > AB* + AC?, BC? > OC* + AC?. 
(The inequalities (*) are valid a fortiori if some triples do not satisfy the triangle 
inequality.) Summing these inequalities, we obtain 
OA? + BC? > OB* + OC? + AB* + AC’. 
Expressing the vector BC as OC — OB (and similarly for AB and AC), we can use 
the fact that squared length equals vector inner product to rewrite this as 
0 > OA? + OB? + OC? — 20A - OB — 204 - OC + 20B- OC 
=||OA — OB - OI’, 


that is, OA — OB — OC= 0. Thus OC= BA and OB= CA, which requires OBAC 
to be a plane parallelogram. Since equality must hold in all four parts of (*), in 
fact OBAC is a rectangle. 

In all other cases at least one of the inequalities (*) fails, and the lengths of 
segments incident to the corresponding vertex are the side lengths of an acute 
triangle. 


Solved also by C. Athanasiadis (student), J. Balogh (student, Hungary), L. Bankoff, R. J. Chapman 
(United Kingdom), O. P. Lossers (The Netherlands), J. H. Nieto (Venezuela), A. Nijenhuis, and the 
proposer. Two incorrect solutions were received. 


POSTMORTEM COMMENTS ON ELEMENTARY PROBLEMS 
Weitzenbock Generalized 


E 3150 [1986, 400; 1988, 658]. Proposed by George A. Tsintsifas, Thessaloniki, 
Greece. 

Let ABC be a triangle with sides a, b, c and area F. It is well known that 
a? +b? +c? > 4Fy3. If p,q,r are arbitrary positive real numbers, prove that 


D q r 
a’ ae c* > 2Fy3. 
qtr r+ p + q 


Remark by the editors. The first'of the two published solutions was submitted by 
Victoria Pambuccian and was correctly attributed to her in the copy submitted by 
the editors and in the galley proofs. Unfortunately, someone subsequently changed 
the name of the solver to Victor Pambuccian without consulting the editors. 


3-Player Blackjack 


E 3186 [1987, 71; 1989, 736]. Proposed by I. A. Sakmar, University of Western 
Ontario. 


On a current TV game show, three contestants for the final showcase prizes 
compete by spinning a wheel, which is divided into twenty equal sections num- 
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bered in steps of 5 from 5 to 100. The aim is to get as close to 100 as possible 
without exceeding a sum of 100 in two spins. Each contestant completes one or 
both spins before the next contestant’s turn, and if either the 1st or 2nd contestant 
exceeds 100, that contestant is out of contention. In case of a tie, the first 
contestant to attain that score is chosen. 

In the running of the program, it is implied that it is a good policy for the first 
contestant to stop if the number from the first spin is greater than 50. What is the 
number at which the first contestant should stop on her first spin? 


Remark by the editors. Peter A. Griffin has pointed out that (a) the published 
solution ignores the possibility of collusion (an unlikely event on the actual show) 
and (b) the numerical value given for the probability that A wins (assuming no 
collusion) should be 0.341223 instead of 0.341771. 


Producing a Ranking From Recent Results 


E 3240 [1987, 996; 1989, 529]. Proposed by C. L. Mallows, AT&T Bell Laborato- 
ries, Murray Hill, NJ. 


The tennis players in a certain club compete in singles matches. After each 
match is played, a ranking of the players is computed according to the following 
rule. Starting with the most recent match and working back through time, use the 
match results to build up a partial order. Ignore any match that is inconsistent with 
more recent results. Stop when the partial order is a complete order. 

Prove or disprove: a player cannot improve his position by intentionally losing a 
match. 


Remark by the editors. The proposer’s solution published in 1989 gave a scenario 
in which a player benefited by throwing a match in a competition of seven players. 
In an editorial comment, we asked if there was such an example with fewer than 
seven players. B. Hayes has provided the following example with only six players. 
Suppose that the following results are obtained in the order given, where ab 
means a beats Db: 

61 26 64 36 34 42 23 31 21 52 15 41. 
The ranking after the last match is 3 >6>4>1>5 > 2, but if 4 now plays 5 
and loses, the ranking becomes 5 > 4>2>3>6> 1. Player 4, by losing, rises 
from third place to second. 


ADVANCED PROBLEMS 


6643. Proposed by Gérard Letac, Université Paul Sabatier, Toulouse, France. 


For positive real t put 


Prove that ¢ increases from 0 to 4 when t varies from 0 to +, 


6644. Proposed by Peter Rogerson, SUNY at Buffalo. 


A needle with length L between 1 and y2 is tossed at random upon the 
Cartesian plane. Find the probability that it comes to rest not crossing any line of 
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the form x = m or y = n, where m and n are integers. (The case L < 1 is treated 
on pp. 255-256 of J. V. Uspensky’s Introduction to Mathematical Probability, 
McGraw Hill, 1937. Uspensky attributes that case of the problem to Laplace.) 


6645. Proposed by Robert Kreczner, University of Wisconsin-Stevens Point. 


(a) Show that the sequence {(sin n)”}"_, is dense in (0, 1). 
(b)* Show that the same sequence is dense in (—1, 0). 


SOLUTIONS OF ADVANCED PROBLEMS 
Great Expectations 


6576 [1986, 1036]. Proposed by Hans U. Gerber, University of Lausanne, Switzer- 
land. 


Suppose X,, X5,... are independent identically distributed real random vari- 
ables with ECX,) = w. Put S, =X, 4+ X,+ °:: +X, for k = 1,2,... 

(a) If p <p <1, where p = —0.278465... is the real root of xe!~* = —1, 
show that the series 


CO 


> Ske **/k! 
k=1 


converges with probability one. 
(b) If X,, X,,... are positive and if w < 1, show that the expectation of 


y. Ske~ Sk /k! 
k=1 


is w/ — pw). 
*(c) In (b) is it possible to relax the condition that the random variables are 
positive? For example, would it suffice to assume E(|X,|) < » and p <p < 1? 


Solution I by Christian Hipp, Institut fiir Mathematische Stochastik, Hamburg, 
Germany. For part (a) note that p < w < 1 implies |we'~“| < 1. So by the strong 
law of large numbers, 


1 1 
— S;, exp(1 — Rok <1 


k 
with probability 1. Stirling’s inequality 


k! > (Qa)? kkt/2e-k 


lim sup 
k 


implies that for k > 1, 


Hence, 


< © 


ye 


k 


1 
Sy xP — Si) 


with probability 1. 
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For part (b) the random variables X,, X>,... are positive. Let N(t), t > 0, be a 
Poisson process with intensity 1. Then, a straightforward calculation shows that, 
given X,, X5,..., 


1 
prone = P{N(X, +--+: +X,) =k}. 


Let Y,, Y>,... be the interarrival times of N(t), t > 0. The Y’s are i.i.d. and have 
an exponential distribution with mean 1. Since 


N(X, +77 +X,) =k eY, + +¥, Xb +X, <Y, t+ + YG, 


the mean M of 


ae | 
— Oko —- Sx 
» 7 Sie 
k=1 
equals the expected number of times k > 1 for which 
Y,t-°° +¥, <X, + °° +X, < VY, t+ +¥% a4. 
Now consider the compound Poisson process S(t), ¢ > 0, with interarrival times 
Y,, Y>,.-.. and jumps X,, X,,.... In this process, M is the mean number of 
crossings of zero of the path 
t— S(t), t>0 
or, equivalently, the mean number of ruins of ¢ — S(t) (ruin occurs whenever 
t < S(t)). By the strong Markov property, the number of ruins has a geometric 
distribution with parameter 


P{ar:¢t< S(t)} = EX =p. 
Hence, M = p/(1 — p). 
Solution II (for (b)) by O. P. Lossers, Eindhoven University of Technology, 
Eindhoven, the Netherlands. Let ¢ be defined by g(s) = Ee~“**, Then o*(s) = 
Ee~&*)5«° and so 


oO ioe) oo xke-* 
M=E ¥) Sfe-«7ki= ¥ f dF**(x), 
k=1 k=170 


where F** is the distribution function of S,. It follows that 


EGS 


By Lagrange’s formula (see e.g., G. Pélya and G. Szegé, Aufgaben und Lehrsitze 
aus der Analysis, Erster Band, Springer, 1964, problem 209, p. 125) we have 


°° “| d \* 1 
+ E“[4feo) -cio 
where u and ¢ are related by ug(t) =¢t. Since g(—1) = 1 and —¢(—-1) = yp, 
putting u =t = —1 yields M = w/(1 — pw) as required, for 0 < pw < 1. 


Editorial comment. No solutions were received for part (c), but O. P. Lossers 
considered as an example the case where X, has a negative exponential distribu- 
tion with mean uw > 0. He gave M for uw > 1. In fact, the same calculation holds if 
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—X, has a negative exponential distribution, with EX, = uw < 0, where wp lies in 
the interval specified below. 

In particular, if X, (or —X;,) has a negative exponential distribution with mean 
ww, then S, (or —S,) has a Gamma distribution with parameters k and pw. Hence, 


letting I. = (0, 2), (or (—~, 0)), 


k-1 
—S, k,-s —s/p 
ES,e [ste Tk ak° ds 
_ (2k) LL f 
— «T(K) law | 
Now, 
P(2k) — (2k—-1)! 1 (2k)! 1(-1\(-4)! 
[(kK)k! (k—-1)!k! 2 klk! 2k 
Therefore, 
~~ Lem (_# \'_ 1s /-12\(_=4# 
gai kK! T(K) aso 55 k | (1+ p)° 
4 —1/2 
~3{[1- Ap -1} =5{|*)-4, 
2 (1 + #) 2\;|l-4 


where the series converges if (and only if) - ;< w/(1 + uw)? < §. This inequality 
holds except when —3 — V8 <p <—-3+ V8 (approximately —5.83 < uw < —.17) 


or when wu = 1. Thus 


vi 
—-3+v8 <pw<l 
1+yp v8 e 
1 
M= 1 1<p<owor-eo<p< —34+ V8 
— w 
oa) w=1 


undefined -—3-— V8 <p < —3 +78. 


O. P. Lossers conjectures that this is always true whenever the original series 


converges. 
Parts (a) and (b) were solved also, vila upcrossing probabilities, by Boris Pittel and by the proposer. 
Solutions via Laplace transforms and Lagrange’s formula were given also by Terence R. Shore, Western 


Maryland College Problems Group, and a referee. 


Rays in Normed Linear Spaces 


6586 [1988, 963]. Proposed by Charles §. Allen, Drury College, Springfield, MO. 
In an infinite-dimensional normed linear space does there exist a convex set 
which does not contain a ray but whose closure does contain a ray? 


Solution by Thomas W. Starbird, Jet Propulsion Laboratory, Pasadena, CA. We 
show the answer is yes by following Ann Griesel’s suggestion to construct such a 
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set in an arbitrary infinite-dimensional topological linear space X. In fact, we shall 
construct in such an X a sequence x,, (i > 1, j > 1) whose convex hull contains no 
ray, but whose closure (and hence the closure of its convex hull) contains a ray. 

Let {e,}, i = 0,1,2,..., be a collection of linearly independent points in X. Let 
{r,}, i = 1,2,... be an enumeration of the positive rationals. Set 


Xi; = 19 + e;/), t>1,j>1. 


Since X;; > 7;€9 aS j > %, the closure of {x,,} contains the ray from 0 in the 
direction @). We now show that the convex hull of {x,,} cannot contain a ray. 

Fix two distinct points a and b in a ray R, and assume that they are in the 
convex hull of {x;;}. Hence they are convex combinations of finitely many {x,,}, and 
belong to the linear span L of {e,,e,,...,e,} for some integer n. Thus R itself is 
contained in L. If R is also in the convex hull of {x,,} then any z in R has the form 


z=) Ya, ;Xj;5 
toj 
where the a;; are nonnegative and 
» a; =1. 
if 


But the a@,, are 0 for i > n since the e, are linearly independent—the formula for 
x,, has only nonnegative coefficients so no cancellation can occur. Thus R is in the 
convex hull of the x,;, with i <n. But this is merely the convex hull of 7 line 
segments, namely the segments (7;€),7;@€) + e;] for i= 1,2,...,n, and hence 
cannot contain a ray. 


Editorial comment. Victor Klee showed in 1953 that in every infinite-dimen- 
sional real vector space X there is even a ubiquitous convex set containing no ray. 
A subset S of X is called ubiquitous if every point of X is an endpoint of some 
open segment contained in S. See pp. 107-108 of V. Klee, Convex sets in linear 
spaces III, Duke Math. J. 2001953), 105-111. In the opposite direction S. K. Chung 
(Hong Kong) showed that a convex set in a finite-dimensional Hausdorff topologi- 
cal vector space contains a ray if its closure does. 

Solved also by S. K. Chung (Hong Kong), Nir Cohen (Israel), A. A. Jagers (The Netherlands), Victor 
Klee, O. P. Lossers (The Netherlands), Charles Riley, John Henry Steelman, Richard B. Tucker, and 
the proposer. 


Conjugacy Classes of Mobius Transformations 


6589 [1989, 65]. Proposed by Kevin Brown, Kent, WA. 


Given a positive integer g, let n(g) denote the number of conjugacy classes of 
MObius transformations each member of which generates a cyclic group of order g. 
Characterize those g greater than 6 for which n(g) = 1 (mod6). (A Mobius 
transformation is a linear fractional transformation of the extended complex plane. 
Cf. Chapter II of Carathéodory’s Theory of Functions, Volume I, Chelsea, 1954.) 


Solution by the proposer. The numbers g that occur are all numbers of the form 
p*? and 2p’, where d is any positive integer and p is any prime congruent to 11 
modulo 12 (the smallest such g is 121). For the proof we represent each MGbius 
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transformation by its corresponding matrix of determinant 1. Now it is well known 
that a Mobius transformation generates a cyclic group of order g if and only if Q, 
the square of its trace, is 


ka 
O= tcos?( | with (k,g) = 1. 


(Cf. L. R. Ford, Automorphic Functions, McGraw-Hill, 1929, §8.) Also, two such 
transformations are similar if and only if they have the same squared trace. Hence 
the number of conjugacy classes of Mébius transformations that generate a cyclic 
group of order g is the number of distinct values of QO with (k, g) = 1. In terms of 
Euler’s phi function this is p(g)/2. Hence in terms of the decomposition of g into 
prime factors, we must find all 


g= | [pf > 6 
for which 


p(g)/2 = sl 1(p,; ~ 1)pé-' = 1mod 6. 


This rules out more than one odd prime factor for g, and the problem is now 
reduced to a simple exercise (details omitted) in elementary number theory. 


Solved also by Thomas McDonald and Allan Pedersen (Denmark). 


The Evenly Spaced Integer Topology 


6590 [1989, 65]. Proposed by Jestis Ferrer, Oliva (Valencia), Spain. 


Let Z be the set of integers with the topology generated by the sets a + kZ = 
{a + kx: x © Z}, where a € Z, k € Z, k # 0. (This topology, known as the Evenly 
Spaced Integer Topology, is studied in considerable detail in Steen and Seebach’s 
Counterexamples in Topology.) 

Suppose k and / are relatively prime positive integers with k > 2, and let B, , 
be the set of all (positive) prime numbers congruent to / modulo k. Find the 
derived set of B, ;, i.e., the set of accumulation points of B, , = 1+ kZ. 


Solution by A. A. Jagers, Universiteit Twente, Enschede, The Netherlands. Let Bi ; 
denote the derived set of B,,. We prove that Bi, = {1} if /=1 modk and 
Bi, ={-l}if 1 = —1 mod k, but that By , = © otherwise. Let 1 + mZ be a basis 
neighborhood of 1. Then 1 +kmZ is a smaller neighborhood of 1 containing 
infinitely many primes by Dirichlet’s theorem. All these primes belong to B, ; if 
! = 1modk. Hence 1 belongs to B; ; if / = 1 mod k. In the same way — 1 belongs 
to Bi, if / = —1 mod k. On the other hand, if ¢ is an integer other than —1, 0, or 
1, then ¢tZ is a neighborhood of ¢ (and also of 0) that contains no (positive) primes 
except possibly ¢ itself. Thus only 1 and —1 can occur as accumulation points of 
sets of primes. Finally, note that / + kZ is a closed neighborhood of B, ,, so that 
any accumulation point of B, , must belong to / + kZ. 


Editorial comment. Jagers also points out that Bj, = {—1} (and, e.g., BE; = ©) 
instead of the incorrect B,; = © given in §58.5 of Steen and Seebach’s Counterex- 
amples in Topology. 


Solved also by Michael Barr (Canada), Jesis Angel Hernandez Isla (Spain), Nicholas Passell, and 
the proposer. One incorrect solution was received. 
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Integrals That Were Personal Friends of Ramanujan 


6600 [1989, 366]. Proposed by P. J. De Doelder, Technical University, Eindhoven, 
The Netherlands. 


If m is a positive integer, prove that 


[70 au = 2(-1)"2m) 102m + 1) 


0 Sin u 
m-1 (—1)/(q7/2)7*" 
+ 2(-1)"*'(2m)! © Blam — 2j) 


j=0 
and 
njzuen me m=1 (—1)/(a/2)” | 
i ——— du = 2(-1)""*"(2m — 1)! x ay Bm ~ 2;), 
where 
ora) 1 co (-1)* 
Mts) = x (2k +1)’ BCs) = x (2k +1)" 


Solution by Nathaniel Grossman, Department of Mathematics, UCLA. We note 
first that 
ore) uP 

J sinh w 
when p is a nonnegative integer. These two evaluations can be verified by 
expanding 1/sinh u and 1/cosh u into series of powers of e~“, and then integrat- 


ing term by term. 
Let C denote the positively oriented, rectangular contour with corners at 0, 


7/2, 7/2 + 1A, and iA. By Cauchy’s Theorem, 


zP 
J dz = 0. 
csin z 


Denote the sides of C by B, R, T, and L (for bottom, right, top, and left). 
We let A — ~ and find by parametrizing the top segment of C that 


= o| = o0(1). 


Further obvious parametrizations yield 


00 (a/2 + iu)” + iu)” 


J = if ~coshu 
P ip\(m\i . co yeu 
= —f{— | 7e7tl d 
YF (SF) | some “ 


jJ=0 
ap lf im)’ . 
= 2pti? y= {-F] B(p -j +1), 
—o J! 


J 


=2pI)Mp +1) and f° ——— du = 2(p!)B(p + 1) 


cosh u 


zP 
rsin z , 
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and 


= —j[P du = —2p!\i?A(p +1). 
i} | saa i. PUPA Dp ) 


I 
i) 
S 
°S 
~ 
o—~ 
S 
+ 
ray 
~~ 

| 
i) 
S 
*S 

+ 
MM» 


Now take p = 2m, an even, positive integer. We know that the integral is 
real-valued, so by equating real and imaginary parts we find that 


oe” = 2(—1)"(2m)!A(2m + 1 
PO Fy Me = (2m)!A(2m + 1) 
m=1 (—1)!(a/2)7*" 


+2(-1)"*'(2m)! ¥ 


Py aya pr 8m — 2j), 


as proposed, and also 


2 


m 1 T J 
Xu Qn! =] B(2m — 2] +1) =0. 


Note that in the latter identity every term is a rational multiple of 7?”*'. 
When p = 2m — 1, an odd positive integer, separation into real and imaginary 
parts gives 


2m-1 m-1(— 1) 9) 
[au = (-1)"*'Q2m - 1D CVG?) 
0 sin u j=0 (27)! 


as proposed, and also 


A(2m) = y (70) (= 


2j+1 
— 2 _ ce ; 
© ap } BQm 2) -1) 


Note that in the latter identity every term is a rational multiple of +2”. 


Editorial comment. A. A. Jagers gave a short elegant proof by bringing f(u) = 
log tan(u /2) into the integrand via an integration by parts, expanding f(u) into a 
Fourier series, and integrating termwise. C. Georghiou and Lynn R. Purser 
pointed out that somewhat more general results are given in Bruce C. Berndt, 
Ramanujan’s Notebooks Part I, Springer-Verlag, New York, 1985, Chapter 9, Entry 
14, p. 261. 


Solved also by Jean Anglesio (France), C. Georghiou (Greece), A. A. Jagers (The Netherlands), 
Lynn R. Purser, Wolfgang Schuster (Germany), and the proposer. 
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POSTMORTEM COMMENTS ON ADVANCED PROBLEMS 
Shapiro’s Cyclic Inequality 


4603 [1954, 571]. Proposed by H. S. Shapiro, New York University. (The present 
address of the proposer is: Royal Institute of Technology, Stockholm, Sweden.) 
Given x; => 0,7 = 1,2,...,n. Establish 
X4 X5 Xx 


n 
+ + eve ——_——_ >—, 
Xo +X3 Xz +X, X, +X, xX, +X, 7 2 
equality occurring only if all denominators are equal. 


Remark by the editors. It is now known that the assertion of the problem is true 
only in the following two cases: 

(i) n is even and n < 12 

(ii) n is odd and n < 23. 
For more detailed information see B. A. Troesch, The Validity of Shapiro’s Cyclic 
Inequality, Math. Comp., 53 (1989) 657-664 and the references given there. 


Density of Pairs with Same Prime Factors 
5735 [1970, 532; 1971, 680]. Proposed by Paul Erdés and T. Motzkin, University 


College of Swansea, Wales. 


Denote by F(n) the number of pairs 1 < a < b <n for which a and b have the 
same prime factors. Prove that lim, _,,, F(n)/n exists and is finite. 


Remark by the editors. The solution published in 1971 was incorrect. It began 


with the erroneous assertion that if n = rj'r5? --- rZ*, where the r,; are distinct 
prime numbers and the a; are positive integers, then 
F(n) — F(n — 1) = ajay +++ ay. (*) 


The left-hand side of (*) is equal to the number of pairs (a,n) such that] <a <n 
and a and n have the same prime factors, while the right-hand side of (*) is equal 
to the number of such pairs (a, n) which satisfy the additional condition that aln. 
For example, if n = 50, the left-hand side of (*) counts the four pairs (10, 50), 
(20, 50), (40, 50), and (50,50), while the right-hand side of (*) counts only the two 
pairs (10,50) and (50,50). Thus the value given for lim F(n)/n in the published 
solution is too low. 

For a correct proof of the assertion of the problem see H. N. Shapiro, Integer 
vectors with interprimed components, Math. Comp., 27 (1973) 455-462, or Alan H. 
Stein, An application of Dirichlet series in several complex variables, Comm. Pure 
Appl. Math., 28 (1975) 303-321. 
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REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


The Emperor’s New Mind. By Roger Penrose, Oxford University Press, New York, 
1989, xiii + 466 pp. 


MICHAEL BARR 
Department of Mathematics and Statistics, McGill University, Montreal, Quebec, Canada H3A 2K6 


This is a fascinating book, one that I recommend every mathematically sophisti- 
cated person read. I am a bit surprised by the number of people who have bought 
it, enough to put it on the bestseller list for several months. It certainly refutes the 
assertion in Hawking’s A Brief History of Time that every formula contained in a 
book would reduce its sales by half. 

The purpose of this book is to demonstrate that the human mind cannot be 
modeled by a Turing machine or other such mechanical device. The argument falls 
into two parts. The first part attempts to show that a human can best any machine 
by solving a problem or by proving a theorem that the machine cannot. If, in 
particular, the machine purported to be a model of that person’s reasoning, there 
would be a contradiction. The more speculative second part describes how quan- 
tum theory provides a possible explanation for the first part. 

This second part includes the most lucid exposition of quantum theory that I 
have ever read. The most lucid, but in the end not lucid at all. In fact, I am 
beginning to think that the phrase “lucid explanation of quantum theory”’ is simply 
an oxymoron. No matter: Penrose himself seems to agree. He certainly feels that 
there must be an explanation of reality of which quantum theory is simply the 
visible reflection. 

He also gives a very clear (and this time convincing) description of time’s arrow 
and the meaning of increasing entropy, as well as a description of how the universe 
began with minimum entropy. 

The debate over whether human reasoning can be captured by a machine has a 
long history. The opening salvo was fired by J. R. Lucas [1961], although Lucas 
does not claim it was original with him, but that it was in the air at the time and he 
decided to formalize it and answer all the objections. He didn’t answer quite all of 
them, a point to which I will return. 

A second active participant in, this debate has been J. R. Searle, who summa- 
rized his position in the January, 1990 Scientific American [Searle, 1990]. Searle 
posits the following axioms: 

Axiom 1. Computer programs are formal (syntactic). 

Axiom 2. Human minds have mental contents (semantics). 

Axiom 3. Syntax by itself is neither constitutive of nor sufficient for semantics. 
and, not surprisingly, concludes that human minds cannot be modeled by computer 
programs. 

His main argument for Axiom 3 is a bizarre scenario known as the Chinese 
room experiment. It is hard to see what this experiment would show if it were 
actually carried out except that a person in a room full of instructions written on 
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3-by-5 cards could simulate a computer. Essentially, the purpose of the thought 
experiment is to allow Searle to assert that a computer cannot be said to 
understand a natural language. 

This may be so; many people seem emotionally committed to this viewpoint. 
Many people in the last century (and a surprising number in this) are emotionally 
committed to the point of view that humans are not evolved from what they see as 
the lower animals. Theories of human exceptionality are rampant, but one should 
not assert them as axioms and expect to be taken seriously. 

On the other hand, the outcome might surprise Searle, who doesn’t appear to 
be aware of the dangers of supposing he knows what would happen in an 
experiment he hasn’t done. Perhaps the man in the room, in the process of 
carrying out the experiment, would actually learn written Chinese purely from the 
syntax. 

Let me offer a couple of arguments as to why there may, in fact, be no 
distinction between syntax and semantics. These arguments are not claimed to be 
conclusive; they only show that it is possible to hold the opposite point of view. 

As any elementary school teacher will tell you, the only way to teach the 
meaning of a new word is to give several examples of its use in context. Dictionary 
definitions that don’t show how the word is used are insufficient, and, once 
samples of use in enough contexts are given, turn out to be unnecessary. The 
extensive use of citations by the Oxford English Dictionary is another illustration 
of this fact. 

But even more convincing evidence (to me at least) comes from my mathemati- 
cal experience. In showing that every first order theory either has a model or can 
be demonstrated to be inconsistent, one uses the very sentences of the theory to 
build the model. In other words, there are always syntactic models and these are 
often the first ones one sees. Of course, one is happier with semantic models 
because one understands them better. But one understands them better because 
they are built from models of a theory that one understands better. The models of 
the earlier theory are either syntactic models of that theory or are derived, in their 
turn, from some more primitive theory. Follow it all the way down and it seems 
that all these semantic models may ultimately be based on syntax. 

This doesn’t prove that Searle is wrong, but his argument is seriously incom- 
plete. And, as a matter of fact, Penrose does not consider that Searle’s argument is 
conclusive, although he feels it has considerable force. 

Lucas’ argument is based on Gédel’s theorem. It is identical in all relevant 
aspects to Penrose’s, which I, will discuss in considerable detail below. Oddly 
enough, Penrose fails to mention this fact, although he is aware of Lucas’ article, 
referring to it in an end note to the last chapter, where he says, “See Lucas (1961) 
for an argument that Gédel’s theorem implies non-computability and Good (1969), 
Benacerraf (1967) and Bowie (1982) for various counter-arguments.” He also 
presents no discussion of the counter-arguments, which is even more surprising, 
since they refute his main argument as well. 

In the library of the university that I visited, I found only Lucas’ original article 
and one of the three refutations, that of Bowie [1982]. Since that is completely 
convincing, I will reproduce it in some detail. 

Penrose bases his conclusions on two arguments, one of which is the Lucas 
argument. Before turning to that one, let us look at the one based on the 
undecidability of the “halting problem” for a Turing machine (or computer). 
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The argument begins on page 64 in a section titled, ““How to outdo an 
algorithm”’ and the title is almost self-explanatory. To be perfectly definite, by a 
Turing machine 7 we will mean a machine (computer program, whatever) that 
takes as input a natural number n and either produces as output a natural number 
T(n) or else runs forever, in which case we follow Penrose and write T(n) = O. 

It is well known that it is undecidable in general whether a Turing machine, 
given an input, will ever stop computing. For some machines, for some inputs, it is 
decidable, but there can be no general procedure that will always decide this 
question. No matter what decision procedure your algorithm uses, there is always 
some machine and input on which it will fail. 

On the other hand, it is easy to imagine a machine H that takes as input Turing 
machines T (or you could use descriptions of Turing machines or even just Gédel 
numberings of such machines) and behaves as follows: 


1 if T stops on input m 


A(T,m) = 
( ) ‘Oo Oc ottherwise. 


This does not follow our definition of Turing machine because it takes as input a 
machine-natural number pair, but it can easily be coded to be a Turing machine. 
An example of such a machine is the one that, given input m, simulates the 
running of T on m and outputs 1 if T stops (and runs forever if T doesn’t stop). 
Such a program H has the property that if H(T, m) = 0, then T will run forever 
on input m. On the other hand, it might happen that T runs forever on input m 
and so does H(T,m). Since we cannot know if H is going to stop, we get no 
information on the pair (T, m) in-that case. 

Suppose now that we have such a program H and that {T,}, n © N is an 
enumeration of the Turing machines. The function f defined by 


f(n) =1+ T,(n)H(T,, 2) 


(and f(n) = C if the right-hand side is undefined) can be computed by some 
Turing machine so there is a k for which f(n) = T,(n). 

Now Penrose argues that it is impossible for H(7T,,k) = 1, for that gives an 
outright contradiction. It is also impossible that H(T,, k) = 0, for that implies that 
T,(k) = f(k) = O, while the right-hand side is 1. This is actually the case only if 
you have defined multiplication in such a way that 00 = 0. This is not the way 
most actual compilers arrange things (at the very least, it will depend on the order 
of evaluation), but it is perfectly possible to implement this kind of multiplication 
and I am willing to suppose that this is what Penrose had in mind. He now 
continues with the argument that although H(T,,k) = 0 and so cannot decide if 
T,(k) terminates, we can see that it is impossible for T,(k) to terminate because 
then AH(T,,k) = 1, a contradiction. Thus we can decide a question that the 
algorithm H cannot and therefore we can beat any algorithm. (He isn’t claiming 
that we can beat every algorithm simultaneously, but that claim isn’t required for 
his argument.) 

Where is the flaw? It is simply this: the whole chain of reasoning is based on the 
fact that H behaves as described above and that we can prove it. For it is knowing 
how H behaves that allows us to find a contradiction and to conclude that 7, runs 
forever. If H were an algorithm that did behave in this way, but for which that 
fact were unprovable, then the 7, constructed would still have the property that 
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H(T,, k) runs forever and so H cannot detect whether or not T,(k) is defined and 
neither can we. For our knowledge of the behavior of H(T,, k) is predicated on the 
partially effective nature of H and if we cannot know that, then we may not know 
the behavior of T, either. 

Let me now turn to the Penrose/Lucas argument from Gédel’s theorem. In 
fact, the point at issue is really very close to the computational argument above. 
Gédel showed that for every set of axioms which is sufficiently strong for arith- 
metic there is what is now called a Gédel statement: a statement that is true in 
ordinary arithmetic but that cannot be proven from those axioms. Moreover, it is 
even possible to give an algorithm to construct such a sentence, given a set of 
axioms, much as 7, is constructed above to confound the machine H. But now, 
Penrose and Lucas argue, a person can readily see that this proposition is true 
since it was constructed to be true but unprovable in this system. Therefore there 
can be no axiomatization of the human thought processes. 

It is a little harder to see what is wrong with this process. If we begin with a 
standard set of axioms, such as Peano arithmetic, then there is a Gddelian 
sentence and no one can doubt that the argument is correct and, assuming that 
those axioms are consistent (an important qualification as we will see), the 
sentence is true and is not provable from these axioms. But suppose we begin with 
a set of axioms for human thought (assuming such possible, which is what Penrose 
wants to refute). Or to be more precise, axioms for the thought processes of a 
single human being. It is not so clear that these axioms are consistent. In fact, it 
might be more reasonable to suppose the contrary. The Gdédel sentence can be 
constructed as before, but now there is no reason to believe it true. 

In a nutshell Bowie’s argument is this: “Lucas must produce a Gédel sentence 
for any axiom set (and not only those for which he can prove consistency). If he 
believes all these Gédel sentences are true, then he is inconsistent.” Later he adds, 
“Of course he would also be saved from inconsistency if, without a procedure, he 
produced Gédel sentences on just those occasions when he is challenged with a 
consistent machine, withholding them otherwise. But I, for one, do not believe he 
can do this.”” Nor do I believe that Penrose can. 

You cannot escape this trap by sticking to axiom systems that can be proved 
consistent within themselves. For one thing, no such axiom system is strong enough 
to do arithmetic. But even if you lower your sights to relative consistency (relative 
to Peano arithmetic, say), it is still the case that you cannot prove consistency for 
any system as smart as yourself. 

One way to summarize this argument is to point out that if I suppose that my 
reasoning is axiomatized by some set A of axioms, then I can still construct a 
Gédel sentence for A, but unless I can prove that A is consistent, I have no reason 
to believe that sentence is true. But if A is strong enough to do Peano arithmetic, 
then, according to another Gédel theorem, I must use a stronger axiom set than A 
to show that A is consistent. Thus J have no way of knowing if this Gédel 
sentence is true and so the contradiction that Lucas/Penrose hope to derive 
disappears. On page 116, Penrose dismisses the undecidability of consistency, 
which he describes as, “historically perhaps the most important part of his 
[Gédel’s] argument”, as irrelevant to the purposes of his book. To the contrary, 
this undecidability is the key to the unraveling of his argument. 

On page 110, there is a curious statement, “The insight whereby we concluded 
that the Gédel proposition P,(k) is actually a true statement of arithmetic is an 
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example of a general type of procedure known to logicians as a reflection principle: 
thus by ‘reflecting’ upon the meaning of the axiom system and rules of procedure, 
and convincing oneself that these indeed provide valid ways of arriving at mathe- 
matical truths, one may be able to code this insight into further true mathematical 
statements that were not deducible from these very axioms and rules.” This is 
nothing less than a chain that logicians use their reflections and insights, rather 
than formal reasoning to prove their theorems. This is an unbelievable howler of 
an error that is a libel on the world of logicians. There are subjective aspects in the 
choices of which sets of axioms you accept and the reflection principle (which is, in 
fact, a rather esoteric axiom of set theory that says that certain properties of very 
large infinite cardinal numbers have “reflections” in properties of more ordinary 
cardinals) is just as objective as any other axiom of logic. 

One of the striking things about all these arguments, including Searle’s, is that 
they are completely formal. It is possible to imagine a planet full of robots that we 
have programmed to study philosophy and logic, using some heuristic procedures 
that are even now being worked on, to have discovered all these arguments and to 
be drawing exactly the same conclusions. Except that if their programming were 
actually consistent, they would see the flaws in the arguments and would not 
thereby conclude that their own thought processes could not be modeled by a 
computer. 

None of this refutes Penrose’s main contention. It is entirely possible that 
human thought may not be modeled by a Turing machine. But nothing in the book 
establishes it either. As far as I am concerned, it is an empirical question that 
cannot be settled by any amount of armchair philosophizing, nor by any purely 
formal reasoning. 

One thing not discussed is the possibility that a machine equipped with a 
random number generator might model a human mind. I don’t regard this as a 
serious argument against Penrose’s main thesis because such a hybrid is no longer 
a Turing machine. 

There is an enormous amount of valuable material in this book. There is the 
quantum theory mentioned above and the discussion of Time’s arrow. There is also 
a considerable amount of good mathematical exposition. For example, the discus- 
sion of both Turing machines and Gédel’s theorem are probably the best elemen- 
tary discussion of these matters I have seen. There is a good introduction to 
lambda calculus, to the Mandelbrot set and much, much more. The only real 
criticism is that there is no valid evidence brought to bear on its main thesis. 
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AMS, 1990, xix + 762 pp, $215. [ISBN: 0-8218-1131- 
2] Proceedings of an AMS-SIAM Summer Seminar 
held July 18-29, 1988 at Colorado State University in 
Fort Collins. The coverage is quite broad, including 
topics from complementarity theory and differential 
equations. SM 


Functional Analysis, T(17: 1). Essential Results 
of Functional Analysis. Robert J. Zimmer. Lect. in 
Math. Ser. University of Chicago Pr, 1990, ix + 157 
pp, $14.95 (P); $34.95. [ISBN: 0-226-98338-2; 0-226- 
98337-4] A concise and easy-to-read presentation of 
fundamental results in functional analysis. Variety 
of topics including convexity, fixed point theorems, 
compact operators, compact groups, spectral theory, 
ergodic theory, C*-algebras, Fourier transforms, and 
many more. Exercise sets follow each chapter. MLR 
Analysis, T(15-17), S, P, L. Special Functions. 
Z.X. Wang, D.R. Guo. ‘Transl: D.R. Guo, X.J. 
Xia. World Scientific, 1989, xvi + 695 pp, $75; $38 
(P). [ISBN: 9971-50-659-9; 9971-50-667-X] A com- 
pendium of the various principal special functions in 
common use and their basic properties and deriva- 
tions. Some of the less important formulas are put 
into the exercises, together with relevant hints for 
the more difficult ones. KS 


Algebraic Geometry, P. Néron Models. Siegfried 
Bosch, Werner Litkebohmert, Michel Raynaud. Ser. 
of Modern Surveys in Math., Band 21. Springer- 
Verlag, 1990, x + 325 pp, $85.50. (ISBN: 0-387- 
50587-3] The definitive treatment of Néron models 
in the setting of schemes. The authors provide moti- 
vation for the theory and some background material 
from algebraic geometry. Other topics include bira- 
tional group laws, descent, the Picard functor, and 
Jacobians. SG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
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matics-1420: Real Analytic and Algebraic Geometry. 
Eds: M. Galbiati, A. Tognoli. Springer-Verlag, 1990, 
366 pp, $35.90 (P). [ISBN: 0-387-52313-8] A col- 
lection of twenty-one research papers presented at a 
1988 conference in Trento. SG 


Differential Geometry, P. Geometric and Topo- 
logical Invariants of Elliptic Operators. Ed: Jerome 
Kaminker. Contemporary Math., V. 105. AMS, 
1990, xi + 297 pp, $37 (P). [ISBN: 0-8218-5112-8] 
Proceedings of a conference held July 1988 at Bow- 
doin College. Contains twelve papers. MLR 


Differential Geometry, T?, S*(16-17). Solid 
Shape. Jan J. Koenderink. MIT Pr, 1990, xiv + 699 
pp, $65. [ISBN: 0-262-11139-X] A largely expository 
text on the differential geometry of submanifolds of 
three-dimensional space, written from the perspec- 
tive of an engineer “hooked on differential geometry.” 
Probably not ideal for a main text (no exercises, few 
computations, often vague definitions and theorem 
statements), but contains some marvelous intuition 
that makes it well worth reading. JO 


Differential Geometry, T(17-18: 1, 2). Mod- 
ern Differential Geometry for Physicists. Chris J. 
Isham. Lect. Notes in Physics, V. 32. World Scien- 
tific, 1989, vii + 182 pp, $42; $22 (P). [ISBN: 9971- 
50-956-3; 9971-50-957-1] Lecture notes from a first- 
year course for physics graduate students. Chapters 
on manifolds, Lie groups, and fibre bundles. Author 
describes exposition as “pithy,” and in need of sup- 
plementary discussion. He should be heeded. JO 


Geometry, T?(16-18), S?, P. Fractal Geome- 
try: Mathematical Foundations and Applications. 
Kenneth Falconer. Wiley, 1990, xxii + 288 pp, 
$33.95. [ISBN: 0-471-92287-0] Begins with various 
notions of dimension and methods for their calcula- 
tion. Foundational part concludes by examining ge- 
ometrical properties of fractals. The second part, on 
examples and applications, includes self-similar and 
self-affine sets, examples from number theory and 
pure mathematics, dynamical systems, Julia sets, 
random fractals, and Brownian motion. Aimed at re- 
searchers, this book contains exercises and thus may 
also be useful as a course text. KS 


Algebraic Topology, P. Braid Group, Knot The- 
ory and Statistical Mechanics. Eds: C.N. Yang, 
M.L. Ge. Adv. Ser. in Math. Physics, V. 9. 
World Scientific, 1989, vii + 329 pp, $68; $38 (P). 
[ISBN: 9971-50-828-1; 9971-50-833-8] Nine articles 
on these three topics illustrating the fascinating in- 
terplay among them. In particular, one pdint of fo- 
cus is the role of the Yang-Baxter equations as the 
key to solvability in each of these areas. A warn- 
ing: the typesetting in several articles is somewhat 
primitive. MPR 


Operations Research, P, L. Twenty-five Years 
of Operations Research in the Netherlands: Papers 
Dedicated to Gijs de Leve. Eds: Jan Karel Lenstra, 
Henk Tijms, Ton Volgenant. CWI Tracts, V. 70. 
Mathematisch Centrum, 1989, 181 pp, Dfl. 48 (P). 
[ISBN: 90-6196-385-0] A collection of sixteen pa- 
pers contributed by current and former students and 
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colleagues of Gijs de Leve on the occasion of his silver 
jubilee as Professor at the University of Amsterdam. 
A wide variety of topics are represented, with most 
papers dealing with some aspect of either a Markov 
process or optimization on a network. SM 


Dynamical Systems, P. Bifurcation and Chaos in 
Simple Dynamical Systems. J. Awrejcewicz. World 
Scientific, 1989, vill + 126 pp, $26. [ISBN: 9810- 
200382] Collection of the author’s results, including 
an analytical method of solving the Hopf bifurcation 
problem, a numerical method for studying bifurca- 
tion behavior of nonlinear differential equations, and 
an analytical condition for special types of Hopf bi- 
furcations. JO 

Dynamical Systems, T*(16: 2), L. Chaotic Dy- 
namics of Nonlinear Systems. S. Neil Rasband. Wi- 
ley, 1990, x + 230 pp, $39.95. [ISBN: 0-471-63418-2] 
A very nice text intended primarily for physics stu- 
dents interested in the mathematics of chaos. Most 
of the “classical” topics are represented, e.g., one- 
dimensional maps, universality, fractal dimension, 
differential dynamics, conservative dynamics, and 
measures of chaos. The emphasis throughout is on 
numerical calculations (on PC’s) to illustrate the key 
ideas and theorems. Plenty of exercises and exam- 
ples. Also includes an extensive bibliography and a 
glossary (very useful!). MPR 

Dynamical Systems, S(16-18), P, L. Elemen- 
tary Symbolic Dynamics and Chaos in Dissipative 
Systems. Hao Bai-lin. World Scientific, 1989, xv + 
460 pp, $78; $38 (P). [ISBN: 9971-50-682-3; 9971-50- 
698-X] A physics-oriented introduction to dynamical 
systems, emphasizing symbolic dynamics. Lots of 
detailed examples. BC 


Control Theory, P. Optimal Radar Tracking Sys- 
tems. George Biernson. Wiley, 1990, xiv + 594 
pp, $64.95. (ISBN: 0-471-50673-7] A detailed en- 
gineering study of the Altair Radar System located 
on the Kwajalein Atoll. The emphasis is on a practi- 
cal application of optimal control theory, particularly 
Kalman filtering. The presentation should be of in- 
terest to any engineer or mathematician concerned 
with any type of target tracking. SM 


Control Theory, T(18: 1, 2), P, L. Nonlinear 
Dynamical Control Systems. Henk Nijmeijer, Arjan 
van der Schaft. Springer-Verlag, 1990, xiii + 467 pp, 
$49.95. [ISBN: 0-387-97234-X] A graduate-level text- 
book applying geometric control theory to nonlinear 
systems. Many examples of both discrete and contin- 
uous time systems. Also included is a self-contained 
discussion of the necessary differential geometry. The 
book is well-written with plenty of exercises. SM 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-141: Root Clustering in Pa- 
rameter Space. S. Gutman. Springer-Verlag, 1990, 
viii + 153 pp, $27.10 (P). [ISBN: 0-387-52361-8] 
Certain control problems require the eigenvalues of 
a matrix to “cluster” in a given part of the complex 
plane. A theoretic treatment, with lots of detailed 
examples. BC 


Elementary Statistics, T*(13-14: 1, 2). Ele- 


1990] 


mentary Statistics in a World of Applications, Third 
Edition. Ramakant Khazanie. Scott Foresman, 
1990, 716 pp, $32.95. [ISBN: 0-673-16751-8] Revi- 
sion of the author’s 1986 Second Edition (First Edi- 
tion, TR, April 1979; Second Edition, TR, Novem- 
ber 1986). Most significant change is the inclusion of 
Minitab labs at the end of each chapter with detailed 
explanations of the commands. New exercises based 
on real data and some new material (e.g., box plots, 
Bayes’ theorem, sampling techniques) have also been 
added, and review exercises have replaced the chap- 
ter tests. RSK 


Elementary Statistics, T? Statistics in Scientific 
Investigation: Its Basis, Application, and Interpreta- 
tton. Glen McPherson. Texts in Statistics. Springer- 
Verlag, 1990, xxvi + 666 pp, $49.95. [ISBN: 0-387- 
97137-8] Applied statistics textbook assuming as 
pre-requisites high school algebra and understanding 
of scientific modelling. The book includes much dis- 
cussion about statistics but models few illuminating 
examples. Ill-suited for those with just an algebra 
background and no scientific experience. TH 


Elementary Statistics, T(13-16: 1), L. Under- 
standing Statistics in the Social Sciences. William 
Ray Arney. WH Freeman, 1990, xvii + 524 pp, 
$19.95 (P). [ISBN: 0-7167-2006-X] Delightful intro- 
duction to the concepts and language of statistics 
with a bit of history interwoven. Covers basic prob- 
ability, descriptive statistics, hypothesis testing (in- 
cluding tests for categorical data, and one- and two- 
way ANOVA), point and interval estimation, mea- 
sures of association, and linear regression models. 
Good exercises-and illustrative examples. RWJ 


Computational Statistics, P. MAXDATA: A 
Time Series Database System. Wilhelm A. Hen- 
nerkes. Springer-Verlag, 1990, xiii + 150 pp, $35 
(P). [ISBN: 0-387-52209-3] Handbook describing 
the MAXDATA database system, designed to be a 
user-friendly software package for personal comput- 
ers for the analysis and management of large eco- 
nomic data sets. RSK 

Statistics, T(15-17: 1), S, L. Finding Groups in 
Data: An Introduction to Cluster Analysis. Leonard 
Kaufman, Peter J. Rousseeuw. Ser. in Prob. & 
Math. Stat. Wiley, 1990, xiv + 342 pp, $49.95. 
[ISBN: 0-471-87876-6] Describes some robust parti- 
tional and hierarchical (divisive and agglomerative) 
clustering methods of general applicability. Prefaced 
by guidelines for the choice of a method and instruc- 
tions on how to prepare data for clustering. Includes 
detailed instruction on how to use the author’s For- 
tran software, computer output of several examples, 
and about 80 exercises. RWJ 


Statistics, P. Lecture Notes in Statistics-62: Test- 
ing Problems with Linear or Angular Inequality Con- 
straints. Johan C. Akkerboom. Springer-Verlag, 
1990, xii + 291 pp, $35.90 (P). [ISBN: 0-387-97232-3] 
This monograph is the published doctoral thesis of 
the author. It presents a likelihood ratio method for 
testing problems with an alternative hypothesis re- 
stricted by linear inequality constraints. The mono- 
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graph focuses on the practical application of Pincus’s 
idea to replace the linear inequalities with one angu- 
lar inequality, a circular cone, and then to apply a 
likelihood ratio test. JJ 


Statistics, T(18: 1) Accelerated Testing: Statisti- 
cal Models, Test Plans, and Data Analyses. Wayne 
Nelson. Ser. in Prob. & Math. Stat. Wiley, 1990, 
xiv + 601 pp, $69.95. [ISBN: 0-471-52277-5] Ex- 
cellent applied practical text on accelerated life (re- 
liability) testing, the process of testing products at 
higher than normal stress or usage levels in order to 
reduce test time, and then relating the accelerated 
failure distributions back to normal time/usage lev- 
els. Text contains many real data sets and specific 
examples analyzing these data. The newest devel- 
opments in the accelerated testing field are covered. 
Analysis methods from simple graphical techniques 
to advanced computer programs are presented. Life- 
stress models include inverse power, Eyring’s, Ar- 
rhenius’, Zhurkov’s. Complete data analyzed with 
least squares, censored data with maximum likeli- 
hood methods. Test plans are discussed, as are mod- 
els for handling data with varying stress. Product 
data analyzed from electronics, insulations, semicon- 
ductors, plastics, metals. JJ 


Statistics, S(18), P. Spline Models for Observa- 
ttonal Data. Grace Wahba. CBMS-NSF Reg. Conf. 
Ser. in Appl. Math., V. 59. SIAM, 1990, xii + 169 
pp, $24.75 (P). [ISBN: 089871-244-0] Monograph 
from a 1987 lecture series held in Columbus, Ohio. 
Concentrates on work by the author and associates. 
Develops univariate smoothing splines, thin plate 
splines, splines on the sphere, etc., using reproducing 
kernel Hilbert space theory. Convergence properties, 
confidence intervals, parameter selection. References 
to numerical methods and software. TH 


Statistics, P. Lecture Notes in Statistics-63: Es- 
tumation tn Semiparametric Models: Some Recent 
Developments. Johann Pfanzagl. Springer-Verlag, 
1990, 112 pp, $17.60 (P). [ISBN: 0-387-97238-2] 
Constructs asymptotically optimal sequences of esti- 
mates for functionals of distributions having a finite 
dimensional parameter and which admit a sufficient 
statistic for the nuisance parameter. RWJ 


Statistics, T*(16-17: 1), P, L. Categorical Data 
Analysis. Alan Agresti. Ser. in Prob. & Math. Stat. 
Wiley, 1990, xv + 558 pp, $49.95. [ISBN: 0-471- 
85301-1] Divided into four main parts: descriptive 
and inferential methods for two-way contingency ta- 
bles; basics of model building, emphasizing loglinear 
and logit models, for two-way and multi-way tables; 
applications and generalizations of these models; and 
the theoretical foundations for modeling categorical 
data. Very good problem sets, both theoretical and 
applied. Extensive set of references. RSK 


Statistics, P. Distributions of Test Statcstics: 
Ezact & Asymptotic, Null & Non-Null, Meth- 
ods/Comparisons/Research Frontiers. Ed: A.M. 
Mathai. Amer. Ser. in Math. & Management Sci., 
V. 18. American Sciences Pr, 1989, 182 pp, $89.75 
(P). [ISBN: 0-935950-20-6] Collection of eight re- 
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view articles concerned with tests of hypotheses on 
one or more multivariate normal populations. Cen- 
ters on distributions connected with likelihood ratio 
test statistics. (Also issued as a special issue of the 
American Journal of Mathematical and Management 
Sciences, Volume 9, Nos. 1 and 2.) Note price! RSK 


Artificial Intelligence, T(17-18: 2), S, P, L. 
Probabilistic Reasoning in Ezpert Systems: Theory 
and Algorithms. Richard E. Neapolitan. Wiley, 
1990, xiii + 433 pp, $44.95. [ISBN: 0-471-61840- 
3] Graduate-level text on the use of probability and 
graph theory (networks) in expert systems. Includes 
exercises and examples. BC 


Artificial Intelligence, S, P. Lecture Notes in Ar- 
tificial Intelligence-419: A Methodology for Uncer- 
tainty in Knowledge-Based Systems. Kurt Weichsel- 
berger, Sigrid Pohlmann. Springer-Verlag, 1990, 132 
pp, $16.50 (P). [ISBN: 0-387-52336-7] In the area 
of expert systems, it is important for the system de- 
signer to be able to represent and manipulate incom- 
plete, missing, or uncertain information. This text 
reviews the current research in the area of artificial 
intelligence, and presents one specific methodology 
for dealing with this problem, based on probability 
theory. The authors then give an example of how 
their theory can be applied to expert system in the 
area of diagnostic medicine. GMS 


Computer Science, T(16-17: 1). Data Analysis 
for Data Base Design, Second Edition. D.R. Howe. 
Edward Arnold (US Distr: Routledge, Chapman & 
Hall), 1989, xiii + 317 pp, $27.50 (P). [ISBN: 0-7131- 
3688-X] This edition is designed to make data anal- 
ysis techniques more readily available to students of 
systems analysis. The most significant change from 
the previous edition is the addition of a chapter on 
the SQL relational data base language. The book 
is divided into four parts: data base management 
systems, relational modelling, entity-relational mod- 
elling, implementation. Case studies are used, and 
problems and assignments given. JJ 


Applications, P, L. Environmental Effects on Bus 
Durability. Sandra L. Arlinghaus, John D. Nystuen. 
Monograph, No. 11. Institute of Mathematical Ge- 
ography, 1989, vi + 104 pp, $15.95 (P). This mono- 
graph considers the effects of climate, terrain, con- 
gestion, and the combination of all three factors on 
the durability of buses serving public transit agen- 
cies in the United States. For each factory the work 
discusses, in detail, how the almost 200 cities studied 
are classified into “peer” groups, and what implica- 
tions these classifications have for maintenance of the 
bus fleet. An interesting application of statistical and 
geographic data analysis. SM 


Applications (Behavioral Science), P, L*. The 
Anthropology of Numbers. Thomas Crump. Stud. in 
Social Anthropology. Cambridge University Pr, 
1990, x + 200 pp, $49.50. [ISBN: 0-521-38045- 
6] The first scholarly work to examine the role 
of numbers (including numeracy and numerology) 
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from a broad cultural perspective. Detailed logi- 
cal, psychological, and linguistic foundations pro- 
vide a basis for an exploration of the role numbers 
play in various cultures’ cosmology, economy, mu- 
sic, games, politics, architecture, and language. Pro- 
fusely illustrated with examples from societies past 
and present—especially from the Far East where the 
author lived for several years. Extensive end notes; 
thorough bibliography; comprehensive index. A mar- 
velous resource for anyone interested in the role of 
mathematics in culture. LAS 


Applications (Communication Theory), S(17- 
18), P. Time Series Analysis, Identification and 
Adaptive Filtering, Second Edition. Daniel Graupe. 
Robert E Krieger, 1989, xvii + 420 pp, $49.50. 
[ISBN: 0-89464-315-0] Self-contained text on appli- 
cations of time series analysis to signal identification 
and adaptive filtering of unknown signals from un- 
known noise. This edition adds a chapter on time 
series with time-varying parameters. (First Edi- 
tion, TR, February 1985.) BC 


Applications (Physics), L. Fractal Growth Phe- 
nomena. Tamas Vicsek. World Scientific, 1989, xi + 
355 pp, $67. [ISBN: 9971-50-442-X] Basic concepts 
in the study of fractal growth (concentrating on ge- 
ometric aspects). Part I serves as an introduction 
explaining the basics of fractal geometry: fractal di- 
mension, measure, and many examples (experimen- 
tal and computer generated). Parts II and III apply 
fractal geometry to the physics of growth phenom- 
ena. MLR 

Applications (Social Science), T, S(15-17), L. 
Understanding Quantitative History. New Liberal 
Arts Ser. Loren Haskins, Kirk Jeffrey. MIT Pr, 1990, 
xxv + 366 pp, $27.50. [ISBN: 0-262-08190-3] Case 
studies from published historical research (primarily 
American history) organized in chapters to introduce 
prospective historians and students of other social 
sciences to basic tools of descriptive statistics: ta- 
bles, charts, graphs, means and medians, samples, 
standard deviation, scatter charts, and regression. 
Requires only first year high school algebra; uses 
no formulas. Numerous exercise questions invite the 
reader to analyze the case studies with care. (An- 
swers to all questions are provided at the end of each 
chapter.) One of the Sloan “New Liberal Arts” text- 
books intended to infuse greater attention to technol- 
ogy and applied mathematics into traditional liberal 
arts disciplines. LAS 


Reviewers 


SB: Steve Benson, St. Olaf; BC: Barry Cipra, St. Olaf; SG: 
Steven Galovich, Carleton; GG: George Gilbert, St. Olaf; RH: 
Rhonda Hatcher, St. Olaf; TH: Timothy Hesterberg, St. Olaf; 
RWJ: Roger W. Johnson, Carleton; JJ: Jason Jones, St. Olaf; 
RSK: Richard S. Kleber, St. Olaf; LCL: Loren C. Larson, 
St. Olaf; SM: Steve McKelvey, St. Olaf; JO: Jeff Ondich, 
St. Olaf; MLR: Margaret L. Reese, St. Olaf; MPR: Matthew P. 
Richey, St. Olaf; KS: Karen Saxe, St. Olaf; GMS: G. Michael 
Schneider, Macalester; KES: Kay E. Smith, St. Olaf; LAS: 
Lynn Arthur Steen, St. Olaf; PZ: Paul Zorn, St. Olaf. 
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Report of the Conference/Workshop to Develop Curriculum 
and Teaching Methods for Calculus at the College Level, 
Ronald G. Douglas, Editor. 

MAA Notes #6 

249 pp., 1987, Paperbound, ISBN-0-88385-056-7 

Catalog Number - NTE-07 


Price: $12.50 


Should calculus be taught differently? Can it? Common wisdom says 
‘*no’’—which topics are taught, and when, are dictated by the logic of the 
subject and by client department. The surprising answer from a four-day 
Sloan Foundation-sponsored conference on calculus instruction, chaired by 
Ronald Douglas, is that significant change is possible, desirable, and neces- 
sary. Meeting at Tulane University in New Orleans in January, 1986, a diverse 
and sometimes contentious group of twenty-five faculty, university and foun- 
dation administrators, and scientists from client departments, put aside their 
differences to call for a leaner, livelier, more contemporary course, more 
sharply focused on calculus’s central ideas and on its role as the language 
of science. 

This volume contains the results of that conference and the papers pre- 
sented to the conferees. These are certain to be the point of departure and 
basis for efforts to strengthen and reshape calculus in the next decade. 
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Exponential Sums Automorphic 
and Differential Representations of 
Equations Unitary Groups in 
Nicholas M. Katz Three Variables 


This book is concerned with the theory of linear 


differential equations in one complex variable with Jonathan D. Rogawski 


polynomial coefficients, and the theory of one par- The purpose of this book is to develop the stable 
ameter families of exponential sums over finite trace formula for unitary groups in three variables. 
fields. After reviewing results from representation The stable trace formula is then applied to obtain a 
theory, the book discusses results about differential classification of automorphic representations. This 
equations and their differential galois groups (Gg) work represents the first case in which the stable 
and one-parameter families of exponential sums trace formula has been worked out beyond the case 
and their geometric monodromy groups (Grong): of SL (2) and related groups. Many phenomena 
The final part of the book is devoted to comparison — which will appear in the general case present 
theorems relating Ggq and G,gqg Of Suitably “corre- themselves already for these unitary groups. 
sponding” situations, which provide a systematic Annals of Mathematics Studies, 123 

explanation of the remarkable “coincidences” Paper: $19.95 ISBN 0-691-08587-0 


found “by hand” in the hypergeometric case. Cloth: $60.00 ISBN 0-691-08586-2 
Annals of Mathematics Studies, 124 In Japan order from United Publishers Services 


Paper: $22.50 ISBN 0-691-08599-4 . . . 
Princeton University Press 


Cloth: $65.00 ISBN 0-691-08598-6 
In Japan order from United Publishers Services 
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GREAT MOMENTS IN 
MATHEMATICS BEFORE 1650 


Howard Eves 

This outstanding volume (along with its 
companion Volume 7) has become an MAA 
bestseller. It contains 23 fascinating math- 


ematical episodes from antiquity to about 
1650. 


270 pp., Paper, 1982, ISBN 0-88385-310-8 
List: $17.00 MAA Member: $13.00 
Catalog Number DEP-05 


GREAT MOMENTS IN 
MATHEMATICS AFTER 1650 


Howard Eves 


This is a companion volume to Volume 5. 
270 pp., Paper, 1982, ISBN 0-88385-311-6 
Paper List: $17.00 MAA Member: $13.00 
Catalog Number DEP-07 (Paper) 
Hardbound List : $27.00 

MAA Member: $20.00 

Catalog Number DOL-07 (Hardbound) 


Both books available in 
paperbound editions as a package: 
List: $31.50 MAA Member: $23.00 
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Howard Eves made a valuable contri- 
bution to the Dolciani Mathematical Ex- 
position series. . . The twenty lectures 
included are a delight to read. They 
place each "great moment” in its his- 
torical context and lay special empha- 
sis an human aspects of each achieve- 
ment. No algebra or geometry teacher 
should be without this book. 


Tom Walsh in The Mathematics Teacher 


He has the knack of bringing ancient 
mathematics alive, at the same time re- 
lating it to modern ideas and develop- 
ments. His work is readable and infor- 
mative and indeed bears reading a sec- 
ond time. His exposition is clear and 
he is not content with generalities, but 
gives actual examples of early meth- 
ods. 


E.H. Lockwood in 
The Mathematical Gazette 


ORDER FROM 
» The Mathematical Association 
f of America 


1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


J sin co un 8) a3 
[2 


rox + 1) 


Options Quit Remove Simplify Transfer moVe Window approX 
Enter option 
User D: EXAMPLE .MTH Free :97% Derive Algebra 


2000 years of mathematical knowledge on a disk 


DERIVE, the successor to muMATH, is a powerful computer algebra system 
for your PC compatible computer that provides the following capabilities: 

@ Exact and approximate arithmetic to thousands of digits . 
@ Equations, complex numbers, trigonometry, calculus, vectors, and matrice 
@ 2D and 3D function plotting with zooming capability 

@ MDA, CGA, EGA, VGA, and Hercules graphics and text support 

@ Attractive 2D mathematical display of formulas 

@ Easy to use menu-driven interface with on-line help 

@ Ideal for engineers, scientists, students and teachers 

H@ $200 plus shipping: Call or write for information. 


System requirements: IBM PC or compatible computer, MS-DOS version 2.1 or later, 512K 
memory, and a 54 inch (360K) or a 3172 inch (760K) diskette drive. Or NEC PC-9801 or 

compatible computer, MS-DOS version 2.1 or later, 512K memory, and a 54 inch (640K) 

diskette drive. 

DERIVE and muMATH are trademarks of Soft Warehouse, Inc. Hercules is a trademark of Hercules Computer Technology, Inc. 


IBM is a registered trademark of International Business Machines Corp. MS-DOS is a registered trademark of Microsoft Corp. 
NEC is a registered trademark of Nippon Electric Company. 


Soft Warchouse: 


3615 Harding Avenue, Suite 505 ¢ Honolulu, Hi 96816 
manseaean 
©1988 Soft Warehouse, Inc. 


(808) 734-5801 after noon PST 
Handcrafted software for the mind. 


Carus Mathematical Monograph, #19 
Field Theory and its Classical Problems, 


by Charles R. Hadlock 
323 + xvi pp. Hardbound 
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Field Theory and its Classical Problems is one of those rare monographs that will cap- 
ture and hold a mathematically prepared reader (one reasonably familiar with basic cal- 
culus and linear algebra) until the last page. Mathematicians will be intrigued by the 
development which places Galois Theory in an historical perspective and allows it to 
unfold from the emergence of the classical construction problems to the discovery of 
their solutions. The book presents some of history’s greatest and most elegant mathematics. 
Read what some reviewers say about this monograph: 

‘‘Hadlock has produced a pedagogic masterpiece . . . His idea of beginning 

with the three Greek problems and letting them fire the reader’s spark plugs 

1s brilliant . . . (his) ability to inject vitality and enthusiasm into mathematical 

text is remarkable. (George*Piranian, University of Michigan) 


‘‘In the preface, the author claims to have written the book for himself. . . 
Fortunately for us, he chose to share his work with the mathematical commu- 
nity. I suggested the book as collateral reading in a one-semester course in Galois 
Theory and the students found it very readable and most helpful in establish- 
ing a motivation and direction . . . This is a delightful book for both student 
and teacher.’’ (John D. Leadley, Mathematical Reviews) 
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The Department of Mathematics at Ohio State offers several options for graduate study: a broad 
Ph.D. degree program with specialization in nearly all branches of contemporary mathematics, 
pure and applied; an M.S. degree program with options in pure or applied mathematics, an 
elective focus in Actuarial Science, and a dual M.S. degree program with Computer Science; a 
content oriented M.A. degree program designed for secondary level teachers of mathematics. 
MS/PhD applicants must present scores from the GRE Advanced Subject test in Mathematics 
and should have at least a one year background in advanced calculus and linear/abstract algebra. 


Students are encouraged to begin their studies at Ohio State in June, 1991. A number of 
Summer Fellowships providing stipends ranging from $1400 — $2000 in addition to a waiver of 
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THE HISTORY OF MATHEMATICS 


Esther R. Phillips, Editor 


Esther Phillips has brought together a col- 
lection of articles showing the sweep of re- 
cent scholarship in the history of mathemat- 
ics. The material covers a wide range of 
current research topics: algebraic number 
theory, geometry, topology, logic, the rela- 
tionship between mathematics and comput- 
ing, partial differential equations, and alge- 
braic geometry. 


320 pp., 1987, ISBN 0-88385-1 28-8 
List: 36.50 MAA Member: $28.00 
Catalog Number MAS-26 


This is an excellent book! It is a 
very interesting and exciting book to 
read. The author does an extremely 
nice job of bringing together most, if 
not all, the mathematicians that were 
involved in a particular area of mathe- 
matics. The sources listed at the end of 
each section give the reader an oppor- 
tunity to look up other resources per- 
taining to the particular subjects, a fea- 
ture that is definitely lacking in many 
history books. The content of the book 
is choice. The professional mathemati- 
cian would definitely want to have a 
copy of this book. 
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next step along the way. Unlike other 
texts that use a “solutions technique” 
approach, Differential Equations: A 
Modeling Approach puts into per- 
spective how the various solutions 
techniques students learn actually fit 
together — the “why” as well as the 
“how-to.” 
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Martin Gardner has charmed readers for over fifty 
years with his delightful books and articles on sci- 
ence. He is best known for the popular column, 
“Mathematical Games,” which appeared in Scien- 
tific American for twenty-five years. Generations 
of scientists and mathematicians have been 
inspired by his writing and the MAA is proud to 
include his name in its list of authors. 


This book was drawn from Gardner’s column 
in Isaac Asimov’s Science Fiction Magazine. The 
riddles presented here incorporate the responses 
of his initial readers, along with additions sug- 
gested by the editors of the New Mathematical 
Library. Each chapter (riddle) poses a problem 
answered in the First Answers section. The solu- 
tion in turn raises another problem that is solved 
in the Second Answers section. This may suggest 
a third question and in several instances there is 
a fourth. Gardner draws us from questions to 
answers always presenting us with new riddles— 
some as yet unanswered. There are 125 different 
pieces altogether. 


Solving these riddles is not simply a matter of 
logic and calculation, although these play a role. 
Luck and inspiration are factors as well, so begin- 
ners and experts alike may profitably exercise their 
wits on Gardner’s problems, whose subjects range 
from geometry to word play to questions relating 
to physics and geology. 


We guarantee that you will solve some of the 
riddles, be stumped by others, and be amused by 
almost all of the stories and settings that Gardner 
has devised to raise these questions. 
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